We offer premium technology services for business.
DevsData LLC is a boutique software & recruitment agency, with Google-level engineers and a vast network of senior expert contractors.
Name the source and the data to extract, and we will design a solution tailored to your needs.
Our engineers have an in-depth understanding of complex databases and broad experience in processing them.
We can extract data from many challenging sources, including websites protected against scraping. To achieve such results, we use the most advanced tech solutions.
We designed and maintained several small scrapers for a company project on short notice. Our task was to extract the data as quickly as possible and filter it to obtain only the essential information. One of the projects was a Natural Language Processing scraping engine for a London-based hedge fund – it scraped and scored news articles based on precise criteria given by the client.
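As a rough illustration of scoring articles against client-supplied criteria, the sketch below uses weighted keywords. The weights, the sample headlines, and the `score_article` function are all hypothetical; the engine described above may have used very different NLP techniques.

```python
import re

def score_article(text, weights):
    """Return the average keyword weight per word (0.0 for empty text)."""
    words = re.findall(r"[a-z']+", text.lower())
    if not words:
        return 0.0
    # Words without an entry in `weights` contribute nothing to the score.
    return sum(weights.get(word, 0.0) for word in words) / len(words)

# Hypothetical criteria: positive weights mark terms of interest.
WEIGHTS = {"merger": 2.0, "acquisition": 2.0, "lawsuit": -1.0}

ARTICLES = {
    "deal": "Merger talks: the acquisition closes",
    "weather": "Sunny skies expected this weekend",
}

scores = {name: score_article(text, WEIGHTS) for name, text in ARTICLES.items()}
# Highest-scoring article first.
ranked = sorted(scores, key=scores.get, reverse=True)
```

Dividing by the word count keeps long articles from outranking short ones simply because they contain more words.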
We created a scraper for a US-based client. It had to use as few requests as possible to collect responses for 300 million SSNs from a protected form on the website. The obvious choice of scraping technology was a low-level request package.
The system ran on ten small machines on Google Cloud.
We worked on extracting data from Filmweb, the second-biggest movie database in the world. The scraper had to use as few requests as possible to collect all data about every movie and TV series on the website. The data was nested deep inside the HTML, so we used Beautiful Soup to extract the essential information and the relevant parts of each page.
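To show what pulling fields out of deeply nested HTML can look like, here is a minimal sketch. The project itself used Beautiful Soup; to stay dependency-free, this example swaps in Python's built-in `html.parser`. The markup and class names (`film-title`, `film-year`, `film-genre`) are invented for illustration and are not Filmweb's real structure. The sketch assumes well-nested markup.

```python
from html.parser import HTMLParser

# Tags that never have a closing tag, so they must not touch the stack.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}

class FieldExtractor(HTMLParser):
    """Collect the text of elements whose class names a wanted field."""

    def __init__(self, wanted):
        super().__init__()
        self.wanted = set(wanted)  # class names to capture (hypothetical)
        self.fields = {}           # class name -> extracted text
        self._stack = []           # one entry per open tag: field name or None

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        classes = (dict(attrs).get("class") or "").split()
        self._stack.append(next((c for c in classes if c in self.wanted), None))

    def handle_endtag(self, tag):
        if tag not in VOID_TAGS and self._stack:
            self._stack.pop()

    def handle_data(self, data):
        # Attribute text to the innermost enclosing wanted element, if any.
        field = next((f for f in reversed(self._stack) if f), None)
        text = data.strip()
        if field and text:
            self.fields[field] = (self.fields.get(field, "") + " " + text).strip()

PAGE = """
<div class="page"><div class="wrap"><section>
  <h1 class="film-title">Example Film</h1>
  <div class="meta"><span class="film-year">1999</span><br>
    <span class="film-genre">Drama</span></div>
</section></div></div>
"""

parser = FieldExtractor(["film-title", "film-year", "film-genre"])
parser.feed(PAGE)
movie = parser.fields
```

Parsing the whole page once and keeping only the wanted fields fits the goal of minimizing requests: everything needed is taken from a single response.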
We always make sure to be on the same page with our clients as we strongly believe that communication is the key to fruitful cooperation.
Most of our specialists work remotely from our European office. However, we are open to permanent, cross-border relocation of selected engineers. For longer projects, we usually start a full-time engagement with two weeks of onboarding on site at the client’s office.
We created a scraper that crawled Wikipedia to collect a data set of movies and television series and their casts. The biggest challenge in this project was that the website was unstructured, so links to other subpages could be located anywhere. Scrapy, which keeps track of visited subpages and schedules the pages still to visit, was the most efficient technology to use.
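The idea of remembering visited pages while scheduling new ones can be sketched in a few lines. This is a standard-library stand-in for the concept, not Scrapy code; `crawl`, `fetch_links`, and the in-memory `SITE` are hypothetical names used only for illustration.

```python
from collections import deque
from urllib.parse import urldefrag, urljoin

def crawl(start_url, fetch_links, max_pages=100):
    """Breadth-first crawl that schedules each URL at most once.

    `fetch_links` is a hypothetical callable that takes a URL and
    returns the raw href values found on that page.
    """
    seen = {start_url}          # every URL ever scheduled
    queue = deque([start_url])  # URLs waiting to be visited
    order = []                  # URLs in the order they were visited

    while queue and len(order) < max_pages:
        url = queue.popleft()
        order.append(url)
        for href in fetch_links(url):
            # Resolve relative links and drop #fragments so the same
            # page is not scheduled twice under different spellings.
            absolute, _fragment = urldefrag(urljoin(url, href))
            if absolute not in seen:
                seen.add(absolute)
                queue.append(absolute)
    return order

# A tiny in-memory "site" standing in for real HTTP responses.
SITE = {
    "https://example.org/wiki/Film_A": ["/wiki/Actor_X", "/wiki/Film_B#Cast"],
    "https://example.org/wiki/Film_B": ["/wiki/Actor_X", "/wiki/Film_A"],
    "https://example.org/wiki/Actor_X": ["/wiki/Film_A"],
}

visited = crawl("https://example.org/wiki/Film_A", lambda url: SITE.get(url, []))
```

Even though the pages link to each other in a cycle, each one is visited exactly once, which is the property that matters when links can appear anywhere on a page.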
We took part in maintaining and modifying the company’s scraping engine. It was responsible for collecting profile data about people and companies from about ten confidential sources. The data had been purchased before, so our task was to collect what was not yet available to buy or to extend the data the company already held.
Our client needed to collect data on clothing products, with the main focus being their categorization and prices. There were about 30 websites with varying depth of information and protection against scraping.
DevsData – a premium technology partner
DevsData is a boutique tech recruitment and software agency. Develop your software project with veteran engineers or scale up an in-house tech team with developers with relevant industry experience.
“DevsData LLC is truly exceptional – their backend developers are some of the best I’ve ever worked with.”
Nicholas Johnson
Mentor at YC,
Ex-Tesla engineer,
Serial entrepreneur
general@devsdata.com
“I interviewed about a dozen different firms. DevsData LLC is truly exceptional – their backend developers are some of the best I’ve ever worked with. I’ve worked with a lot of very well-qualified developers, locally in San Francisco, and remotely, so that is not a compliment I offer lightly. I appreciate their depth of knowledge and their ability to get things done quickly.”
Nicholas Johnson
CEO of Orange Charger LLC,
Ex-Tesla Engineer,
Mentor at YCombinator