Data Extraction Experts
8 billion pages crawled / month
300+ billion pages in total
Open Source - No Vendor Lock-In
40+ Open Source projects
29k+ GitHub stars for Scrapy
Loved by Developers & Customers
2000+ Companies as customers
Used by 1M+ developers
Founders Shane Evans and Pablo Hoffman met at a time when more and more companies needed web data to help power their business. With no external solutions available, they built their own data extraction software.
Scrapinghub was founded as a company whose services go beyond spider development: a holistic, scalable data extraction solution backed by a community of web scraping experts.
Customers wanted to run and manage their own spiders, so Scrapy Cloud was released: a hosting service designed specifically for running and managing web scraping spiders in the cloud.
Traditional proxy services weren’t giving our customers the reliability they needed to extract data at scale. At this point Crawlera was born, an endpoint proxy solution that automated proxy management.
To continue supporting the need for data, we created a tool for non-developers that lets our customers easily get data from a limited number of websites with no coding needed.
Some customers wanted access to only the data; they didn’t want to worry about spiders, proxies, QA, etc. So we launched our managed data services, where customers get the data when they need it.
We launched our dedicated enterprise account management service, in which we engage in deep partnerships with our enterprise customers to deliver full-service solutions with access to data quality assurance, enterprise service-level agreements, and maintenance plans. Customers also get full access to their project’s code, with training.
Every day over 2,000 companies (many of them in the Fortune 500) and 1 million developers rely on our tools and services to get the data they need to grow their businesses.
Our time-sensitive study involved compiling data on the 2015 Canadian federal election. Scrapinghub scraped about 50 websites daily over the period of a month. We weren’t familiar with many of the technical aspects of scraping but the Scrapinghub team patiently and promptly answered our questions. The data we needed were delivered on time and in a format that helped make our research project a success.
Scrapinghub's senior engineers built a secure and reliable solution for online multi-platform ticket bookings that enabled us to enter the market quickly. Their technology gave us a scalable platform that helped us achieve our business goals. It has been a great experience and I truly recommend working with them.
Hiring Scrapinghub and building our next-generation scraping system on open source Scrapy and Scrapyd are some of the best decisions we've made. Scrapy has been accurate, reliable, and easy to maintain, and the Scrapinghub people have been a joy to work with.
The expertise and resourcefulness of Gilberto (Engineer at Scrapinghub) have been fantastic as he has supported our service. He knows how to talk to those of us who aren't technical and he consistently produces high-quality results. We trust his recommendations completely due to his high integrity and superior service.
I was very pleased with both the finished project and with the way my questions were promptly answered. This is definitely the way to run a successful company. Thanks!
Scrapinghub isn’t your “typical” company. We know that talent doesn’t have borders, so we are a 100% remote company with people in every corner of the world. No matter where you are, there is a Shubber (what we call ourselves) not too far away.
We are a tribe of highly skilled, diverse individuals forming a globally distributed community of over 135 Shubbers from over 30 countries working in professional services, sales, support, and sponsored Open Source work.