Autoscraping beta

Overview

Autoscraping is a tool that allows you to scrape web sites without any programming knowledge. You just annotate web pages to tell the scraper where to extract each field (name, description, title) from, and the system does all the rest. And it's all web based, so all you need is a modern browser, there's no need to download and install anything.

Tour

The following screencast video provides a good introduction to what this service can do

This blog post contains a screencast video that shows how the tool works.

Open source

The core extraction logic of the Autoscraping system is based on the Scrapely open source library.

Autoscraping is powered by the Slybot crawler, an open source integration of the Scrapely extraction library and the Scrapy web crawling framework. This is the core technology behind the service and you can export Autoscraping projects from Scrapinghub to be run in slybot, on any machine, allowing our users to have the flexibility and freedom provided by open source.

Pricing

See our pricing page.

Sign up

Autoscraping is currently in private beta. To sign up, fill this form and someone from our team will contact you in 1-3 business days.