The Scrapy Cloud API (often also referred as the Scrapinghub API) is a HTTP API that you can use to control your spiders and consume the scraped data, among other things.


It is the recommended way to consume scraped data from spiders run on Scrapinghub, regardless of whether they're built with Scrapy or Portia. You can use tags to mark jobs consumed and skip them on next reads.

For more information, please refer to the API reference documentation here:
https://doc.scrapinghub.com/api/overview.html