跳转到主要内容
Avatar for scrapy from gravatar.com
用户名    scrapy

32 个项目

scrapyd

Last released

A service for running Scrapy spiders, with an HTTP API

itemloaders

Last released

Base library for scrapy's ItemLoader

scrapy-zyte-smartproxy

Last released

Scrapy middleware for Zyte Smart Proxy Manager

scrapy-poet

Last released

Page Object pattern for Scrapy

form2request

Last released

Build HTTP requests out of HTML forms

w3lib

Last released

Library of web-related functions

Scrapy

Last released

A high-level Web Crawling and Web Scraping framework

itemadapter

Last released

Common interface for data container classes

queuelib

Last released

Collection of persistent (disk-based) and non-persistent (memory-based) queues

parsel

Last released

Parsel is a library to extract data from HTML and XML using XPath and CSS selectors

Protego

Last released

Pure-Python robots.txt parser with support for modern conventions

web-poet

Last released

Zyte's Page Object pattern for web scraping

xtractmime

Last released

Implementation of the MIME Sniffing standard (https://mimesniff.spec.whatwg.org/)

andi

Last released

Library for annotation-based dependency injection

scrapy-splash

Last released

JavaScript support for Scrapy using Splash

scrapyd-client

Last released

A client for Scrapyd

cssselect

Last released

cssselect parses CSS3 Selectors and translates them to XPath 1.0

scrapy-deltafetch

Last released

Scrapy middleware to ignore previously crawled pages

splash

Last released

A javascript rendered with a HTTP API

scrapely

Last released

A pure-python HTML screen-scraping library

scrapy-po

Last released

Page Object pattern for Scrapy

webstruct

Last released

A library for creating statistical NER systems that work on HTML data

PyPyDispatcher

Last released

Multi-producer-multi-consumer signal dispatching mechanism

adblockparser

Last released

Parser for Adblock Plus rules

loginform

Last released

Fill HTML login forms automatically

scrapy-splitvariants

Last released

Scrapy spider middleware to split an item into multiple items on a multi-valued key

scrapy-hcf

Last released

Scrapy spider middleware to use Scrapinghub's Hub Crawl Frontier as a backend for URLs

scrapy-querycleaner

Last released

Scrapy spider middleware to clean up query parameters in request URLs

scrapy-magicfields

Last released

Scrapy middleware to add extra "magic" fields to items

scrapy-djangoitem

Last released

Scrapy extension to write scraped items using Django models

scrapyjs

Last released

JavaScript support for Scrapy using Splash

scrapy-jsonrpc

Last released

Scrapy extenstion to control spiders using JSON-RPC

支持

AWSAWS云计算和安全赞助商DatadogDatadog监控FastlyFastlyCDNGoogleGoogle下载分析MicrosoftMicrosoftPSF赞助商PingdomPingdom监控SentrySentry错误记录StatusPageStatusPage状态页