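八爪鱼爬虫 (Octopus Crawler) is a data-collection extension for the Chrome browser. It offers two modes, smart collection and advanced collection, for extracting data from web pages. Extraction rules are edited visually through a graphical interface, so no complex code is required. A built-in task manager supports scheduled (Cron) tasks that run automatically. Collected data can be exported to Excel, CSV, JSON, and other formats for later analysis and use. The extension is built on a modern technology stack with a clean, easy-to-use interface, and is suited to data analysis, content monitoring, information gathering, and similar scenarios.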
AutoWDS
Popular repositories
- trafilatura Public Forked from adbar/trafilatura
  Python & command-line tool to gather text and metadata on the Web: crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML
  Python 1
- sde Public Forked from seagatesoft/sde
  Structured Data Extractor. An application to extract structured data from web pages. It uses the Data Extraction Based on Partial Tree Alignment (DEPTA) method. (UPDATE: I implemented a newer algorithm…
  Java
- mdr Public Forked from scrapinghub/mdr
  A Python library to detect and extract listing data from HTML pages.
  C
Repositories
- autowds-backend Public
- .github Public
- autowds-instance-demo Public
- apalis Public Forked from apalis-dev/apalis
  Simple, extensible multithreaded background job and message processing library for Rust
- browser Public Forked from lightpanda-io/browser
  Lightpanda: the headless browser designed for AI and automation
- crawlee Public Forked from apify/crawlee
  Crawlee: a web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.
- ape-dts Public Forked from apecloud/ape-dts
  ApeCloud's Data Transfer Suite, written in Rust. Provides ultra-fast data replication between MySQL, PostgreSQL, Redis, MongoDB, Kafka and ClickHouse, ideal for disaster recovery (DR) and migration scenarios.