developer-rakeshpaul / scrapex
Modern web scraper with LLM-enhanced extraction, extensible pipeline, and pluggable parsers.
☆10 · Updated 3 weeks ago
Alternatives and similar repositories for scrapex
Users who are interested in scrapex are comparing it to the libraries listed below.
- Storex Core - A modular and portable database abstraction ecosystem for JavaScript ☆155 · Updated 2 months ago
- Elasticsearch storage adapter for Gun DB ☆29 · Updated last year
- Synced Files Workspace. A p2p collaborative filestructure built on Hypercore's Autobase. ☆17 · Updated 4 years ago
- Compress json-data based on its json-schema while still having valid json ☆98 · Updated this week
- An implementation of LevelDOWN that uses Amazon S3. Turn your S3 bucket into a DB ☆62 · Updated 2 weeks ago
- A LevelUP compatible leaderless multi-master database with eventual consistency, using hyperbee + CRDT + HLC. Similarly CockroachDB achi… ☆37 · Updated 5 years ago
- Files in Markdown ☆16 · Updated last year
- Simple end-to-end encrypted, secure channels using Noise Protocol Framework and libsodium secretstream ☆147 · Updated 3 years ago
- Multi-writer hypercore. ☆138 · Updated 6 months ago
- A series of compact encoding schemes for building small and fast parsers and serializers ☆25 · Updated 3 weeks ago
- A peer-to-peer data sync framework ☆22 · Updated 5 years ago
- Bridging the gap between buffers and typed arrays ☆45 · Updated 4 months ago
- Dat provider for Yjs ☆47 · Updated 5 years ago
- Tiny module for easy encryption of Buffers ☆32 · Updated 3 years ago
- Standalone Hyperspace RPC client ☆35 · Updated 4 years ago
- A universal static type checking solution for use in flow-based-programming systems ☆12 · Updated 8 years ago
- Micro-framework for building Gun adapters ☆31 · Updated 8 years ago
- Auto installs npm dependencies from the script you want to run and runs the script ☆47 · Updated last year
- A Leveldown-compliant backend for Hyperbee ☆48 · Updated 5 years ago
- A tiny relay server that bridges two WebSocket connections, allowing the clients to talk directly to each other. (Formerly known as 🐟 Ce… ☆112 · Updated last year
- Synchronize PouchDB or CouchDB with P2P Hypercores! ☆33 · Updated 3 years ago
- Article content extraction database ☆40 · Updated 2 years ago
- Proxy p2p connections using a duplex stream and Hyperswarm ☆31 · Updated 3 years ago
- MMST is used to create spanning trees in P2P networks while minimizing connections per node ☆49 · Updated 4 years ago
- Make any iterator or iterable abortable via an AbortSignal ☆16 · Updated last year
- In-memory abstract-level database for Node.js and browsers. ☆36 · Updated last month
- A language and specification for building data queries using key-value pairs ☆59 · Updated 3 years ago
- The Hyperswarm discovery stack ☆120 · Updated 5 years ago
- A `htmlparser2` handler for parsing rich metadata from HTML. Includes HTML metadata, JSON-LD, RDFa, microdata, OEmbed, Twitter cards and … ☆56 · Updated 2 years ago
- Run SSH over hyperswarm! ☆154 · Updated 10 months ago