clemlesne / scrape-it-now
Web scraper made for AI and simplicity in mind. It runs as a CLI that can be parallelized and outputs high-quality markdown content.
☆515Updated 2 months ago
Alternatives and similar repositories for scrape-it-now:
Users that are interested in scrape-it-now are comparing it to the libraries listed below
- An open-source OCR API that leverages OpenAI's powerful language models with optimized performance techniques like parallel processing an…☆848Updated 6 months ago
- Detect whether or not an audio file was generated by NotebookLM☆133Updated 4 months ago
- ☆784Updated this week
- A lightweight task engine for building stateful AI agents that prioritizes simplicity and flexibility.☆919Updated this week
- Your toolkit for autonomous, evolving agent ecosystems. Create, execute, govern, and evolve agents that learn from experience, collaborat…☆407Updated this week
- Weave your codebase into a single, navigable Markdown document☆419Updated last month
- Create mind maps to learn new things using AI.☆543Updated 5 months ago
- Open-source framework for exporting your personal data.☆1,426Updated 3 months ago
- A hub for various industry-specific schemas to be used with VLMs.☆496Updated this week
- RAG Logger is an open-source logging tool designed specifically for Retrieval-Augmented Generation (RAG) applications. It serves as a lig…☆222Updated 3 months ago
- VSCode extension that demonstrates the use of large language models (LLMs) for active debugging of programs☆327Updated 2 months ago
- ➖ Stripped down, stable version of firecrawl optimized for self-hosting and ease of contribution. Billing logic and AI features are compl…☆409Updated last week
- Browser MCP is a Model Context Provider (MCP) server that allows AI applications to control your browser☆884Updated last week
- ☆162Updated 9 months ago
- A self-hosted API that takes a URL and returns a file with browser screenshots.☆941Updated last month
- ☆427Updated last week
- MCP server for fetch web page content using Playwright headless browser.☆561Updated this week
- Animating R1's thoughts.☆372Updated last month
- Attempt to create an Open Source Privacy Focused Rewind.ai Alternative for data capture☆204Updated 2 months ago
- ☆436Updated 6 months ago
- LetterDrop is a secure and efficient newsletter management service powered by Cloudflare Workers, enabling easy creation, distribution, a…☆306Updated 9 months ago
- The only fully local production-grade Super SDK that provides a simple, unified, and powerful interface for calling more than 200+ LLMs.☆437Updated this week
- Visualise your CSV files in seconds without sending your data anywhere☆506Updated 3 weeks ago
- Kilo Code (forked from Roo Code) gives you a whole dev team of AI agents in your code editor.☆131Updated this week
- Examples and guides for using the VLM Run API☆270Updated 3 weeks ago
- Claude Memory: Long-term memory for Claude☆411Updated last month
- A MCP server implementation for hyperbrowser☆208Updated this week
- DOM to Semantic-Markdown for use with LLMs☆811Updated 2 months ago
- Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)☆516Updated last week
- Summarize and query from a lot of heterogeneous documents. Any LLM provider, any filetype, scalable (?), WIP☆442Updated last week