raznem / parsera
Lightweight library for scraping web-sites with LLMs
β1,083Updated 3 weeks ago
Alternatives and similar repositories for parsera
Users that are interested in parsera are comparing it to the libraries listed below
Sorting:
- π₯€ RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with PostgreSQL or SQLiteβ938Updated this week
- Easy token price estimates for 400+ LLMs. TokenOps.β1,658Updated this week
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has stβ¦β945Updated 2 weeks ago
- An experiment in meeting transcription and diarization with just an LLM. Maybe I went a little overboard thoughβ544Updated last month
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.β1,247Updated this week
- A fast tool to convert any website into LLM-ready markdown data. Built by https://supermemory.aiβ1,349Updated 9 months ago
- ContextGem: Effortless LLM extraction from documentsβ914Updated last week
- Fast State-of-the-Art Static Embeddingsβ1,615Updated this week
- Real-time data transformation framework for AI. Ultra performant, with incremental processing.β1,260Updated this week
- π¦ CHONK your texts with Chonkie β¨ β The no-nonsense RAG chunking libraryβ932Updated this week
- open-source framework for creating and managing simulations populated with AI-powered agents. It provides an intuitive platform for desigβ¦β915Updated 3 months ago
- Vision infrastructure to turn complex documents into RAG/LLM-ready dataβ2,152Updated this week
- β Stripped down, stable version of firecrawl optimized for self-hosting and ease of contribution. Billing logic and AI features are complβ¦β439Updated last month
- openperplex is an opensource AI search engineβ857Updated 9 months ago
- 90% of what you need for LLM app development. Nothing you don't.β260Updated 3 weeks ago
- The open LLM Ops platform - Traces, Analytics, Evaluations, Datasets and Prompt Optimization β¨β1,856Updated this week
- Generic rag framework to apply the power of LLMs on any given datasetβ602Updated this week
- A multithreaded πΈοΈ web crawler that recursively crawls a website and creates a π½ markdown file for each page, designed for LLM RAGβ383Updated 9 months ago
- Structured information extraction from documentsβ314Updated 7 months ago
- Infinite Bookshelf: Generate entire books in seconds using Groq and Llama3β1,301Updated 4 months ago
- Tools to build web AI agents that can authenticate, interact with and extract data from any website.β284Updated 4 months ago
- Easily deployable and scalable backend server that efficiently converts various document formats (pdf, docx, pptx, html, images, etc) intβ¦β579Updated 2 months ago
- The Knowledge Platform - expedite delivery of your knowledge to AI. Build, ship, and manage anywhere from local, cloud, on-prem, or edge.β388Updated this week
- Make any LLM to think like OpenAI o1 and deepseek R1β488Updated 3 months ago
- The SOTA Open-Source Browser Agent for autonomously performing complex tasks on the webβ2,110Updated last week
- Korvus is a search SDK that unifies the entire RAG pipeline in a single database query. Built on top of Postgres with bindings for Pythonβ¦β1,360Updated 3 months ago
- Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.β1,106Updated 4 months ago
- No-code ETL and data pipelines with AI and NLPβ311Updated 2 months ago
- A system for agentic LLM-powered data processing and ETLβ1,947Updated this week
- Python & JS/TS SDK for running AI-generated code/code interpreting in your AI appβ1,739Updated this week