raznem / parseraLinks
Lightweight library for scraping web-sites with LLMs
β1,227Updated last month
Alternatives and similar repositories for parsera
Users that are interested in parsera are comparing it to the libraries listed below
Sorting:
- π₯€ RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with DuckDB or PostgreSQLβ1,086Updated last week
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.β1,430Updated last month
- Vision infrastructure to turn complex documents into RAG/LLM-ready dataβ2,882Updated 2 weeks ago
- β Stripped down, stable version of firecrawl optimized for self-hosting and ease of contribution. Billing logic and AI features are complβ¦β524Updated 4 months ago
- openperplex is an opensource AI search engineβ876Updated last year
- Tools to build web AI agents that can authenticate, interact with and extract data from any website.β299Updated 9 months ago
- An experiment in meeting transcription and diarization with just an LLM. Maybe I went a little overboard thoughβ556Updated 4 months ago
- A system for agentic LLM-powered data processing and ETLβ2,957Updated this week
- The open-source alternative to Carbon.ai. Build powerful RAG applications with any data source, at any scale.β856Updated 2 months ago
- Get clean data from tricky documents, powered by vision-language models β‘β1,303Updated 3 weeks ago
- the simplest self-building coding agentβ1,044Updated 11 months ago
- Quickly and securely turn your code projects into LLM prompts, all locally on your own machine!β659Updated 7 months ago
- β¨ AI-powered markdown editor - leverage LLMs with your documents - 100% local or in the cloudβ1,264Updated this week
- Turn Your Content Into a 24/7 AI Support Assistantβ730Updated 3 months ago
- ContextGem: Effortless LLM extraction from documentsβ1,516Updated last week
- Prompt optimization scratchβ854Updated 5 months ago
- open-source framework for creating and managing simulations populated with AI-powered agents. It provides an intuitive platform for desigβ¦β933Updated 8 months ago
- Easily deployable and scalable backend server that efficiently converts various document formats (pdf, docx, pptx, html, images, etc) intβ¦β709Updated 7 months ago
- β¨ AI interface for tinkerers (Ollama, Haystack RAG, Python)β471Updated last month
- AI Browser Automationβ759Updated last week
- Enterprise-grade and API-first LLM workspace for unstructured documents, including data extraction, redaction, rights management, prompt β¦β932Updated this week
- The SOTA Open-Source Browser Agent for autonomously performing complex tasks on the webβ2,319Updated 4 months ago
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has stβ¦β1,263Updated 5 months ago
- Make any LLM to think like OpenAI o1 and deepseek R1β489Updated 8 months ago
- A Kubernetes deployable instance of GroundX for document parsing, storage, and search.β798Updated last week
- No-code ETL and data pipelines with AI and NLPβ316Updated 7 months ago
- π An open-source alternative to OpenAI Operatorβ491Updated 2 months ago
- The open LLM Ops platform - Traces, Analytics, Evaluations, Datasets and Prompt Optimization β¨β2,534Updated this week
- the framework/ sdk that lets you build browser controlling agents in 3 lines of code. join chat @ https://discord.gg/umgnyQU2K8β562Updated last year
- The easiest way to get started with LlamaIndexβ1,459Updated 2 months ago