HawkClaws / main_content_extractorLinks
A library to extract the main content from html. Developed for information on LLM and for feeding data into LangChain and LlamaIndex.
☆53Updated last year
Alternatives and similar repositories for main_content_extractor
Users that are interested in main_content_extractor are comparing it to the libraries listed below
Sorting:
- Natural Language Interfaces Powered by LLMs☆95Updated last year
- Build reliable, secure, and production-ready AI apps easily.☆97Updated last month
- ☆51Updated last year
- Analyzing chat interactions w/ LLMs to improve 🦜🔗 Langchain docs☆86Updated 2 years ago
- Crawl and convert any website into clean markdown☆70Updated last year
- The official Python library for the Steel API☆28Updated this week
- converts url content into JSON with a simple prefix☆72Updated last year
- ☆38Updated last year
- ☆26Updated last year
- CLI to generate LangGraph stubs from a specification☆103Updated 10 months ago
- Example LangGraph flow that does "competitor analysis" on the web.☆38Updated last year
- Open-source RAG evaluation through users' feedback☆214Updated last year
- ☆33Updated 2 years ago
- Repository for deepdoctection tutorial notebooks☆48Updated 2 weeks ago
- Full stack advanced chatbot over LlamaIndex.TS documentation with preview feature using Multi-documents-agents, bootstrapped with create-…☆156Updated last year
- ☆95Updated 2 years ago
- Excel spreadsheet crawler and table parser for data extraction and querying☆164Updated 10 months ago
- Own your AI, search the web with it🌐😎☆94Updated last year
- This is a proof of concept repo on how to create a gradio UI using the Model Context Protocol Client Python SDK.☆67Updated last year
- A Function Calls Proxy for Groq, the fastest AI alive!☆207Updated last year
- Self-hosted version of Microsoft's OmniParser Image-to-text model☆81Updated 7 months ago
- This project enhances the construction of RAG applications by addressing challenges, improving accessibility, scalability, and managing d…☆147Updated last year
- Not Diamond Python SDK☆89Updated last month
- A multimodal RAG application that enables semantic search on multimedia sources like audio, video and images☆41Updated 2 years ago
- Data Questionnaire Agent Chatbot☆71Updated last week
- CAMEL framework-based multi-agent system for task-driven and dynamic environments☆105Updated last year
- Private ChatGPT/Perplexity. Securely unlocks knowledge from confidential business information.☆77Updated last year
- REST API for Large Language Models using FastAPI, Redis and LiteLLM☆14Updated 2 years ago
- ☆151Updated last year
- Uses Langchain to semantic search over a chat conversation☆38Updated 2 years ago