HawkClaws / main_content_extractor
A library to extract the main content from html. Developed for information on LLM and for feeding data into LangChain and LlamaIndex.
☆34Updated 9 months ago
Alternatives and similar repositories for main_content_extractor:
Users that are interested in main_content_extractor are comparing it to the libraries listed below
- Crawl and convert any website into clean markdown☆46Updated 9 months ago
- Clone of https://r.jina.ai which is deployable locally☆39Updated 5 months ago
- Multimodal LLM Application with PyMuPDF4LLM☆35Updated 4 months ago
- ☆26Updated last year
- More than chat. Evelyn is an AI tutor that engages.☆40Updated 10 months ago
- A Python library to chunk/group your texts based on semantic similarity.☆93Updated 7 months ago
- ☆30Updated 2 months ago
- This repo is for handling Question Answering, especially for Multi-hop Question Answering☆67Updated last year
- Split and analyze text files using langchain and streamlit☆47Updated 9 months ago
- ElasticSearch agent based on ElasticSearch, LangChain and ChatGPT 4☆44Updated last year
- Generate ChatGPT function call schemas based on function docstrings.☆57Updated last year
- POC Port of the openai-realtime-console to streamlit.☆45Updated 4 months ago
- RAG Citation enhances Retrieval-Augmented Generation (RAG) by automatically generating relevant citations for AI-generated content. It en…☆25Updated 3 months ago
- Easy to deploy.A cloud service for python code interpreter sandbox for Code-Interpreter.☆48Updated 11 months ago
- ☆11Updated last year
- Making LLM Tool-Calling Simpler.☆23Updated 5 months ago
- Full stack advanced chatbot over LlamaIndex.TS documentation with preview feature using Multi-documents-agents, bootstrapped with create-…☆147Updated 11 months ago
- ☆23Updated 6 months ago
- Build reliable, secure, and production-ready AI apps easily.☆62Updated this week
- This project enhances the construction of RAG applications by addressing challenges, improving accessibility, scalability, and managing d…☆142Updated 10 months ago
- Routing on Random Forest (RoRF)☆123Updated 5 months ago
- LLM-ready data connectors☆71Updated 9 months ago
- Retrieval of fully structured data made easy. Use LLMs or custom models. Specialized on PDFs and HTML files. Extensive support of tabular…☆65Updated 3 weeks ago
- I have explained how to create superior RAG pipeline for complex pdfs using LlamaParse. We can extract text and tables from pdf and QA on…☆42Updated last year
- Zep: Long-Term Memory for AI Assistants (Python Client)☆88Updated this week
- Natural Language Interfaces Powered by LLMs☆91Updated 7 months ago
- ☆44Updated 7 months ago
- OpenAI document chatbot using llama-index, pinecone and chainlit. With incremental features, giving you the tools to go from a basic RAG …☆67Updated 10 months ago
- LangEvals aggregates various language model evaluators into a single platform, providing a standard interface for a multitude of scores a…☆45Updated last week
- An JS web client for connecting to Pipecat bots with voice and vision☆43Updated 2 months ago