HawkClaws / main_content_extractor
A library for extracting the main content from HTML, developed to supply information to LLMs and to feed data into LangChain and LlamaIndex.
☆41 Updated last year
Alternatives and similar repositories for main_content_extractor
Users who are interested in main_content_extractor are comparing it to the libraries listed below.
- Build reliable, secure, and production-ready AI apps easily. ☆73 Updated last week
- ☆28 Updated 2 years ago
- This project enhances the construction of RAG applications by addressing challenges, improving accessibility, scalability, and managing d… ☆145 Updated last year
- ☆32 Updated last year
- Natural Language Interfaces Powered by LLMs ☆91 Updated 10 months ago
- CLI to generate LangGraph stubs from a specification ☆79 Updated 3 months ago
- LLM-ready data connectors ☆83 Updated last year
- RAG Citation enhances Retrieval-Augmented Generation (RAG) by automatically generating relevant citations for AI-generated content. It en… ☆37 Updated 7 months ago
- LangEvals aggregates various language model evaluators into a single platform, providing a standard interface for a multitude of scores a… ☆58 Updated last week
- Dataset Viber is your chill repo for data collection, annotation and vibe checks. ☆47 Updated 9 months ago
- Full stack advanced chatbot over LlamaIndex.TS documentation with preview feature using Multi-documents-agents, bootstrapped with create-… ☆151 Updated last year
- ☆225 Updated 2 weeks ago
- Pipeline for converting PDFs to raw text with PaddleOCR ☆23 Updated last year
- A JS web client for connecting to Pipecat bots with voice and vision ☆45 Updated 6 months ago
- AI tool that annotates research papers and shows related articles and videos for better understanding ☆42 Updated last month
- Conduct consumer interviews with synthetic focus groups using LLMs and LangChain ☆43 Updated last year
- An OpenAI Completions API compatible server for NLP transformers models ☆65 Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ☆66 Updated 7 months ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte… ☆71 Updated 7 months ago
- A UI to view your chromaDB quickly. ☆26 Updated last year
- Not Diamond Python SDK ☆81 Updated last week
- Open-source RAG evaluation through users' feedback ☆188 Updated last year
- This code sets up a simple yet robust server using FastAPI for handling asynchronous requests for embedding generation and reranking task… ☆69 Updated last year
- POC Port of the openai-realtime-console to streamlit. ☆50 Updated 8 months ago
- Pinecone text client library ☆62 Updated 3 months ago
- Explore the use of DSPy for extracting features from PDFs 🔎 ☆40 Updated last year
- Use dynamic few-shot selection to tweet in a particular style ☆29 Updated 9 months ago
- This repo is for handling Question Answering, especially for Multi-hop Question Answering ☆67 Updated last year
- ☆89 Updated last year
- A list of AI memory projects ☆146 Updated 5 months ago