repollo / llm_data_parser
This is a proof-of-concept of using an LLM to find and extract meaningful data without parsing the html too much.
☆29Updated last year
Alternatives and similar repositories for llm_data_parser:
Users that are interested in llm_data_parser are comparing it to the libraries listed below
- A simple tool that serves as a knowledge graph explorer utilizing the GPT 3.5 turbo model to help users explore information in an organiz…☆58Updated 7 months ago
- Async bulk data ingestion and querying in various document, graph and vector databases via their Python clients☆36Updated last year
- LLM plugin for models hosted by Anyscale Endpoints☆33Updated 11 months ago
- S3 vector database for LLM Agents and RAG.☆37Updated last year
- LLM plugin for clustering embeddings☆71Updated last year
- Example of running LangChain on Cloud Run☆61Updated last year
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO…☆49Updated 3 weeks ago
- LLM-ready data connectors☆75Updated 10 months ago
- Lightweight and Flexible Library for Creating Agents and Multi-Agent Conversations 🤖☆24Updated last year
- YouTube Transcript Cleaner is a simple web-based application that improves the readability of YouTube transcripts.☆25Updated last month
- get structured output from LLM's☆32Updated last year
- Run embedding models using ONNX☆31Updated last year
- Visual Studio Code extension to convert HTML to FastHTML FT☆18Updated last month
- Lightweight Python implementation of LingoNaut for multilingual language learning.☆17Updated last year
- A pattern to let you try several vector databases and change a little code as possible☆38Updated last year
- Chatroom app where messages are sent to GPT, Claude, Mistral, Together, Grok, Groq, Google, vLLM, Ollama & streamed to the frontend.☆39Updated 3 weeks ago
- A personal knowledge base that I can dump information to and help me learn☆24Updated 9 months ago
- Turn any input document into a sophisticated, context-dependent mindmap that distills the meaning and structure of the document.☆38Updated last month
- Generate dynamic UI forms from text using OpenAI's structured output API☆54Updated 8 months ago
- ☆27Updated 6 months ago
- Median is an open-source flashcard application that leverages the power of spaced repetition and artificial intelligence to transform the…☆22Updated 5 months ago
- Semantic Search + Keyword Search + Hybrid Search + Filtering + Faceting on 300K HN Comments☆49Updated 3 months ago
- Python SDK for Browserbase☆32Updated this week
- Web scraping API for building AI applications.☆41Updated last year
- ☆10Updated last year
- From Dataset Labeling, Entity Extraction to production Knowledge Graph Deployment: The Power of NLP and LLMs Combined.☆12Updated 10 months ago
- ☆101Updated 11 months ago
- Plugin for LLM adding support for Google's PaLM 2 model☆14Updated last year
- FalkorDB Python Client☆18Updated 3 weeks ago
- Embedding models from Jina AI☆58Updated last year