repollo / llm_data_parser
This is a proof of concept that uses an LLM to find and extract meaningful data without parsing the HTML too much.
☆30Updated 2 years ago
Alternatives and similar repositories for llm_data_parser
Users who are interested in llm_data_parser are comparing it to the libraries listed below
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO…☆52Updated 7 months ago
- Effortlessly extract information from unstructured data with this library, utilizing advanced AI techniques. Compose AI in customizable p…☆85Updated last year
- A pattern to let you try several vector databases and change as little code as possible☆38Updated 2 years ago
- A fork of Dragnet that also extracts author, headline, date, keywords from context, as well as built-in metadata extraction all in one pac…☆295Updated 5 months ago
- Docx tracked change redlines for the Python ecosystem.☆87Updated last year
- Private ChatGPT/Perplexity. Securely unlocks knowledge from confidential business information.☆72Updated last year
- Visual Studio Code extension to convert HTML to FastHTML FT☆19Updated 8 months ago
- A demo that shows how to build a semantic search experience with Typesense's vector search feature and Instantsearch.js☆26Updated 2 years ago
- Taking Normal Text as Input and Generating SQL commands using OpenAI's GPT-3☆15Updated 5 years ago
- A simple tool that serves as a knowledge graph explorer utilizing the GPT 3.5 turbo model to help users explore information in an organiz…☆59Updated last year
- YouTube Transcript Cleaner is a simple web-based application that improves the readability of YouTube transcripts.☆26Updated 8 months ago
- Retrieval of fully structured data made easy. Use LLMs or custom models. Specialized on PDFs and HTML files. Extensive support of tabular…☆75Updated last week
- LLM prompt language based on Jinja. Banks provides tools and functions to build prompts text and chat messages from generic blueprints. I…☆116Updated 3 months ago
- Record and replay LLM interactions for langchain☆82Updated last year
- simplifies the process of creating and managing LLM workflows.☆110Updated last year
- Embedding models from Jina AI☆65Updated last year
- A Python micro framework for creating LLM-driven agents☆23Updated 5 months ago
- Fully automated AI based web scraping.☆28Updated 8 months ago
- Spider ported to Python☆94Updated 9 months ago
- scraping and querying documents for LLMs☆24Updated 3 weeks ago
- Claudetools is a Python library that enables function calling with the Claude 3 family of language models from Anthropic.☆38Updated 9 months ago
- ☆13Updated last year
- LLM-ready data connectors☆95Updated last year
- Open Source LLMOps tool for AI teams☆128Updated 8 months ago
- Async bulk data ingestion and querying in various document, graph and vector databases via their Python clients☆40Updated last year
- Python SDK for Inngest: Durable functions and workflows in Python, hosted anywhere☆143Updated this week
- ☆59Updated 2 years ago
- new skills taxonomy using TextKernel data☆35Updated 3 years ago
- Repo to experiment with Graph RAG strategies using Kùzu☆59Updated last month
- Import unstructured data (text and images) into structured tables☆156Updated last week