repollo / llm_data_parserLinks
This is a proof-of-concept of using an LLM to find and extract meaningful data without parsing the html too much.
☆29Updated 2 years ago
Alternatives and similar repositories for llm_data_parser
Users that are interested in llm_data_parser are comparing it to the libraries listed below
Sorting:
- Effortlessly extract information from unstructured data with this library, utilizing advanced AI techniques. Compose AI in customizable p…☆83Updated 11 months ago
- Use AI to personify books, so that you can talk to them 🙊☆18Updated 2 years ago
- A Pythonic integration for LLMs.☆88Updated last year
- YouTube Transcript Cleaner is a simple web-based application that improves the readability of YouTube transcripts.☆27Updated 6 months ago
- A pattern to let you try several vector databases and change a little code as possible☆38Updated 2 years ago
- A fork of Dragnet that also extract author, headline, date, keywords from context, as well as built in metadata extraction all in one pac…☆292Updated 3 months ago
- Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.☆29Updated 2 years ago
- Tools for building products and apps with LLMs.☆73Updated last year
- Retrieval of fully structured data made easy. Use LLMs or custom models. Specialized on PDFs and HTML files. Extensive support of tabular…☆73Updated last month
- Lego AI Parser is an open-source application that uses OpenAI to parse visible text of HTML elements.☆236Updated last year
- Langchain tools to search/extract/transcribe text transcripts of Youtube videos. Some of this has been integrated into LangChain main bra…☆74Updated 2 years ago
- LLM prompt language based on Jinja. Banks provides tools and functions to build prompts text and chat messages from generic blueprints. I…☆114Updated last month
- S3 vector database for LLM Agents and RAG.☆48Updated 2 years ago
- Tutorial and template for a semantic search app powered by the Atlas Embedding Database, Langchain, OpenAI and FastAPI☆115Updated last year
- A simple tool that serves as a knowledge graph explorer utilizing the GPT 3.5 turbo model to help users explore information in an organiz…☆59Updated 11 months ago
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO…☆52Updated 5 months ago
- agenty☆42Updated 6 months ago
- Private ChatGPT/Perplexity. Securely unlocks knowledge from confidential business information.☆71Updated 10 months ago
- Embedding models from Jina AI☆64Updated last year
- Built with Fast Dash, this app uses Embedchain, which abstracts the entire process of loading and chunking datasets, creating embeddings,…☆66Updated last year
- Async bulk data ingestion and querying in various document, graph and vector databases via their Python clients☆37Updated last year
- Get structured JSON data from any page.☆177Updated last year
- A ChatGPT UI for young readers, written by ChatGPT☆70Updated 2 years ago
- simplifies the process of creating and managing LLM workflows.☆108Updated 10 months ago
- Taking Normal Text as Input and Generating SQL commands using the OpenAI's GPT-3☆15Updated 5 years ago
- Fast Audio/Video transcribe using Openai's Whisper and Modal, an hour audio/video file can be transcribed in ~1 minute☆79Updated 2 years ago
- Fully working applications that demonstrate how to use Haystack to implement various use cases☆130Updated 3 weeks ago
- Record and replay LLM interactions for langchain☆82Updated last year
- Clean, filter and sample URLs to optimize data collection – Python & command-line – Deduplication, spam, content and language filters☆143Updated 8 months ago
- MindMapper is an innovative program that empowers intelligent agents to navigate complex thought landscapes and collaboratively map their…☆28Updated last year