lperezmo / embeddings-extractionLinks
Scripts for reading, extracting, and organizing data from either HTML or PDF documents and prepare them to be converted into embeddings for use in context-augmented LLM queries.
☆13Updated last year
Alternatives and similar repositories for embeddings-extraction
Users that are interested in embeddings-extraction are comparing it to the libraries listed below
Sorting:
- DocAI helps developers quickly build document, image and text processing pipelines using open source and cloud-based machine learning mod…☆20Updated 3 years ago
- create workflows with LLMs☆55Updated last year
- Integrated LLM-based document and data Q&A with knowledge graph visualization☆23Updated 2 years ago
- large language model for mastering data analysis using pandas☆48Updated 2 years ago
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆23Updated last year
- Split and analyze text files using langchain and streamlit☆50Updated last year
- Python client library for improving your LLM app accuracy☆97Updated last year
- Query, ask and chat with a document-index via transformer models!☆17Updated 2 years ago
- Retrieval of fully structured data made easy. Use LLMs or custom models. Specialized on PDFs and HTML files. Extensive support of tabular…☆83Updated last month
- ☆57Updated 2 years ago
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆38Updated 2 years ago
- Source code of the food discovery demo built on top of Qdrant☆49Updated 2 years ago
- Rust bindings for CTranslate2☆14Updated 2 years ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆24Updated last year
- My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't rel…☆12Updated 2 years ago
- ☆66Updated last week
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆34Updated last year
- Implementing the Chain Of Density text summarisation technique from recent NLP research by researchers at Salesforce, MIT, Columbia, etc.…☆78Updated 10 months ago
- OVALChat is a customizable Web app aimed at conducting user studies with chatbots☆29Updated 2 years ago
- Minimal zero-shot intent classifier for arbitrary intent slot filling, via LLM prompting w LangChain.☆37Updated 2 years ago
- From Dataset Labeling, Entity Extraction to production Knowledge Graph Deployment: The Power of NLP and LLMs Combined.☆12Updated last year
- Notebooks using the Neural Magic libraries 📓☆39Updated last year
- ☆54Updated 3 weeks ago
- ☆12Updated last month
- 🔎 A deep-dive into HyDE for Advanced LLM RAG + 💡 Introducing AutoHyDE, a semi-supervised framework to improve the effectiveness, covera…☆34Updated last year
- Develop, evaluate and monitor LLM applications at scale☆100Updated last year
- A framework for high-fidelity retrieval augmented generation in industrial knowledge bases. Integrates jargon identification, context rec…☆35Updated 3 months ago
- 💙 Unstructured Data Connectors for Haystack 2.0☆17Updated 2 years ago
- Record and replay LLM interactions for langchain☆82Updated last year
- A library for simplifying training with multi gpu setups in the HuggingFace / PyTorch ecosystem.☆16Updated last month