lperezmo / embeddings-extraction
Scripts for reading, extracting, and organizing data from either HTML or PDF documents and prepare them to be converted into embeddings for use in context-augmented LLM queries.
☆13Updated 7 months ago
Alternatives and similar repositories for embeddings-extraction:
Users that are interested in embeddings-extraction are comparing it to the libraries listed below
- From Dataset Labeling, Entity Extraction to production Knowledge Graph Deployment: The Power of NLP and LLMs Combined.☆12Updated 10 months ago
- A narrow implementation of DiagramGPT for generating system architecture diagrams with local LLM models and Llama.cpp☆24Updated 10 months ago
- Unstract's interface to LLMs, Embeddings and VectorDBs.☆18Updated 8 months ago
- time based thinking and structure like OpenAI's o1 preview.☆10Updated 6 months ago
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆13Updated 2 weeks ago
- Rust bindings for CTranslate2☆14Updated last year
- LLMs as Collaboratively Edited Knowledge Bases☆45Updated last year
- AI_Powered_Dev_Search_Engine☆12Updated last year
- ChatBot App built using LangChain and Lightning AI☆18Updated 2 years ago
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆37Updated last year
- NewsAgent is an enterprise-grade news aggregation agent designed to fetch, query, and summarize news from multiple sources at scale.☆16Updated last week
- OVALChat is a customizable Web app aimed at conducting user studies with chatbots☆28Updated last year
- Solve Geometric & Graph Problems with Large Language Models☆28Updated 2 years ago
- DocAI helps developers quickly build document, image and text processing pipelines using open source and cloud-based machine learning mod…☆20Updated 2 years ago
- 🔎 A deep-dive into HyDE for Advanced LLM RAG + 💡 Introducing AutoHyDE, a semi-supervised framework to improve the effectiveness, covera…☆32Updated last year
- A simple to use package to call various model providers such as openai, anthropic, and others with utmost reliability, security, and perf…☆12Updated 2 months ago
- A swarm of LLM agents that will help you test, document, and productionize your code!☆14Updated 2 weeks ago
- Luann allows you to create a LLM agent,which has complete memory module (long-term memory, short-term memory) and knowledge module(Variou…☆19Updated 2 weeks ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆26Updated 4 months ago
- The Swarm Ecosystem☆18Updated 7 months ago
- Tools for formatting large language model prompts.☆12Updated last year
- Contains the model patches and the eval logs from the passing swe-bench-lite run.☆10Updated 9 months ago
- This repository implements DSPy programs to tasks in Indian Languages☆13Updated last year
- Tools for merging pretrained large language models.☆19Updated 9 months ago
- efficient query encoding for dense retrieval☆11Updated 7 months ago
- A library for simplifying fine tuning with multi gpu setups in the Huggingface ecosystem.☆16Updated 5 months ago
- Probably one of the lightest native RAG + Agent apps out there,experience the power of Agent-powered models and Agent-driven knowledge ba…☆22Updated 2 weeks ago
- An autonomous Mall assistant that can answer user queries using tools. Powered by LLMs.☆14Updated last year
- A framework for high-fidelity retrieval augmented generation in industrial knowledge bases. Integrates jargon identification, context rec…☆29Updated 7 months ago
- 🛤️ Pathik - High-Performance Web Crawler ⚡☆25Updated this week