lperezmo / embeddings-extraction
Scripts for reading, extracting, and organizing data from HTML or PDF documents and preparing it for conversion into embeddings for use in context-augmented LLM queries.
☆13 Updated 8 months ago
Alternatives and similar repositories for embeddings-extraction:
Users who are interested in embeddings-extraction are comparing it to the libraries listed below.
- Rust bindings for CTranslate2 ☆14 Updated last year
- This repository implements DSPy programs to tasks in Indian Languages ☆13 Updated last year
- From Dataset Labeling, Entity Extraction to production Knowledge Graph Deployment: The Power of NLP and LLMs Combined. ☆12 Updated 11 months ago
- DocAI helps developers quickly build document, image and text processing pipelines using open source and cloud-based machine learning mod… ☆20 Updated 2 years ago
- efficient query encoding for dense retrieval ☆11 Updated 8 months ago
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform ☆38 Updated last year
- OVALChat is a customizable Web app aimed at conducting user studies with chatbots ☆28 Updated last year
- AI_Powered_Dev_Search_Engine ☆12 Updated last year
- Unstract's interface to LLMs, Embeddings and VectorDBs. ☆18 Updated 9 months ago
- A framework for writing Unstract Tools/Apps ☆18 Updated this week
- FalkorDB-Browser is a visualization UI for FalkorDB. ☆30 Updated this week
- Geniusrise: Framework for building geniuses ☆60 Updated 11 months ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min… ☆26 Updated 5 months ago
- OpenAI compatible API for open source LLMs ☆15 Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite. ☆33 Updated last year
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an… ☆13 Updated this week
- A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization. ☆16 Updated 7 months ago
- API to load and query documents using RAG ☆15 Updated last year
- Fast and versatile tokenizer for language models, compatible with SentencePiece, Tokenizers, Tiktoken and more. Supports BPE, Unigram and… ☆19 Updated last month
- An approach to perform RAG while taking into account the dynamic evolution of the data. Helpful to detect emerging trends in the data ☆26 Updated last year
- ☆22 Updated 2 months ago
- Tools for formatting large language model prompts. ☆12 Updated last year
- 💙 Unstructured Data Connectors for Haystack 2.0 ☆16 Updated last year
- Neural Solr = Solr 9 + Mighty Inference + Node ☆17 Updated 2 years ago
- Python library to use Pleias-RAG models ☆27 Updated this week
- Run embedding models using ONNX ☆32 Updated last year
- A narrow implementation of DiagramGPT for generating system architecture diagrams with local LLM models and Llama.cpp ☆24 Updated 10 months ago
- Trace LLM calls (and others) and visualize them in WandB, as interactive SVG or using a streaming local webapp ☆14 Updated 2 months ago
- 📚 Build knowledge bases for RAG ☆17 Updated this week
- Solve Geometric & Graph Problems with Large Language Models ☆29 Updated 2 years ago