lperezmo / embeddings-extractionLinks
Scripts for reading, extracting, and organizing data from either HTML or PDF documents and prepare them to be converted into embeddings for use in context-augmented LLM queries.
☆13Updated 10 months ago
Alternatives and similar repositories for embeddings-extraction
Users that are interested in embeddings-extraction are comparing it to the libraries listed below
Sorting:
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆13Updated 2 weeks ago
- DocAI helps developers quickly build document, image and text processing pipelines using open source and cloud-based machine learning mod…☆20Updated 2 years ago
- From Dataset Labeling, Entity Extraction to production Knowledge Graph Deployment: The Power of NLP and LLMs Combined.☆12Updated last year
- NewsAgent is an enterprise-grade news aggregation agent designed to fetch, query, and summarize news from multiple sources at scale.☆18Updated 2 weeks ago
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆38Updated last year
- OVALChat is a customizable Web app aimed at conducting user studies with chatbots☆28Updated last year
- Unstract's interface to LLMs, Embeddings and VectorDBs.☆18Updated 11 months ago
- Geniusrise: Framework for building geniuses☆60Updated last year
- ☆13Updated 2 months ago
- Rust bindings for CTranslate2☆14Updated 2 years ago
- ASTChunk is a Python toolkit for code chunking using Abstract Syntax Trees (ASTs), designed to create structurally sound and meaningful c…☆30Updated 2 weeks ago
- A narrow implementation of DiagramGPT for generating system architecture diagrams with local LLM models and Llama.cpp☆25Updated last year
- AI_Powered_Dev_Search_Engine☆12Updated last year
- Multi-step AI agents powered by Gemini 2.0 and the LangGraph framework. These agents orchestrate complex workflows and enhance their reas…☆10Updated 6 months ago
- Query, ask and chat with a document-index via transformer models!☆17Updated 2 years ago
- This repository implements DSPy programs to tasks in Indian Languages☆13Updated last year
- Luann (fka TypeAgent) allows you to create many LLM based agent(Various types of agent,scale up)☆21Updated 2 months ago
- A simple to use package to call various model providers such as openai, anthropic, and others with utmost reliability, security, and perf…☆13Updated 3 months ago
- create workflows with LLMs☆54Updated 11 months ago
- LLMs as Collaboratively Edited Knowledge Bases☆45Updated last year
- A collection of pre-build wrappers over common RAG systems like ChromaDB, Weaviate, Pinecone, and othersz!☆35Updated 2 weeks ago
- The open source implementation of the base model behind GPT-4 from OPENAI [Language + Multi-Modal]☆10Updated last year
- ☆11Updated 2 years ago
- Showcase how mxbai-embed-large-v1 can be used to produce binary embedding. Binary embeddings enabled 32x storage savings and 40x faster r…☆18Updated last year
- ☆22Updated last year
- ☆47Updated 9 months ago
- This repo lets you run mistral-7b in Google Colab.☆16Updated last year
- ☆40Updated 7 months ago
- Integrated LLM-based document and data Q&A with knowledge graph visualization☆23Updated last year
- The minimal, ad-hoc way of plug and play NebulaGraph with pip install, even inside Colab Notebook!☆17Updated last year