dleemiller / WordLlama
Things you can do with the token embeddings of an LLM
☆1,424 Updated 2 weeks ago
Alternatives and similar repositories for WordLlama:
Users who are interested in WordLlama are comparing it to the libraries listed below.
- Fast State-of-the-Art Static Embeddings ☆1,060 Updated this week
- High-performance retrieval engine for unstructured data ☆1,165 Updated this week
- Everything about the SmolLM2 and SmolVLM family of models ☆1,888 Updated 2 weeks ago
- 🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library ☆2,599 Updated this week
- RAG that intelligently adapts to your use case, data, and queries ☆2,924 Updated last week
- A system for agentic LLM-powered data processing and ETL ☆1,669 Updated this week
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models. ☆1,295 Updated last week
- Build and query dynamic, temporally-aware Knowledge Graphs ☆1,915 Updated last week
- open-source framework for creating and managing simulations populated with AI-powered agents. It provides an intuitive platform for desig… ☆898 Updated 3 weeks ago
- A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full… ☆604 Updated 2 months ago
- 🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with PostgreSQL or SQLite ☆807 Updated this week
- Implementing the 4 agentic patterns from scratch ☆1,027 Updated 3 weeks ago
- Easy token price estimates for 400+ LLMs. TokenOps. ☆1,565 Updated this week
- Use late-interaction multi-modal models such as ColPali in just a few lines of code. ☆732 Updated 3 weeks ago
- A scientific instrument for investigating latent spaces ☆653 Updated 2 weeks ago
- LLM Analytics ☆642 Updated 4 months ago
- Empowering RAG with a memory-based data interface for all-purpose applications! ☆1,633 Updated 2 months ago
- LOTUS: A semantic query engine for fast and easy LLM-powered data processing ☆1,062 Updated this week
- Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, trace, persist, and execute on your own infrastr… ☆1,505 Updated last week
- Improved file parsing for LLMs ☆2,804 Updated 3 months ago
- Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy ☆1,019 Updated last month
- Fast Semantic Text Deduplication ☆525 Updated this week
- ☆740 Updated 10 months ago
- Vision model based document ingestion ☆1,647 Updated this week
- Korvus is a search SDK that unifies the entire RAG pipeline in a single database query. Built on top of Postgres with bindings for Python… ☆1,328 Updated 3 weeks ago
- Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard a… ☆1,019 Updated last month
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-… ☆3,266 Updated last week
- This repo provides the server side code for llmsherpa API to connect. It includes parsers for various file formats. ☆1,185 Updated 4 months ago