dleemiller / WordLlama
Things you can do with the token embeddings of an LLM
☆1,440Updated last month
Alternatives and similar repositories for WordLlama:
Users that are interested in WordLlama are comparing it to the libraries listed below
- Fast State-of-the-Art Static Embeddings☆1,563Updated this week
- This repo provides the server side code for llmsherpa API to connect. It includes parsers for various file formats.☆1,228Updated last month
- High-performance retrieval engine for unstructured data☆1,364Updated this week
- See Through Your Models☆389Updated last month
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.☆1,400Updated last month
- Radient turns many data types (not just text) into vectors for similarity search, RAG, regression analysis, and more.☆275Updated last month
- Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy☆1,139Updated 2 weeks ago
- Easy token price estimates for 400+ LLMs. TokenOps.☆1,647Updated 3 weeks ago
- Everything about the SmolLM2 and SmolVLM family of models☆2,265Updated last month
- A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full…☆607Updated last month
- Fast Semantic Text Deduplication & Filtering☆654Updated last week
- A scientific instrument for investigating latent spaces☆696Updated 2 weeks ago
- Fully neural approach for text chunking☆341Updated last week
- Use late-interaction multi-modal models such as ColPali in just a few lines of code.☆776Updated 3 months ago
- Lightweight Nearest Neighbors with Flexible Backends☆269Updated 2 months ago
- LLM abstractions that aren't obstructions☆1,088Updated this week
- OCR Benchmark☆470Updated 2 weeks ago
- ☆741Updated last year
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,426Updated 2 months ago
- open-source framework for creating and managing simulations populated with AI-powered agents. It provides an intuitive platform for desig…☆915Updated 3 months ago
- Korvus is a search SDK that unifies the entire RAG pipeline in a single database query. Built on top of Postgres with bindings for Python…☆1,359Updated 3 months ago
- LLM Analytics☆658Updated 6 months ago
- An open-source OCR API that leverages OpenAI's powerful language models with optimized performance techniques like parallel processing an…☆852Updated 7 months ago
- RAG that intelligently adapts to your use case, data, and queries☆3,206Updated last month
- A hub for various industry-specific schemas to be used with VLMs.☆503Updated this week
- ☆438Updated 7 months ago
- DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤☆1,013Updated 3 months ago
- Achieve the llama3 inference step-by-step, grasp the core concepts, master the process derivation, implement the code.☆571Updated 2 months ago
- Empowering RAG with a memory-based data interface for all-purpose applications!☆1,760Updated last week
- Optimizing inference proxy for LLMs☆2,201Updated last week