dleemiller / WordLlama
Things you can do with the token embeddings of an LLM
☆1,433 · Updated last month
Alternatives and similar repositories for WordLlama:
Users interested in WordLlama are comparing it to the libraries listed below.
- Fast State-of-the-Art Static Embeddings ☆1,109 · Updated 3 weeks ago
- This repo provides the server side code for llmsherpa API to connect. It includes parsers for various file formats. ☆1,198 · Updated 5 months ago
- Use late-interaction multi-modal models such as ColPali in just a few lines of code. ☆753 · Updated last month
- clean & curate your data with LLMs. ☆484 · Updated 8 months ago
- A system for agentic LLM-powered data processing and ETL ☆1,718 · Updated this week
- High-performance retrieval engine for unstructured data ☆1,272 · Updated this week
- Easy token price estimates for 400+ LLMs. TokenOps. ☆1,608 · Updated this week
- Everything about the SmolLM2 and SmolVLM family of models ☆2,035 · Updated last week
- A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full… ☆601 · Updated 3 months ago
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models. ☆1,338 · Updated last month
- 🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library ☆2,818 · Updated this week
- Fast Semantic Text Deduplication ☆582 · Updated 3 weeks ago
- RAG that intelligently adapts to your use case, data, and queries ☆3,042 · Updated 3 weeks ago
- The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol. ☆1,614 · Updated this week
- A lightweight task engine for building stateful AI agents that prioritizes simplicity and flexibility. ☆914 · Updated 2 months ago
- Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy ☆1,063 · Updated last week
- MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX. ☆1,057 · Updated this week
- Korvus is a search SDK that unifies the entire RAG pipeline in a single database query. Built on top of Postgres with bindings for Python… ☆1,343 · Updated last month
- ☆742 · Updated 11 months ago
- Felafax is building AI infra for non-NVIDIA GPUs ☆555 · Updated 2 months ago
- LLM Analytics ☆646 · Updated 5 months ago
- ☆434 · Updated 6 months ago
- Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, trace, persist, and execute on your own infrastr… ☆1,542 · Updated this week
- Stateful load balancer custom-tailored for llama.cpp 🏓🦙 ☆728 · Updated this week
- Improved file parsing for LLM’s ☆2,866 · Updated 4 months ago
- Optimizing inference proxy for LLMs ☆2,110 · Updated this week
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-… ☆3,328 · Updated last month