dleemiller / WordLlamaView external linksLinks
Things you can do with the token embeddings of an LLM
☆1,454Dec 1, 2025Updated 2 months ago
Alternatives and similar repositories for WordLlama
Users that are interested in WordLlama are comparing it to the libraries listed below
Sorting:
- Fast State-of-the-Art Static Embeddings☆1,996Feb 8, 2026Updated last week
- ai for jq☆249Sep 20, 2024Updated last year
- Structured Outputs☆13,403Feb 6, 2026Updated last week
- LLM Analytics☆705Oct 19, 2024Updated last year
- 💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows☆12,130Updated this week
- ☆442Sep 18, 2024Updated last year
- Felafax is building AI infra for non-NVIDIA GPUs☆570Jan 24, 2025Updated last year
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.☆1,594Dec 20, 2025Updated last month
- Analytics, Versioning and ETL for multimodal data: video, audio, PDFs, images☆2,721Updated this week
- LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve spee…☆3,123May 19, 2025Updated 8 months ago
- A Bulletproof Way to Generate Structured JSON from Language Models☆4,902Feb 24, 2024Updated last year
- SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.☆7,673Nov 7, 2025Updated 3 months ago
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality☆4,597Aug 10, 2024Updated last year
- DSPy: The framework for programming—not prompting—language models☆32,156Updated this week
- RAG that intelligently adapts to your use case, data, and queries☆3,693Nov 1, 2025Updated 3 months ago
- Improved file parsing for LLM’s☆3,151Nov 13, 2024Updated last year
- Go ahead and axolotl questions☆11,289Updated this week
- Large Action Model framework to develop AI Web Agents☆6,295Jan 21, 2025Updated last year
- Everything about the SmolLM and SmolVLM family of models☆3,602Jan 13, 2026Updated last month
- Optimizing inference proxy for LLMs☆3,324Jan 28, 2026Updated 2 weeks ago
- structured outputs for llms☆12,357Updated this week
- High-performance retrieval engine for unstructured data☆1,559Nov 10, 2025Updated 3 months ago
- A vector search SQLite extension that runs anywhere!☆6,858Jan 24, 2025Updated last year
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,852May 17, 2025Updated 8 months ago
- Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit☆782Aug 12, 2024Updated last year
- An open-source RAG-based tool for chatting with your documents.☆25,019Jul 4, 2025Updated 7 months ago
- Run PyTorch LLMs locally on servers, desktop and mobile☆3,624Sep 10, 2025Updated 5 months ago
- [ICLR 2025] LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs☆1,826Jun 24, 2025Updated 7 months ago
- Distribute and run LLMs with a single file.☆23,704Updated this week
- Easy token price estimates for 400+ LLMs. TokenOps.☆1,930Sep 5, 2025Updated 5 months ago
- A system for agentic LLM-powered data processing and ETL☆3,557Feb 2, 2026Updated last week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆13,155Feb 8, 2026Updated last week
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.☆2,885Updated this week
- Empowering RAG with a memory-based data interface for all-purpose applications!☆2,210Sep 11, 2025Updated 5 months ago
- Local realtime voice AI☆2,429Nov 26, 2025Updated 2 months ago
- grep for words with similar meaning to the query☆1,209Aug 19, 2024Updated last year
- Seamlessly integrate LLMs as Python functions☆2,388Nov 24, 2025Updated 2 months ago
- MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.☆1,409Apr 21, 2025Updated 9 months ago
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which ach…☆5,823Oct 28, 2025Updated 3 months ago