Things you can do with the token embeddings of an LLM
☆1,451 · Dec 1, 2025 · Updated 6 months ago
Alternatives and similar repositories for WordLlama
Users that are interested in WordLlama are comparing it to the libraries listed below.
- Fast State-of-the-Art Static Embeddings ☆2,127 · Jun 6, 2026 · Updated last week
- ai for jq ☆249 · Sep 20, 2024 · Updated last year
- LLM Analytics ☆714 · Oct 19, 2024 · Updated last year
- Structured Outputs ☆13,964 · May 18, 2026 · Updated last month
- 💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows ☆12,657 · Jun 11, 2026 · Updated last week
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models. ☆1,621 · Dec 20, 2025 · Updated 5 months ago
- ☆447 · Sep 18, 2024 · Updated last year
- The Context Layer for unstructured data: typed, versioned datasets over S3, GCS, Azure ☆2,782 · Updated this week
- LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve spee… ☆3,141 · May 19, 2025 · Updated last year
- Improved file parsing for LLM’s ☆3,162 · May 17, 2026 · Updated last month
- Felafax is building AI infra for non-NVIDIA GPUs ☆570 · Jan 24, 2025 · Updated last year
- DSPy: The framework for programming—not prompting—language models ☆35,064 · Updated this week
- SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API. ☆7,887 · Nov 7, 2025 · Updated 7 months ago
- A Bulletproof Way to Generate Structured JSON from Language Models ☆4,926 · Feb 24, 2024 · Updated 2 years ago
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality ☆5,041 · Aug 10, 2024 · Updated last year
- Go ahead and axolotl questions ☆12,061 · Updated this week
- RAG that intelligently adapts to your use case, data, and queries ☆3,800 · Nov 1, 2025 · Updated 7 months ago
- RAG Logger is an open-source logging tool designed specifically for Retrieval-Augmented Generation (RAG) applications. It serves as a lig… ☆227 · Dec 24, 2024 · Updated last year
- Optimizing inference proxy for LLMs ☆4,147 · May 7, 2026 · Updated last month
- A vector search SQLite extension that runs anywhere! ☆7,730 · May 18, 2026 · Updated last month
- High-performance retrieval engine for unstructured data ☆1,585 · Nov 10, 2025 · Updated 7 months ago
- structured outputs for llms ☆13,181 · Updated this week
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-… ☆3,934 · May 17, 2025 · Updated last year
- MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024. ☆1,446 · Apr 30, 2026 · Updated last month
- Everything about the SmolLM and SmolVLM family of models ☆3,811 · May 26, 2026 · Updated 3 weeks ago
- Large Action Model framework to develop AI Web Agents ☆6,371 · Jan 21, 2025 · Updated last year
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks. ☆3,091 · May 26, 2026 · Updated 3 weeks ago
- An open-source RAG-based tool for chatting with your documents. ☆25,467 · Jun 9, 2026 · Updated last week
- Enhances Tesseract OCR output using LLMs (local or API) for error correction, smart chunking, and markdown formatting of scanned PDFs ☆2,929 · Mar 22, 2026 · Updated 2 months ago
- Empowering RAG with a memory-based data interface for all-purpose applications! ☆2,245 · Sep 11, 2025 · Updated 9 months ago
- Local realtime voice AI ☆2,484 · Nov 26, 2025 · Updated 6 months ago
- Fast Multimodal Semantic Deduplication & Filtering ☆936 · May 24, 2026 · Updated 3 weeks ago
- Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit ☆784 · Aug 12, 2024 · Updated last year
- grep for words with similar meaning to the query ☆1,236 · Aug 19, 2024 · Updated last year
- Use late-interaction multi-modal models such as ColPali in just a few lines of code. ☆850 · Jan 28, 2025 · Updated last year
- A system for agentic LLM-powered data processing and ETL ☆3,835 · Updated this week
- Distribute and run LLMs with a single file. ☆24,950 · Jun 9, 2026 · Updated last week
- OCR, layout analysis, reading order, table recognition in 90+ languages ☆20,840 · Updated this week
- A guidance language for controlling large language models. ☆21,500 · May 21, 2026 · Updated 3 weeks ago