Dicklesworthstone / swiss_army_llamaView external linksLinks
A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for various file types through textract.
☆1,044Feb 27, 2025Updated 11 months ago
Alternatives and similar repositories for swiss_army_llama
Users that are interested in swiss_army_llama are comparing it to the libraries listed below
Sorting:
- High-performance vector similarity library in Rust with Python bindings: Spearman, Kendall, distance correlation, Jensen-Shannon, Hoeffdi…☆426Feb 27, 2025Updated 11 months ago
- Structured Outputs☆13,403Feb 6, 2026Updated last week
- Turn expensive prompts into cheap fine-tuned models☆2,781May 25, 2024Updated last year
- 💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows☆12,130Feb 10, 2026Updated last week
- Seamlessly integrate LLMs as Python functions☆2,386Nov 24, 2025Updated 2 months ago
- Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chro…☆3,003Feb 11, 2026Updated last week
- Enhances Tesseract OCR output using LLMs (local or API) for error correction, smart chunking, and markdown formatting of scanned PDFs☆2,860Jan 22, 2026Updated 3 weeks ago
- A fast inference library for running LLMs locally on modern consumer-class GPUs☆4,445Dec 9, 2025Updated 2 months ago
- A Bulletproof Way to Generate Structured JSON from Language Models☆4,902Feb 24, 2024Updated last year
- Python package for easily interfacing with chat apps, with robust features and minimal code complexity.☆3,519Jul 3, 2024Updated last year
- structured outputs for llms☆12,357Feb 10, 2026Updated last week
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.☆1,643Jul 31, 2024Updated last year
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,852May 17, 2025Updated 9 months ago
- Go ahead and axolotl questions☆11,289Updated this week
- Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.☆12,102Feb 9, 2026Updated last week
- Things you can do with the token embeddings of an LLM☆1,454Dec 1, 2025Updated 2 months ago
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali☆2,670Feb 5, 2026Updated last week
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)☆12,714Feb 9, 2026Updated last week
- Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time.☆21,125Jan 29, 2026Updated 2 weeks ago
- A guidance language for controlling large language models.☆21,309Updated this week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆13,155Feb 8, 2026Updated last week
- Superagent protects your AI applications against prompt injections, data leaks, and harmful outputs. Embed safety directly into your app …☆6,407Feb 3, 2026Updated 2 weeks ago
- an ambient intelligence library☆6,081Updated this week
- LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath☆9,473Jun 7, 2025Updated 8 months ago
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆2,911Sep 30, 2023Updated 2 years ago
- LlamaIndex is the leading framework for building LLM-powered agents over your data.☆46,977Updated this week
- Easily take an entire YouTube playlist and turn it into high quality transcripts using Whisper.☆647Feb 27, 2025Updated 11 months ago
- Replace Splunk in your small company with this one weird trick!☆424Feb 27, 2025Updated 11 months ago
- A RAG LLM co-pilot for browsing the web, powered by local LLMs☆1,514Jan 26, 2025Updated last year
- SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.☆7,673Nov 7, 2025Updated 3 months ago
- 🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading☆9,942Sep 7, 2024Updated last year
- DSPy: The framework for programming—not prompting—language models☆32,156Updated this week
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆3,719May 21, 2025Updated 8 months ago
- Large Action Model framework to develop AI Web Agents☆6,295Jan 21, 2025Updated last year
- Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 20+ clouds, o…☆9,457Updated this week
- Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean…☆13,973Updated this week
- 💬 RasaGPT is the first headless LLM chatbot platform built on top of Rasa and Langchain. Built w/ Rasa, FastAPI, Langchain, LlamaIndex, …☆2,461Nov 12, 2025Updated 3 months ago
- An extensible, easy-to-use, and portable diffusion web UI 👨🎨☆1,673Aug 18, 2023Updated 2 years ago
- A language for constraint-guided and efficient LLM programming.☆4,148May 22, 2025Updated 8 months ago