A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for various file types through textract.
☆1,047Feb 27, 2025Updated last year
Alternatives and similar repositories for swiss_army_llama
Users that are interested in swiss_army_llama are comparing it to the libraries listed below
Sorting:
- High-performance vector similarity library in Rust with Python bindings: Spearman, Kendall, distance correlation, Jensen-Shannon, Hoeffdi…☆428Feb 25, 2026Updated last week
- Structured Outputs☆13,488Mar 2, 2026Updated last week
- Turn expensive prompts into cheap fine-tuned models☆2,787May 25, 2024Updated last year
- 💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows☆12,266Updated this week
- Seamlessly integrate LLMs as Python functions☆2,393Nov 24, 2025Updated 3 months ago
- Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chro…☆3,022Feb 11, 2026Updated 3 weeks ago
- Enhances Tesseract OCR output using LLMs (local or API) for error correction, smart chunking, and markdown formatting of scanned PDFs☆2,880Updated this week
- A fast inference library for running LLMs locally on modern consumer-class GPUs☆4,451Updated this week
- Python package for easily interfacing with chat apps, with robust features and minimal code complexity.☆3,513Jul 3, 2024Updated last year
- A Bulletproof Way to Generate Structured JSON from Language Models☆4,905Feb 24, 2024Updated 2 years ago
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.☆1,645Jul 31, 2024Updated last year
- structured outputs for llms☆12,468Feb 25, 2026Updated last week
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,868May 17, 2025Updated 9 months ago
- Go ahead and axolotl questions☆11,395Updated this week
- Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.☆12,148Mar 2, 2026Updated last week
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali☆2,703Feb 5, 2026Updated last month
- Things you can do with the token embeddings of an LLM☆1,453Dec 1, 2025Updated 3 months ago
- Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time.☆21,456Updated this week
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)☆12,745Updated this week
- A guidance language for controlling large language models.☆21,333Feb 13, 2026Updated 3 weeks ago
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆13,206Mar 1, 2026Updated last week
- Superagent protects your AI applications against prompt injections, data leaks, and harmful outputs. Embed safety directly into your app …☆6,449Feb 3, 2026Updated last month
- Easily take an entire YouTube playlist and turn it into high quality transcripts using Whisper.☆659Feb 27, 2025Updated last year
- an ambient intelligence library☆6,096Updated this week
- LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath☆9,478Jun 7, 2025Updated 9 months ago
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆2,913Sep 30, 2023Updated 2 years ago
- Replace Splunk in your small company with this one weird trick!☆424Feb 27, 2025Updated last year
- LlamaIndex is the leading document agent and OCR platform☆47,374Updated this week
- A RAG LLM co-pilot for browsing the web, powered by local LLMs☆1,515Jan 26, 2025Updated last year
- SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.☆7,720Nov 7, 2025Updated 4 months ago
- DSPy: The framework for programming—not prompting—language models☆32,519Updated this week
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆3,732May 21, 2025Updated 9 months ago
- 🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading☆9,982Sep 7, 2024Updated last year
- Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 20+ clouds, o…☆9,555Updated this week
- 💬 RasaGPT is the first headless LLM chatbot platform built on top of Rasa and Langchain. Built w/ Rasa, FastAPI, Langchain, LlamaIndex, …☆2,461Nov 12, 2025Updated 3 months ago
- An extensible, easy-to-use, and portable diffusion web UI 👨🎨☆1,672Aug 18, 2023Updated 2 years ago
- Large Action Model framework to develop AI Web Agents☆6,311Jan 21, 2025Updated last year
- Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean…☆14,135Updated this week
- A language for constraint-guided and efficient LLM programming.☆4,155May 22, 2025Updated 9 months ago