microsoft / aici
AICI: Prompts as (Wasm) Programs
☆2,005 Updated 2 months ago
Alternatives and similar repositories for aici:
Users that are interested in aici are comparing it to the libraries listed below
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-… ☆3,328 Updated last month
- Blazingly fast LLM inference. ☆5,297 Updated this week
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality ☆3,746 Updated 7 months ago
- Llama-3 agents that can browse the web by following instructions and talking to you ☆1,393 Updated 3 months ago
- Reliable LLM Memory for AI Applications and AI Agents ☆1,663 Updated this week
- Python & JS/TS SDK for running AI-generated code/code interpreting in your AI app ☆1,603 Updated this week
- A SQLite extension for efficient vector search, based on Faiss! ☆1,824 Updated 10 months ago
- [EMNLP'23, ACL'24] To speed up LLM inference and enhance LLMs' perception of key information, compress the prompt and KV-Cache, which ach… ☆4,969 Updated 2 weeks ago
- Superfast AI decision making and intelligent processing of multi-modal data. ☆2,480 Updated this week
- A RAG LLM co-pilot for browsing the web, powered by local LLMs ☆1,491 Updated 2 months ago
- Seamlessly integrate LLMs as Python functions ☆2,233 Updated 3 weeks ago
- Fast, Accurate, Lightweight Python library to make State of the Art Embedding ☆1,900 Updated this week
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications. ☆3,007 Updated last week
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs ☆2,842 Updated 3 weeks ago
- Things you can do with the token embeddings of an LLM ☆1,433 Updated last month
- Fast Open-Source Search & Clustering engine × for Vectors & 🔜 Strings × in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C… ☆2,606 Updated last month
- A language for constraint-guided and efficient LLM programming. ☆3,871 Updated 9 months ago
- 🕸️🦀 A WASM vector similarity search written in Rust ☆935 Updated last year
- A Bulletproof Way to Generate Structured JSON from Language Models ☆4,662 Updated last year
- Easy token price estimates for 400+ LLMs. TokenOps. ☆1,609 Updated this week
- Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering" ☆3,796 Updated 4 months ago
- [ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling ☆1,640 Updated 8 months ago
- Training LLMs with QLoRA + FSDP ☆1,464 Updated 4 months ago
- Optimizing inference proxy for LLMs ☆2,112 Updated last week
- Guide for fine-tuning Llama/Mistral/CodeLlama models and more ☆573 Updated 7 months ago
- Accelerate your Hugging Face Transformers 7.6-9x. Native to Hugging Face and PyTorch. ☆685 Updated 7 months ago
- A realtime serving engine for Data-Intensive Generative AI Applications ☆983 Updated this week
- Vision utilities for web interaction agents 👀 ☆1,627 Updated 4 months ago
- A cross-platform browser ML framework. ☆669 Updated 4 months ago
- Korvus is a search SDK that unifies the entire RAG pipeline in a single database query. Built on top of Postgres with bindings for Python… ☆1,346 Updated last month