microsoft / aici
AICI: Prompts as (Wasm) Programs
☆2,000 Updated 3 weeks ago
Alternatives and similar repositories for aici:
Users who are interested in aici are comparing it to the libraries listed below.
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-… ☆3,266 Updated last week
- Python & JS/TS SDK for running AI-generated code/code interpreting in your AI app ☆1,485 Updated this week
- [ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling ☆1,609 Updated 7 months ago
- Blazingly fast LLM inference. ☆5,064 Updated this week
- A benchmark to evaluate language models on questions I've previously asked them to solve. ☆973 Updated 2 weeks ago
- Accelerate your Hugging Face Transformers 7.6-9x. Native to Hugging Face and PyTorch. ☆688 Updated 5 months ago
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality ☆3,635 Updated 6 months ago
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications. ☆2,959 Updated this week
- Chat language model that can use tools and interpret the results ☆1,518 Updated this week
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLMs' perception of key information, compress the prompt and KV-Cache, which ach… ☆4,879 Updated 3 weeks ago
- An LLM-powered advanced RAG pipeline built from scratch ☆827 Updated last year
- A RAG LLM co-pilot for browsing the web, powered by local LLMs ☆1,477 Updated 3 weeks ago
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs ☆2,362 Updated last week
- Easy token price estimates for 400+ LLMs. TokenOps. ☆1,565 Updated this week
- A language for constraint-guided and efficient LLM programming. ☆3,823 Updated 8 months ago
- Training LLMs with QLoRA + FSDP ☆1,451 Updated 3 months ago
- Llama-3 agents that can browse the web by following instructions and talking to you ☆1,388 Updated 2 months ago
- Stateful load balancer custom-tailored for llama.cpp 🏓🦙 ☆706 Updated last month
- Any model. Any hardware. Zero compromise. Built with @ziglang / @openxla / MLIR / @bazelbuild ☆2,051 Updated this week
- 🕸️🦀 A WASM vector similarity search written in Rust ☆919 Updated last year
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models. ☆1,295 Updated last week
- Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale. ☆852 Updated last year
- Fast, Accurate, Lightweight Python library to make State of the Art Embedding ☆1,782 Updated this week
- A cross-platform browser ML framework. ☆658 Updated 2 months ago
- [ICML'24] Magicoder: Empowering Code Generation with OSS-Instruct ☆1,992 Updated 3 months ago
- A project structure aware autonomous software engineer aiming for autonomous program improvement. Resolved 37.3% tasks (pass@1) in SWE-be… ☆2,832 Updated 2 weeks ago
- Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering" ☆3,751 Updated 2 months ago