microsoft / aiciLinks
AICI: Prompts as (Wasm) Programs
☆2,028Updated 4 months ago
Alternatives and similar repositories for aici
Users that are interested in aici are comparing it to the libraries listed below
Sorting:
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,473Updated 2 weeks ago
- Blazingly fast LLM inference.☆5,644Updated this week
- A language for constraint-guided and efficient LLM programming.☆3,943Updated last week
- Accelerate your Hugging Face Transformers 7.6-9x. Native to Hugging Face and PyTorch.☆682Updated 9 months ago
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆2,985Updated last week
- Chat language model that can use tools and interpret the results☆1,553Updated 3 weeks ago
- 🕸️🦀 A WASM vector similarity search written in Rust☆963Updated last year
- Easy token price estimates for 400+ LLMs. TokenOps.☆1,671Updated this week
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆3,051Updated this week
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which ach…☆5,111Updated 2 months ago
- A Bulletproof Way to Generate Structured JSON from Language Models☆4,737Updated last year
- A realtime serving engine for Data-Intensive Generative AI Applications☆1,007Updated this week
- Llama-3 agents that can browse the web by following instructions and talking to you☆1,404Updated 5 months ago
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality☆3,985Updated 9 months ago
- Deep learning at the speed of light.☆1,673Updated this week
- A cross-platform browser ML framework.☆696Updated 6 months ago
- Structured Text Generation☆11,666Updated this week
- Training LLMs with QLoRA + FSDP☆1,479Updated 6 months ago
- Stateful load balancer custom-tailored for llama.cpp 🏓🦙☆764Updated this week
- A simple, performant and scalable Jax LLM!☆1,734Updated this week
- Superfast AI decision making and intelligent processing of multi-modal data.☆2,614Updated 3 weeks ago
- Harness LLMs with Multi-Agent Programming☆3,349Updated this week
- Things you can do with the token embeddings of an LLM☆1,442Updated 2 months ago
- A RAG LLM co-pilot for browsing the web, powered by local LLMs☆1,510Updated 4 months ago
- A fast llama2 decoder in pure Rust.☆1,050Updated last year
- A project structure aware autonomous software engineer aiming for autonomous program improvement. Resolved 37.3% tasks (pass@1) in SWE-be…☆2,938Updated last month
- Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering""☆3,825Updated 6 months ago
- A benchmark to evaluate language models on questions I've previously asked them to solve.☆1,011Updated last month
- ☆2,952Updated 8 months ago
- Optimizing inference proxy for LLMs☆2,427Updated this week