microsoft / aici
AICI: Prompts as (Wasm) Programs
☆2,032 · Updated 4 months ago
Alternatives and similar repositories for aici
Users interested in aici are comparing it to the libraries listed below.
- Python & JS/TS SDK for running AI-generated code/code interpreting in your AI app ☆1,815 · Updated last week
- All things prompt engineering ☆5,627 · Updated last year
- [ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling ☆1,698 · Updated 11 months ago
- Accelerate your Hugging Face Transformers 7.6-9x. Native to Hugging Face and PyTorch. ☆683 · Updated 9 months ago
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLMs' perception of key information, compress the prompt and KV-Cache, which ach… ☆5,174 · Updated 3 months ago
- A language for constraint-guided and efficient LLM programming. ☆3,961 · Updated 3 weeks ago
- Blazingly fast LLM inference. ☆5,742 · Updated this week
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-… ☆3,496 · Updated last month
- Training LLMs with QLoRA + FSDP ☆1,485 · Updated 7 months ago
- 🔒 Enterprise-grade API gateway that helps you monitor and impose cost or rate limits per API key. Get fine-grained access control and mo… ☆1,055 · Updated 5 months ago
- Chat language model that can use tools and interpret the results ☆1,563 · Updated this week
- Minimal LLM inference in Rust ☆994 · Updated 7 months ago
- Easy token price estimates for 400+ LLMs. TokenOps. ☆1,716 · Updated this week
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications. ☆3,056 · Updated this week
- Agents Capable of Self-Editing Their Prompts / Python Code ☆770 · Updated last year
- Stateful load balancer custom-tailored for llama.cpp 🏓🦙 ☆779 · Updated this week
- ☆898 · Updated 9 months ago
- A cross-platform browser ML framework. ☆701 · Updated 6 months ago
- A realtime serving engine for Data-Intensive Generative AI Applications ☆1,020 · Updated this week
- A fast llama2 decoder in pure Rust. ☆1,051 · Updated last year
- 🕸️🦀 A WASM vector similarity search written in Rust ☆973 · Updated last year
- ☆447 · Updated last year
- Enforce the output format (JSON Schema, Regex etc) of a language model ☆1,820 · Updated 3 months ago
- Llama-3 agents that can browse the web by following instructions and talking to you ☆1,404 · Updated 6 months ago
- An LLM-powered advanced RAG pipeline built from scratch ☆841 · Updated last year
- Vision utilities for web interaction agents 👀 ☆1,688 · Updated 6 months ago
- A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for vario… ☆1,016 · Updated 3 months ago
- Super-fast Structured Outputs ☆304 · Updated last week
- Customizable implementation of the self-instruct paper. ☆1,044 · Updated last year
- A RAG LLM co-pilot for browsing the web, powered by local LLMs ☆1,511 · Updated 4 months ago