compound-ai-systems / awesome-compound-ai-systems
A curated list of awesome Compound AI Systems
☆34 Updated last month
Alternatives and similar repositories for awesome-compound-ai-systems
Users who are interested in awesome-compound-ai-systems are comparing it to the repositories listed below
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System ☆132 Updated last year
- Tune efficiently any LLM model from HuggingFace using distributed training (multiple GPU) and DeepSpeed. Uses Ray AIR to orchestrate the … ☆59 Updated 2 years ago
- Framework for building data agent workflows ☆82 Updated 11 months ago
- Fine-tune an LLM to perform batch inference and online serving. ☆112 Updated 2 months ago
- ☆213 Updated last month
- A collection of all available inference solutions for the LLMs ☆91 Updated 5 months ago
- AskIt: Unified programming interface for programming with LLMs (GPT-3.5, GPT-4, Gemini, Claude, Cohere, Llama 2) ☆79 Updated 7 months ago
- Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research ☆218 Updated this week
- Pretrain, finetune and serve LLMs on Intel platforms with Ray ☆128 Updated this week
- ☆76 Updated 6 months ago
- r2e: turn any github repository into a programming agent environment ☆129 Updated 3 months ago
- Tutorial to get started with SkyPilot! ☆58 Updated last year
- XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR. ☆152 Updated 3 months ago
- Interactive coding assistant for data scientists and machine learning developers, empowered by large language models. ☆95 Updated 10 months ago
- LLM Serving Performance Evaluation Harness ☆79 Updated 5 months ago
- ☆80 Updated last year
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖 ☆73 Updated 8 months ago
- A framework for fine-tuning retrieval-augmented generation (RAG) systems. ☆125 Updated this week
- ArcticTraining is a framework designed to simplify and accelerate the post-training process for large language models (LLMs) ☆195 Updated last week
- ☆25 Updated this week
- [ICLR 2024] Skeleton-of-Thought: Prompting LLMs for Efficient Parallel Generation ☆174 Updated last year
- Benchmarking suite for popular AI APIs ☆87 Updated 6 months ago
- ArcticInference: vLLM plugin for high-throughput, low-latency inference ☆207 Updated this week
- A list of LLM benchmark frameworks. ☆68 Updated last year
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file. ☆176 Updated 5 months ago
- Using LlamaIndex with Ray for productionizing LLM applications ☆71 Updated 2 years ago
- ☆28 Updated 4 months ago
- ☆19 Updated last year
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera… ☆82 Updated this week
- ☆41 Updated last year