compound-ai-systems / awesome-compound-ai-systems
A curated list of awesome Compound AI Systems
☆23 Updated 5 months ago
Related projects
Alternatives and complementary repositories for awesome-compound-ai-systems
- Efficiently tune any LLM from HuggingFace using distributed training (multiple GPUs) and DeepSpeed. Uses Ray AIR to orchestrate the … ☆53 Updated last year
- Pretrain, finetune and serve LLMs on Intel platforms with Ray ☆103 Updated last week
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System ☆93 Updated 5 months ago
- Three examples of recommendation system pipelines with NVIDIA Merlin and Redis ☆57 Updated last year
- ☆22 Updated last week
- ☆47 Updated 2 months ago
- Benchmark suite for LLMs from Fireworks.ai ☆59 Updated 2 weeks ago
- Tutorial to get started with SkyPilot! ☆56 Updated 6 months ago
- Modular and structured prompt caching for low-latency LLM inference ☆69 Updated 2 weeks ago
- Benchmarking suite for popular AI APIs ☆77 Updated 2 weeks ago
- IBM development fork of https://github.com/huggingface/text-generation-inference ☆57 Updated last month
- End-to-End LLM Guide ☆97 Updated 4 months ago
- ☆20 Updated this week
- ☆38 Updated 4 months ago
- LLM Serving Performance Evaluation Harness ☆57 Updated 2 months ago
- Self-host LLMs with vLLM and BentoML ☆74 Updated last week
- Packages and instructions for training and inference of LLMs on NVIDIA's new GH200 machines ☆19 Updated 2 months ago
- A collection of all available inference solutions for LLMs ☆73 Updated 2 months ago
- ☆29 Updated 4 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆48 Updated 4 months ago
- Framework for building data agent workflows ☆81 Updated 3 months ago
- Ray - A curated list of resources: https://github.com/ray-project/ray ☆42 Updated last year
- Using LlamaIndex with Ray for productionizing LLM applications