awesomelistsio / awesome-ai-infrastructureLinks
A curated list of awesome tools, frameworks, platforms, and resources for building scalable and efficient AI infrastructure, including distributed training, model serving, MLOps, and deployment.
☆23Updated 3 months ago
Alternatives and similar repositories for awesome-ai-infrastructure
Users that are interested in awesome-ai-infrastructure are comparing it to the libraries listed below
Sorting:
- Benchmarking Deep Learning Frameworks☆13Updated last year
- Lighter, cheaper and faster RAG toolkit (Graph RAG) supported by TargetPilot☆46Updated 5 months ago
- ☆11Updated last year
- A tool to convert image of sheet music into an .wav audio file☆18Updated 2 months ago
- ☆27Updated 7 months ago
- Pretrain, finetune and serve LLMs on Intel platforms with Ray☆131Updated last month
- HaluMem is the first operation level hallucination evaluation benchmark tailored to agent memory systems.☆58Updated last week
- ☆32Updated last week
- ☆44Updated 6 months ago
- Moatless Testbeds allows you to create isolated testbed environments in a Kubernetes cluster where you can apply code changes through git…☆14Updated 7 months ago
- Code for MLSys 2024 Paper "SiDA-MoE: Sparsity-Inspired Data-Aware Serving for Efficient and Scalable Large Mixture-of-Experts Models"☆21Updated last year
- A modular and AI-powered appointment booking agent designed to streamline scheduling for businesses, starting with dental clinics. Built …☆23Updated 9 months ago
- Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere…☆22Updated 6 months ago
- The AI-PMS Microservice uses AI to predict aircraft system failures before they occur, optimizing maintenance and enhancing safety. This …☆13Updated last year
- ☆53Updated 4 months ago
- A curated list of awesome Compound AI Systems☆35Updated 4 months ago
- llm-inference is a platform for publishing and managing llm inference, providing a wide range of out-of-the-box features for model deploy…☆88Updated last year
- BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution☆52Updated last month
- AlphaXIV open-source alternative: Chat with any arXiv paper.☆93Updated 5 months ago
- Multi-Faceted AI Agent and Workflow Autotuning. Automatically optimizes LangChain, LangGraph, DSPy programs for better quality, lower exe…☆261Updated 6 months ago
- awesome llm plaza: daily tracking all sorts of awesome topics of llm, e.g. llm for coding, robotics, reasoning, multimod etc.☆210Updated 2 weeks ago
- Material for Ray Connect 2024 Conference☆12Updated last year
- 🌟 SwarmAgent: A framework for simulating social group dynamics using multi-agent collaboration, aiding insights into collective behavior…☆12Updated last year
- Self-host LLMs with LMDeploy and BentoML☆21Updated 4 months ago
- The driver for LMCache core to run in vLLM☆58Updated 9 months ago
- ☆102Updated last year
- DeepTrace: A lightweight, scalable real-time diagnostic and analysis tool for distributed training tasks.☆17Updated 2 weeks ago
- ☆61Updated 11 months ago
- [ACL25' Findings] SWE-Dev is an SWE agent with a scalable test case construction pipeline.☆56Updated 3 months ago
- [NeurIPS'25] Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning☆80Updated 2 months ago