outerport / awesome-compound-ai-systemsLinks
Papers about infrastructure (deployment & serving) and systems for compound AI
☆11Updated 11 months ago
Alternatives and similar repositories for awesome-compound-ai-systems
Users that are interested in awesome-compound-ai-systems are comparing it to the libraries listed below
Sorting:
- ☆12Updated 3 months ago
- ☆32Updated last year
- Official codebase for "Analyzing the Generalization and Reliability of Steering Vectors"☆15Updated 10 months ago
- working implimention of deepseek MLA☆44Updated 9 months ago
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆59Updated last year
- some mixture of experts architecture implementations☆22Updated last year
- ☆14Updated 2 weeks ago
- Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters☆130Updated 10 months ago
- Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models)☆34Updated 7 months ago
- Official repo of dataset-decomposition paper [NeurIPS 2024]☆20Updated 9 months ago
- Fork of Flame repo for training of some new stuff in development☆18Updated last week
- KV cache compression via sparse coding☆14Updated 5 months ago
- RWKV-7: Surpassing GPT☆98Updated 11 months ago
- Lego for GRPO☆30Updated 4 months ago
- A tool for an analysis of LLM generations.☆40Updated this week
- H-Net Dynamic Hierarchical Architecture☆80Updated last month
- Demo tutorial on how to program in Python an autonomous bot that plays the GeoGuessr game, using different Vision LLMs with LangChain☆11Updated 11 months ago
- ☆63Updated last year
- Nexusflow function call, tool use, and agent benchmarks.☆29Updated 10 months ago
- [ICML 2025] Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction☆72Updated 4 months ago
- Samples of good AI generated CUDA kernels☆91Updated 4 months ago
- ☆15Updated 10 months ago
- ☆33Updated 9 months ago
- Official implementation of Regularized Policy Gradient (RPG) (https://arxiv.org/abs/2505.17508)☆51Updated last week
- Multi-Turn RL Training System with AgentTrainer for Language Model Game Reinforcement Learning☆47Updated last week
- Code for paper "Analog Foundation Models"☆27Updated last month
- Experimental GPU language with meta-programming☆23Updated last year
- CodeRepoQA dataset☆12Updated 8 months ago
- LLM training in simple, raw C/CUDA☆18Updated last year
- MPI Code Generation through Domain-Specific Language Models☆14Updated 11 months ago