outerport / awesome-compound-ai-systemsLinks
Papers about infrastructure (deployment & serving) and systems for compound AI
☆11Updated last year
Alternatives and similar repositories for awesome-compound-ai-systems
Users that are interested in awesome-compound-ai-systems are comparing it to the libraries listed below
Sorting:
- ☆32Updated last year
- Fast and memory efficient PyTorch implementation of the Perceiver with FlashAttention.☆31Updated last year
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆61Updated last year
- Official codebase for "Analyzing the Generalization and Reliability of Steering Vectors"☆19Updated last year
- KV cache compression via sparse coding☆17Updated 2 months ago
- ☆13Updated 5 months ago
- Official repo of dataset-decomposition paper [NeurIPS 2024]☆20Updated last year
- working implimention of deepseek MLA☆45Updated last year
- Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters☆131Updated last year
- Official implementation of Regularized Policy Gradient (RPG) (https://arxiv.org/abs/2505.17508)☆63Updated this week
- ☆263Updated 7 months ago
- ☆14Updated 11 months ago
- ☆19Updated 10 months ago
- Enhancement in Multimodal Representation Learning.☆41Updated last year
- Synthetic data generator for image, video and 3D models☆32Updated last year
- Exploration into the Firefly algorithm in Pytorch☆41Updated 10 months ago
- RWKV-7: Surpassing GPT☆103Updated last year
- CodeRepoQA dataset☆15Updated 10 months ago
- AI-Driven Research Systems (ADRS)☆113Updated 3 weeks ago
- Defeating the Training-Inference Mismatch via FP16☆172Updated last month
- ☆55Updated last year
- ☆63Updated last year
- Repository to create traveling waves integrate special information through time☆56Updated 5 months ago
- This repository provides the official implementation of QSVD, a method for efficient low-rank approximation that unifies Query-Key-Value …☆23Updated last month
- Official Pytorch Implementation of "Cross-Attention Head Position Patterns Can Align with Human Visual Concepts in Text-to-Image Generati…☆10Updated 4 months ago
- Official PyTorch implementation and models for paper "Diffusion Beats Autoregressive in Data-Constrained Settings". We find diffusion mod…☆118Updated 2 months ago
- MPI Code Generation through Domain-Specific Language Models☆14Updated last year
- ☆28Updated 11 months ago
- The original Shared Recurrent Memory Transformer implementation☆33Updated 6 months ago
- The code repository for the CURLoRA research paper. Stable LLM continual fine-tuning and catastrophic forgetting mitigation.☆53Updated last year