outerport / awesome-compound-ai-systemsLinks
Papers about infrastructure (deployment & serving) and systems for compound AI
☆11Updated last year
Alternatives and similar repositories for awesome-compound-ai-systems
Users that are interested in awesome-compound-ai-systems are comparing it to the libraries listed below
Sorting:
- Official repo of dataset-decomposition paper [NeurIPS 2024]☆20Updated 10 months ago
- Demo tutorial on how to program in Python an autonomous bot that plays the GeoGuessr game, using different Vision LLMs with LangChain☆12Updated last year
- ☆32Updated last year
- ☆12Updated 4 months ago
- Official Pytorch Implementation of "Cross-Attention Head Position Patterns Can Align with Human Visual Concepts in Text-to-Image Generati…☆10Updated 3 months ago
- working implimention of deepseek MLA☆45Updated 10 months ago
- ☆17Updated last month
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆60Updated last year
- KV cache compression via sparse coding☆14Updated last month
- ☆11Updated 11 months ago
- ☆28Updated last month
- ☆15Updated last year
- ☆19Updated 8 months ago
- Exploration into the Firefly algorithm in Pytorch☆41Updated 9 months ago
- Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters☆130Updated 11 months ago
- Repository to create traveling waves integrate special information through time☆56Updated 3 months ago
- Multi-Turn RL Training System with AgentTrainer for Language Model Game Reinforcement Learning☆51Updated 2 weeks ago
- KV Cache Steering for Inducing Reasoning in Small Language Models☆42Updated 4 months ago
- ☆20Updated 8 months ago
- Latent Large Language Models☆19Updated last year
- Code and data for paper "(How) do Language Models Track State?"☆20Updated 7 months ago
- A collection of tricks and tools to speed up transformer models☆189Updated 3 weeks ago
- I learn about and explain quantization☆26Updated last year
- ☆43Updated last week
- Experimental GPU language with meta-programming☆24Updated last year
- A tool for an analysis of LLM generations.☆40Updated last month
- A showroom for various animations generated by large language models (LLM). Our method takes a rigged 3D model and produces novel animati…☆29Updated last year
- ☆20Updated 8 months ago
- Landing repository for the paper "Softpick: No Attention Sink, No Massive Activations with Rectified Softmax"☆85Updated 2 months ago
- A powerful, enterprise-grade multi-agent system for advanced radiological analysis, diagnosis, and treatment planning. This system levera…☆13Updated last month