koayon / awesome-adaptive-computationView external linksLinks
A curated reading list of research in Adaptive Computation, Inference-Time Computation & Mixture of Experts (MoE).
☆160Jan 1, 2025Updated last year
Alternatives and similar repositories for awesome-adaptive-computation
Users that are interested in awesome-adaptive-computation are comparing it to the libraries listed below
Sorting:
- ☆22Aug 27, 2023Updated 2 years ago
- Token-level adaptation of LoRA matrices for downstream task generalization.☆15Apr 14, 2024Updated last year
- A library for squeakily cleaning and filtering language datasets.☆49Jul 10, 2023Updated 2 years ago
- Block Transformer: Global-to-Local Language Modeling for Fast Inference (NeurIPS 2024)☆163Apr 13, 2025Updated 10 months ago
- AutoMoE: Neural Architecture Search for Efficient Sparsely Activated Transformers☆48Oct 21, 2022Updated 3 years ago
- [NeurIPS 2024] Efficiency for Free: Ideal Data Are Transportable Representations☆19Jan 19, 2025Updated last year
- LLM as World Models using Bayesian inference☆16May 27, 2025Updated 8 months ago
- Implementation of the Pairformer model used in AlphaFold 3☆14Updated this week
- Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations.☆12Feb 11, 2024Updated 2 years ago
- A library for simplifying training with multi gpu setups in the HuggingFace / PyTorch ecosystem.☆16Jan 9, 2026Updated last month
- A curated list of projects and resources using BAML☆17Aug 1, 2025Updated 6 months ago
- VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection☆22May 31, 2025Updated 8 months ago
- https://nnsmith-asplos.rtfd.io Artifact of "NNSmith: Generating Diverse and Valid Test Cases for Deep Learning Compilers" ASPLOS'23☆11Mar 29, 2023Updated 2 years ago
- A collection of AWESOME things about mixture-of-experts☆1,262Dec 8, 2024Updated last year
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Jul 12, 2023Updated 2 years ago
- EfficientSAM + YOLO World base model for use with Autodistill.☆10Feb 21, 2024Updated last year
- Implementation of the paper: "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"☆114Updated this week
- ☆21Jan 27, 2026Updated 2 weeks ago
- batched loras☆349Sep 6, 2023Updated 2 years ago
- ☆29Jan 23, 2024Updated 2 years ago
- Tutel MoE: Optimized Mixture-of-Experts Library, Support GptOss/DeepSeek/Kimi-K2/Qwen3 using FP8/NVFP4/MXFP4☆963Dec 21, 2025Updated last month
- ☆273Oct 31, 2023Updated 2 years ago
- A plugin to use a language model to fill in parts of notes.☆16Feb 20, 2024Updated last year
- This is the official code for the paper "SAM-DA: UAV Tracks Anything at Night with SAM-Powered Domain Adaptation".☆59Sep 27, 2024Updated last year
- ☆95Jul 26, 2023Updated 2 years ago
- Code for this paper "HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts via HyperNetwork"☆33Nov 29, 2023Updated 2 years ago
- A family of open-sourced Mixture-of-Experts (MoE) Large Language Models☆1,657Mar 8, 2024Updated last year
- Official PyTorch implementation of our ICCV2023 paper “When Prompt-based Incremental Learning Does Not Meet Strong Pretraining”☆16Jan 8, 2024Updated 2 years ago
- Examples for running TeNPy☆16Oct 31, 2025Updated 3 months ago
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆19Mar 10, 2025Updated 11 months ago
- Official release of the benchmark in paper "VSP: Diagnosing the Dual Challenges of Perception and Reasoning in Spatial Planning Tasks for…☆16Aug 1, 2025Updated 6 months ago
- Ponimator: Unfolding Interactive Pose for Versatile Human-human Interaction Animation (ICCV 2025)☆39Dec 10, 2025Updated 2 months ago
- Friends of OLMo and their links.☆356Sep 15, 2025Updated 4 months ago
- Implementation of Soft MoE, proposed by Brain's Vision team, in Pytorch☆344Apr 2, 2025Updated 10 months ago
- 🔥 LLM-powered GPU kernel synthesis: Train models to convert PyTorch ops into optimized Triton kernels via SFT+RL. Multi-turn compilation…☆116Nov 10, 2025Updated 3 months ago
- ☆80Mar 11, 2025Updated 11 months ago
- A bibliography and survey of the papers surrounding o1☆1,212Nov 16, 2024Updated last year
- What would you do with 1000 H100s...☆1,151Jan 10, 2024Updated 2 years ago
- Your AI assistant in the terminal.☆23Nov 22, 2024Updated last year