Accelerating Multitask Training Trough Adaptive Transition [Efficient ML Model]
☆12May 23, 2025Updated 9 months ago
Alternatives and similar repositories for MT2ST
Users that are interested in MT2ST are comparing it to the libraries listed below
Sorting:
- Adaptive Topology Reconstruction for Robust Graph Representation Learning [Efficient ML Model]☆10Feb 11, 2025Updated last year
- Efficient Foundation Model Design: A Perspective From Model and System Co-Design [Efficient ML System & Model]☆28Feb 23, 2025Updated last year
- GraphSnapShot: Caching Local Structure for Fast Graph Learning [Efficient ML System]☆40Jan 1, 2026Updated 2 months ago
- A Serving System for Distributed and Parallel LLM Quantization [Efficient ML System]☆26Jun 18, 2025Updated 8 months ago
- FastCache: Fast Caching for Diffusion Transformer Through Learnable Linear Approximation [Efficient ML Model]☆46Feb 17, 2026Updated 2 weeks ago
- This is the unofficial implementation of LEMON (ICLR'2024).☆12Apr 14, 2024Updated last year
- Source code for "Latent Plan Transformer for Trajectory Abstraction: Planning as Latent Space Inference." In NeurIPS 2024☆21Dec 1, 2024Updated last year
- PiKV: KV Cache Management System for Mixture of Experts [Efficient ML System]☆48Feb 24, 2026Updated last week
- An unofficial implementation of "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"☆36Jun 7, 2024Updated last year
- Official PyTorch implementation of The Linear Attention Resurrection in Vision Transformer☆16Sep 7, 2024Updated last year
- Code for ICML21 paper "Learning Self-Modulating Attention in Continuous Time Space with Applications to Sequential Recommendation"☆12Feb 8, 2023Updated 3 years ago
- [CVPR 2025] PVC: Progressive Visual Token Compression for Unified Image and Video Processing in Large Vision-Language Models☆51Jun 12, 2025Updated 8 months ago
- tmp DPI☆14Dec 18, 2024Updated last year
- Quantization of LLMs and benchmarking.☆10Apr 3, 2024Updated last year
- ☆13Oct 7, 2024Updated last year
- ☆13May 10, 2024Updated last year
- ☆49Mar 3, 2024Updated 2 years ago
- ☆17Mar 14, 2024Updated last year
- ☆13Jul 28, 2023Updated 2 years ago
- The official implementation of paper: SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction.☆51Oct 18, 2024Updated last year
- [ICLR'25] The first benchmark aiming to evaluate whether LMMs can assist oracle bone inscription processing tasks☆22Mar 21, 2025Updated 11 months ago
- [COLING 2025🔥] Evolver: Chain-of-Evolution Prompting to Boost Large Multimodal Models for Hateful Meme Detection☆17Jan 21, 2025Updated last year
- Official python implementation for ICML 2024: "Learning Solution-Aware Transformers for Efficiently Solving Quadratic Assignment Problem"☆17Jul 1, 2024Updated last year
- AVPipe :-)☆12Jul 16, 2021Updated 4 years ago
- Unleashing Reasoning in Medical Large Language Models☆12Mar 19, 2025Updated 11 months ago
- This is a implementation of DRGCN☆14Aug 18, 2022Updated 3 years ago
- Direct preference optimization with f-divergences.☆16Nov 3, 2024Updated last year
- A transformer model that should be able to solve a simple NER task☆11Mar 7, 2019Updated 6 years ago
- Implementation of A* Planning Algorithm in 3D Environment☆13Oct 18, 2017Updated 8 years ago
- Distributed DRL by Ray and TensorFlow Tutorial.☆10Dec 26, 2019Updated 6 years ago
- d-Matrix DMX Compressor: A Pytorch toolkit for nn.Module transformations supporting advanced quantization, sparsity, and elementwise func…☆21Oct 22, 2025Updated 4 months ago
- ☆11Jul 28, 2021Updated 4 years ago
- The official Implementation for TKDE paper "Individual and Structural Graph Information Bottlenecks for Out-of-Distribution Generalizatio…☆14Aug 6, 2023Updated 2 years ago
- [ICML'25] Breaking Silos: Adaptive Model Fusion Unlocks Better Time Series Forecasting | 样本级别的自适应多模型集成时间序列预测☆24May 22, 2025Updated 9 months ago
- PyTorch implementation of Language model compression with weighted low-rank factorization☆13Jun 28, 2023Updated 2 years ago
- LoRA-One: One-Step Full Gradient Could Suffice for Fine-Tuning Large Language Models, Provably and Efficiently (ICML2025 Oral)☆28Oct 22, 2025Updated 4 months ago
- A Compute Express Link (CXL) Benchmark Suite☆20Feb 12, 2025Updated last year
- ☆12Apr 17, 2025Updated 10 months ago
- ☆15Mar 21, 2025Updated 11 months ago