☆19 May 4, 2023 Updated 2 years ago
Alternatives and similar repositories for Decentralized_FM_alpha
Users that are interested in Decentralized_FM_alpha are comparing it to the libraries listed below
- Repository for the COLM 2025 paper SpecDec++: Boosting Speculative Decoding via Adaptive Candidate Lengths ☆15 Jul 10, 2025 Updated 7 months ago
- Sirius, an efficient correction mechanism, which significantly boosts Contextual Sparsity models on reasoning tasks while maintaining its… ☆21 Sep 10, 2024 Updated last year
- ☆20 Jun 3, 2023 Updated 2 years ago
- LLM checkpointing for DeepSpeed/Megatron ☆25 Nov 30, 2025 Updated 3 months ago
- ☆34 Jun 22, 2024 Updated last year
- A universal workflow system for exactly-once DAGs ☆23 Jun 1, 2023 Updated 2 years ago
- AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving (OSDI 23) ☆94 Jul 14, 2023 Updated 2 years ago
- A resilient distributed training framework ☆97 Apr 11, 2024 Updated last year
- ☆27 Aug 25, 2023 Updated 2 years ago
- Code associated with the paper **Fine-tuning Language Models over Slow Networks using Activation Compression with Guarantees**. ☆28 Apr 25, 2023 Updated 2 years ago
- ☆29 May 28, 2024 Updated last year
- [ICLR2025] Breaking Throughput-Latency Trade-off for Long Sequences with Speculative Decoding ☆144 Dec 4, 2024 Updated last year
- Official Repo for "SplitQuant / LLM-PQ: Resource-Efficient LLM Offline Serving on Heterogeneous GPUs via Phase-Aware Model Partition and … ☆36 Aug 29, 2025 Updated 6 months ago
- ACL 2023 ☆39 Jun 6, 2023 Updated 2 years ago
- Prefix-Aware Attention for LLM Decoding ☆29 Jan 23, 2026 Updated last month
- netbeacon - monitoring your network capture, NIDS or network analysis process ☆19 Oct 26, 2013 Updated 12 years ago
- Training project about Deep Learning ☆12 Jun 22, 2017 Updated 8 years ago
- coursework from classes at UW ☆12 May 14, 2019 Updated 6 years ago
- Code accompanying the NeurIPS 2019 paper AutoAssist: A Framework to Accelerate Training of Deep Neural Networks. ☆14 Oct 3, 2022 Updated 3 years ago
- [ICDCS 2023] Evaluation and Optimization of Gradient Compression for Distributed Deep Learning ☆10 Apr 28, 2023 Updated 2 years ago
- [ICML 2024] Serving LLMs on heterogeneous decentralized clusters. ☆34 May 6, 2024 Updated last year
- Stateful LLM Serving ☆96 Mar 11, 2025 Updated 11 months ago
- [NeurIPS'25 Spotlight] Adaptive Attention Sparsity with Hierarchical Top-p Pruning ☆87 Nov 29, 2025 Updated 3 months ago
- [ICLR 2025] TidalDecode: A Fast and Accurate LLM Decoding with Position Persistent Sparse Attention ☆52 Aug 6, 2025 Updated 6 months ago
- ☆11 Oct 21, 2023 Updated 2 years ago
- Error-controlled interaction discovery in machine learning models ☆22 Jun 24, 2025 Updated 8 months ago
- Data and code for EACL'24 paper: Over-Reasoning and Redundant Calculation of Large Language Models ☆11 Jan 23, 2024 Updated 2 years ago
- ☆10 Jun 4, 2021 Updated 4 years ago
- A Java port of Jieba 0.39 that supports all core features of the original Jieba ☆12 Feb 14, 2019 Updated 7 years ago
- For our ISSTA'23 paper ACETest: Automated Constraint Extraction for Testing Deep Learning Operators ☆13 Mar 30, 2024 Updated last year
- A simple script to add pdf-files to Zotero via CLI ☆12 May 17, 2020 Updated 5 years ago
- ☆10 Apr 30, 2020 Updated 5 years ago
- Question Dependent Recurrent Entity Network ☆13 Sep 21, 2017 Updated 8 years ago
- ☆13 Jan 21, 2022 Updated 4 years ago
- Unofficial implementation of the paper: Exploring the Space of Key-Value-Query Models with Intention ☆12 May 24, 2023 Updated 2 years ago
- [NeurIPS '25] FastDINOv2: Frequency Based Curriculum Learning Improves Robustness and Training Speed ☆25 Jul 26, 2025 Updated 7 months ago
- ☆10 May 16, 2021 Updated 4 years ago
- Wrapper for publishing buffered metrics to Cloudwatch ☆10 Oct 14, 2019 Updated 6 years ago
- ⬆ A program for deploying and upgrading programs. ☆28 Mar 2, 2023 Updated 3 years ago