GeeeekExplorer / 3d-parallel-demoLinks
使用torch.distributed实现DP/TP/PP
☆12Updated 2 years ago
Alternatives and similar repositories for 3d-parallel-demo
Users that are interested in 3d-parallel-demo are comparing it to the libraries listed below
Sorting:
- a survey of long-context LLMs from four perspectives, architecture, infrastructure, training, and evaluation☆61Updated 10 months ago
- [ICLR 2025🔥] D2O: Dynamic Discriminative Operations for Efficient Long-Context Inference of Large Language Models☆27Updated 7 months ago
- ☆74Updated 9 months ago
- ☆209Updated 3 months ago
- A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨☆273Updated last year
- ☆50Updated 5 months ago
- ☆218Updated 2 months ago
- Official Implementation of "Learning Harmonized Representations for Speculative Sampling" (HASS)☆54Updated 10 months ago
- [ICLR 2025] PEARL: Parallel Speculative Decoding with Adaptive Draft Length☆147Updated last month
- Evaluation utilities based on SymPy.☆21Updated last year
- A Comprehensive Survey on Long Context Language Modeling☆226Updated 2 months ago
- [ACL 2025 main] FR-Spec: Frequency-Ranked Speculative Sampling☆49Updated 6 months ago
- Code associated with the paper **Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding**☆214Updated 11 months ago
- Implementation for FP8/INT8 Rollout for RL training without performence drop.☆289Updated 3 months ago
- 😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, Agent, and Beyond☆344Updated 2 weeks ago
- Reproducing R1 for Code with Reliable Rewards☆286Updated 9 months ago
- [ICLR 2026] On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification.☆532Updated last month
- Bridge Megatron-Core to Hugging Face/Reinforcement Learning☆191Updated last week
- Efficient Mixture of Experts for LLM Paper List☆166Updated 4 months ago
- ☆215Updated 11 months ago
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.☆276Updated last week
- Model merging is a highly efficient approach for long-to-short reasoning.☆98Updated 3 months ago
- official code for GliDe with a CaPE☆20Updated last year
- [EMNLP 2025] TokenSkip: Controllable Chain-of-Thought Compression in LLMs☆201Updated 2 months ago
- (best/better) practices of megatron on veRL and tuning guide☆129Updated 4 months ago
- A lightweight reproduction of DeepSeek-R1-Zero with indepth analysis of self-reflection behavior.☆249Updated 9 months ago
- This is the official Python version of CoreInfer: Accelerating Large Language Model Inference with Semantics-Inspired Adaptive Sparse Act…☆17Updated last year
- PoC for "SpecReason: Fast and Accurate Inference-Time Compute via Speculative Reasoning" [NeurIPS '25]☆61Updated 4 months ago
- Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)☆363Updated 9 months ago
- Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"☆246Updated 4 months ago