GeeeekExplorer / 3d-parallel-demo
Implements DP/TP/PP using torch.distributed
☆11Updated last year
Alternatives and similar repositories for 3d-parallel-demo
Users that are interested in 3d-parallel-demo are comparing it to the libraries listed below
- A survey of long-context LLMs from four perspectives: architecture, infrastructure, training, and evaluation☆55Updated 4 months ago
- [ICLR 2025🔥] D2O: Dynamic Discriminative Operations for Efficient Long-Context Inference of Large Language Models☆19Updated last month
- Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)☆303Updated 3 months ago
- Code associated with the paper **Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding**☆200Updated 6 months ago
- [ACL 2025 main] FR-Spec: Frequency-Ranked Speculative Sampling☆37Updated 3 weeks ago
- [ICLR 2025] PEARL: Parallel Speculative Decoding with Adaptive Draft Length☆105Updated 4 months ago
- Evaluation utilities based on SymPy.☆20Updated 8 months ago
- Model merging is a highly efficient approach for long-to-short reasoning.☆77Updated 2 months ago
- ☆140Updated last month
- ☆65Updated 4 months ago
- Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (…☆218Updated this week
- (best/better) practices of megatron on veRL and tuning guide☆68Updated last week
- Reproducing R1 for Code with Reliable Rewards☆246Updated 3 months ago
- Official Implementation of "Learning Harmonized Representations for Speculative Sampling" (HASS)☆43Updated 5 months ago
- This is the official Python version of CoreInfer: Accelerating Large Language Model Inference with Semantics-Inspired Adaptive Sparse Act…☆17Updated 9 months ago
- A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨☆241Updated last year
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.☆149Updated last week
- TokenSkip: Controllable Chain-of-Thought Compression in LLMs☆171Updated last month
- Super-Efficient RLHF Training of LLMs with Parameter Reallocation☆307Updated 3 months ago
- ☆43Updated 8 months ago
- A Comprehensive Survey on Long Context Language Modeling☆170Updated last month
- Multi-Candidate Speculative Decoding☆36Updated last year
- ☆114Updated 2 months ago
- Paper list for Efficient Reasoning.☆586Updated this week
- Bridge Megatron-Core to Hugging Face/Reinforcement Learning☆82Updated this week
- ☆206Updated 9 months ago
- ☆206Updated 5 months ago
- 😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, Agent, and Beyond☆280Updated last month
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains…☆244Updated 2 months ago
- [NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*☆112Updated 8 months ago