OpenRL-Lab / Ray_Tutorial
Tutorial for Ray
☆18Updated 10 months ago
Alternatives and similar repositories for Ray_Tutorial:
Users that are interested in Ray_Tutorial are comparing it to the libraries listed below
- This is a repo for showcasing using MCTS with LLMs to solve gsm8k problems☆49Updated last month
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆128Updated 8 months ago
- A MoE impl for PyTorch, [ATC'23] SmartMoE☆61Updated last year
- A visuailzation tool to make deep understaning and easier debugging for RLHF training.☆148Updated this week
- ☆57Updated 2 months ago
- Minimal RLHF implementation built on top of minGPT.☆29Updated 7 months ago
- A Really Scalable RL Framework to 10k+ CPUs☆25Updated 11 months ago
- ☆30Updated 5 months ago
- ☆88Updated last month
- [NeurIPS 2024] Efficient LLM Scheduling by Learning to Rank☆39Updated 3 months ago
- Vocabulary Parallelism☆17Updated 3 months ago
- Efficient Mixture of Experts for LLM Paper List☆36Updated 2 months ago
- ☆48Updated last year
- Natural Language Reinforcement Learning☆72Updated 2 months ago
- Estimate MFU for DeepSeekV3☆16Updated last month
- Linear Attention Sequence Parallelism (LASP)☆77Updated 8 months ago
- SQUEEZED ATTENTION: Accelerating Long Prompt LLM Inference☆40Updated 3 months ago
- ☆96Updated 10 months ago
- Official implementation of ICML 2024 paper "ExCP: Extreme LLM Checkpoint Compression via Weight-Momentum Joint Shrinking".☆46Updated 7 months ago
- ☆30Updated 8 months ago
- PipeRAG: Fast Retrieval-Augmented Generation via Algorithm-System Co-design (KDD 2025)☆17Updated 8 months ago
- Ouroboros: Speculative Decoding with Large Model Enhanced Drafting (EMNLP 2024 main)☆84Updated 4 months ago
- ☆98Updated 2 months ago
- ☆62Updated last week
- The official implementation of paper: SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction.☆42Updated 4 months ago
- ☆14Updated 10 months ago