OpenRL-Lab / Ray_TutorialLinks
Tutorial for Ray
☆25Updated last year
Alternatives and similar repositories for Ray_Tutorial
Users that are interested in Ray_Tutorial are comparing it to the libraries listed below
Sorting:
- A MoE impl for PyTorch, [ATC'23] SmartMoE☆64Updated last year
- A visuailzation tool to make deep understaning and easier debugging for RLHF training.☆216Updated 4 months ago
- A High-Efficiency System of Large Language Model Based Search Agents☆56Updated 3 weeks ago
- Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (…☆127Updated this week
- ☆53Updated last week
- RLHF experiments on a single A100 40G GPU. Support PPO, GRPO, REINFORCE, RAFT, RLOO, ReMax, DeepSeek R1-Zero reproducing.☆64Updated 4 months ago
- ☆145Updated 5 months ago
- AI Alignment: A Comprehensive Survey☆135Updated last year
- Code for a New Loss for Mitigating the Bias of Learning Difficulties in Generative Language Models☆63Updated 4 months ago
- This is a repo for showcasing using MCTS with LLMs to solve gsm8k problems☆84Updated 3 months ago
- A research repo for experiments about Reinforcement Finetuning☆48Updated 2 months ago
- ☆27Updated 3 months ago
- Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning☆186Updated 3 months ago
- "what, how, where, and how well? a survey on test-time scaling in large language models" repository☆45Updated this week
- ☆33Updated 9 months ago
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆133Updated last year
- [ICLR 2025] Benchmarking Agentic Workflow Generation☆100Updated 4 months ago
- analyse problems of AI with Math and Code☆17Updated 2 weeks ago
- A lightweight reinforcement learning framework that integrates seamlessly into your codebase, empowering developers to focus on algorithm…☆32Updated last month
- ☆82Updated last year
- ☆109Updated 7 months ago
- A Telegram bot to recommend arXiv papers☆275Updated 2 months ago
- ☆16Updated this week
- ☆44Updated 5 months ago
- This is the reading list for the survey "A Survey on the Optimization of LLM-based Agents ". We will keep adding papers and improving the…☆115Updated last month
- 使用单个24G显卡,从0开始训练LLM☆56Updated last month
- HFAI deep learning models☆148Updated 2 years ago
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…☆57Updated last year
- Long CoT Fine-Tuning and Reinforcement Learning for LLMs in the Context of the 24-Point Game: A Toy Project☆18Updated 4 months ago
- tinybig for deep function learning☆60Updated 3 weeks ago