HarderThenHarder / RLLoggingBoard
A visualization tool for deeper understanding and easier debugging of RLHF training.
☆145 Updated last month
Alternatives and similar repositories for RLLoggingBoard:
Users who are interested in RLLoggingBoard are comparing it to the libraries listed below.
- A flexible and efficient training framework for large-scale alignment tasks ☆303 Updated last week
- ☆88 Updated last month
- Official repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale" ☆216 Updated this week
- Related work and background techniques for OpenAI o1 ☆210 Updated last month
- ☆57 Updated 2 months ago
- Adds sequence parallelism to LLaMA-Factory ☆154 Updated this week
- Code for Paper (ReMax: A Simple, Efficient and Effective Reinforcement Learning Method for Aligning Large Language Models) ☆173 Updated last year
- ☆473 Updated last month
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models ☆128 Updated 8 months ago
- ☆104 Updated 3 months ago
- An automated pipeline for evaluating LLMs for role-playing. ☆158 Updated 5 months ago
- A series of technical reports on slow thinking with LLMs ☆409 Updated last week
- ☆139 Updated 7 months ago
- ☆98 Updated 2 months ago
- A multi-dimensional Chinese alignment evaluation benchmark for large language models (ACL 2024) ☆359 Updated 6 months ago
- A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details. ☆152 Updated this week
- ☆225 Updated 9 months ago
- Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent ☆223 Updated 2 weeks ago
- Real-time updated, fine-grained reading list on LLM-synthetic-data. 🔥 ☆217 Updated 3 weeks ago
- Super-Efficient RLHF Training of LLMs with Parameter Reallocation ☆219 Updated last month
- InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning ☆240 Updated last year
- [ICLR 2025] The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation" ☆125 Updated 2 months ago
- ☆209 Updated 9 months ago
- [ACL 2024 Demo] Official GitHub repo for UltraEval: An open source framework for evaluating foundation models. ☆233 Updated 3 months ago
- ☆89 Updated 2 months ago
- ☆318 Updated 7 months ago
- Code and data for "Scaling Relationship on Learning Mathematical Reasoning with Large Language Models" ☆245 Updated 5 months ago