royeisen / reasoning_loading_bar
☆50 Updated last month
Alternatives and similar repositories for reasoning_loading_bar
Users who are interested in reasoning_loading_bar are comparing it to the repositories listed below.
- ☆30 Updated last month
- SIFT: Grounding LLM Reasoning in Contexts via Stickers ☆58 Updated 5 months ago
- ☆87 Updated last month
- JudgeLRM: Large Reasoning Models as a Judge ☆35 Updated 4 months ago
- FastCuRL: Curriculum Reinforcement Learning with Stage-wise Context Scaling for Efficient LLM Reasoning ☆51 Updated 2 months ago
- [ACL 2025] An inference-time decoding strategy with adaptive foresight sampling ☆104 Updated 3 months ago
- Official implementation of "Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning" ☆21 Updated 3 months ago
- A Recipe for Building LLM Reasoners to Solve Complex Instructions ☆21 Updated last month
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems ☆104 Updated 2 months ago
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework ☆69 Updated 2 months ago
- HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models ☆49 Updated 9 months ago
- ☆19 Updated 7 months ago
- [EMNLP 2025] LightThinker: Thinking Step-by-Step Compression ☆83 Updated 4 months ago
- Repo for "Z1: Efficient Test-time Scaling with Code" ☆64 Updated 4 months ago
- ReasonFlux-Coder: Open-Source LLM Coders with Co-Evolving Reinforcement Learning ☆109 Updated this week
- Process Reward Models That Think ☆49 Updated last month
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling ☆33 Updated this week
- Codebase for Instruction Following without Instruction Tuning ☆35 Updated 11 months ago
- Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments ☆27 Updated 2 weeks ago
- ☆89 Updated 9 months ago
- ☆25 Updated 2 months ago
- ☆18 Updated last month
- ☆77 Updated last week
- ☆47 Updated 6 months ago
- Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization ☆56 Updated this week
- Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812) ☆34 Updated 5 months ago
- SSRL: Self-Search Reinforcement Learning ☆93 Updated last week
- [ACL 2025] Knowledge Unlearning for Large Language Models ☆39 Updated 3 months ago
- From GaLore to WeLore: How Low-Rank Weights Non-uniformly Emerge from Low-Rank Gradients. Ajay Jaiswal, Lu Yin, Zhenyu Zhang, Shiwei Liu,… ☆47 Updated 4 months ago
- Efficient Agent Training for Computer Use ☆125 Updated 2 months ago