royeisen / reasoning_loading_barLinks
☆54Updated 7 months ago
Alternatives and similar repositories for reasoning_loading_bar
Users that are interested in reasoning_loading_bar are comparing it to the libraries listed below
Sorting:
- ☆29Updated 3 months ago
- FastCuRL: Curriculum Reinforcement Learning with Stage-wise Context Scaling for Efficient LLM Reasoning☆57Updated 4 months ago
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper☆46Updated 5 months ago
- [ACL 2025] An inference-time decoding strategy with adaptive foresight sampling☆108Updated 8 months ago
- The official implementation of Cross-Task Experience Sharing (COPS)☆29Updated last year
- HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models☆53Updated last year
- ☆95Updated last year
- [NeurIPS 2025] Official implementation of "Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning"☆30Updated 3 months ago
- ☆33Updated 6 months ago
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"☆102Updated 5 months ago
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆29Updated 4 months ago
- Code for paper: Long cOntext aliGnment via efficient preference Optimization☆24Updated 4 months ago
- SWE-Swiss: A Multi-Task Fine-Tuning and RL Recipe for High-Performance Issue Resolution☆104Updated 4 months ago
- Codes for our paper "AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems"☆13Updated last year
- ☆38Updated last year
- Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments☆48Updated last month
- Process Reward Models That Think☆78Updated 2 months ago
- SIFT: Grounding LLM Reasoning in Contexts via Stickers☆57Updated 11 months ago
- ☆19Updated last year
- Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization☆81Updated last month
- Official code repository for Sketch-of-Thought (SoT)☆135Updated 9 months ago
- Official Repository of Native Parallel Reasoner☆100Updated last week
- ☆59Updated last month
- [EMNLP'25 Industry] Repo for "Z1: Efficient Test-time Scaling with Code"☆68Updated 10 months ago
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆42Updated last month
- This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM☆60Updated last year
- ☆111Updated 4 months ago
- [ACL 2025] Agentic Knowledgeable Self-awareness☆91Updated 7 months ago
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆125Updated 8 months ago
- ☆100Updated 6 months ago