royeisen / reasoning_loading_barLinks
☆33Updated this week
Alternatives and similar repositories for reasoning_loading_bar
Users that are interested in reasoning_loading_bar are comparing it to the libraries listed below
Sorting:
- ☆23Updated 3 weeks ago
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆95Updated last month
- ☆47Updated this week
- ☆19Updated 4 months ago
- How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training☆36Updated 2 months ago
- Lottery Ticket Adaptation☆39Updated 7 months ago
- ☆66Updated 3 months ago
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search☆45Updated last week
- Official Repository for Paper "BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Compet…☆18Updated 10 months ago
- Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning☆28Updated 3 weeks ago
- Official implementation of "Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning"☆18Updated last month
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories☆20Updated 2 months ago
- A repository for research on medium sized language models.☆77Updated last year
- ☆17Updated this week
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated last year
- [ACL 2025] Are Your LLMs Capable of Stable Reasoning?☆26Updated 3 months ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆33Updated last year
- ☆24Updated 9 months ago
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆19Updated 3 weeks ago
- ☆13Updated 7 months ago
- a tool for gerenate dataset from doc☆12Updated 3 months ago
- Official code repository for Sketch-of-Thought (SoT)☆124Updated 2 months ago
- Lightweight toolkit package to train and fine-tune 1.58bit Language models☆81Updated last month
- Data preparation code for CrystalCoder 7B LLM☆45Updated last year
- ☆21Updated 7 months ago
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper☆39Updated 3 months ago
- XmodelLM☆39Updated 7 months ago
- RewardAnything: Generalizable Principle-Following Reward Models☆25Updated last month
- ☆52Updated last week
- ☆47Updated 9 months ago