MobileLLM / ParaThinkerLinks
☆41Updated last month
Alternatives and similar repositories for ParaThinker
Users that are interested in ParaThinker are comparing it to the libraries listed below
Sorting:
- ☆83Updated last week
- SSRL: Self-Search Reinforcement Learning☆158Updated 4 months ago
- ☆66Updated 6 months ago
- ☆51Updated 10 months ago
- [ACL 2025] Are Your LLMs Capable of Stable Reasoning?☆31Updated 4 months ago
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs☆61Updated last year
- ☆106Updated last week
- Sotopia-RL: Reward Design for Social Intelligence☆45Updated 4 months ago
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Updated last year
- B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners☆86Updated 6 months ago
- This repository contains the code and data for the paper "VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception o…☆27Updated 5 months ago
- A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models☆27Updated last year
- Scalable Meta-Evaluation of LLMs as Evaluators☆43Updated last year
- Process Reward Models That Think☆64Updated 3 weeks ago
- ☆35Updated 7 months ago
- [NeurIPS 2025 Spotlight] Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning☆143Updated 3 months ago
- ☆19Updated 9 months ago
- Codebase for Instruction Following without Instruction Tuning☆36Updated last year
- ☆85Updated last month
- Suri: Multi-constraint instruction following for long-form text generation (EMNLP’24)☆27Updated 2 months ago
- Verifiers for LLM Reinforcement Learning☆80Updated 8 months ago
- Systematic evaluation framework that automatically rates overthinking behavior in large language models.☆94Updated 7 months ago
- [EMNLP'24] LongHeads: Multi-Head Attention is Secretly a Long Context Processor☆31Updated last year
- Code for "Variational Reasoning for Language Models"☆52Updated 2 months ago
- ☆49Updated 8 months ago
- This is the implementation for the paper "LARGE LANGUAGE MODEL CASCADES WITH MIX- TURE OF THOUGHT REPRESENTATIONS FOR COST- EFFICIENT REA…☆28Updated last year
- Official Repo for SwS: A Weakness-driven Problem Synthesis Framework in RL for LLM Reasoning☆40Updated last month
- WideSearch: Benchmarking Agentic Broad Info-Seeking☆106Updated 2 months ago
- Improving Language Understanding from Screenshots. Paper: https://arxiv.org/abs/2402.14073☆31Updated last year
- Official implementation of Self-Taught Agentic Long Context Understanding (ACL 2025).☆12Updated 2 months ago