sail-sg / FlowReasoner
☆63Updated this week
Alternatives and similar repositories for FlowReasoner:
Users that are interested in FlowReasoner are comparing it to the libraries listed below
- [COLING'25] Exploring Concept Depth: How Large Language Models Acquire Knowledge at Different Layers?☆72Updated 3 months ago
- What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective☆63Updated last month
- Knowledge Unlearning for Large Language Models☆25Updated 3 weeks ago
- Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples☆84Updated last month
- ☆40Updated 5 months ago
- Official repository for paper: O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning☆67Updated 2 months ago
- [NeurIPS 2024] Knowledge Circuits in Pretrained Transformers☆141Updated 2 months ago
- The code of RouterDC☆57Updated last week
- An official implementation of "Catastrophic Failure of LLM Unlearning via Quantization" (ICLR 2025)☆26Updated 2 months ago
- Repo for "Z1: Efficient Test-time Scaling with Code"☆53Updated last week
- ☆107Updated 3 months ago
- Unofficial Implementation of Chain-of-Thought Reasoning Without Prompting☆32Updated last year
- In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)☆57Updated last year
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆115Updated last month
- ☆46Updated 2 months ago
- Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling☆101Updated 3 months ago
- [ICLR 2025] Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates (Oral)☆77Updated 6 months ago
- Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆86Updated last month
- DSBench: How Far are Data Science Agents from Becoming Data Science Experts?☆50Updated 2 months ago
- The code implementation of Symbolic-MoE☆27Updated last month
- Improving Your Model Ranking on Chatbot Arena by Vote Rigging☆20Updated 2 months ago
- Code for "A Sober Look at Progress in Language Model Reasoning" paper☆36Updated last week
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆93Updated 6 months ago
- Critique-out-Loud Reward Models☆59Updated 6 months ago
- [ICML 2024] One Prompt is Not Enough: Automated Construction of a Mixture-of-Expert Prompts - TurningPoint AI☆21Updated 7 months ago
- ☆22Updated last week
- ☆50Updated 2 weeks ago
- ☆20Updated 2 months ago
- [ACL'24] Chain of Thought (CoT) is significant in improving the reasoning abilities of large language models (LLMs). However, the correla…☆45Updated 2 months ago
- Scalable Meta-Evaluation of LLMs as Evaluators☆42Updated last year