Zayne-sprague / To-CoT-or-not-to-CoTView external linksLinks
☆25Apr 10, 2025Updated 10 months ago
Alternatives and similar repositories for To-CoT-or-not-to-CoT
Users that are interested in To-CoT-or-not-to-CoT are comparing it to the libraries listed below
Sorting:
- ☆29Nov 9, 2025Updated 3 months ago
- FamilyTool benchmark☆12Sep 10, 2025Updated 5 months ago
- Learning to Skip the Middle Layers of Transformers☆17Aug 7, 2025Updated 6 months ago
- ReaRAG: Knowledge-guided Reasoning Enhances Factuality of Large Reasoning Models with Iterative Retrieval Augmented Generation☆25Aug 24, 2025Updated 5 months ago
- [NAACL'25 🏆 SAC Award] Official code for "Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expert…☆14Feb 4, 2025Updated last year
- ☆56Aug 10, 2024Updated last year
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment☆16Dec 19, 2024Updated last year
- [ICML 2025] Official implementation of the paper "SkipGPT: Dynamic Layer Pruning Reinvented with Token Awareness and Module Decoupling". …☆19Nov 17, 2025Updated 3 months ago
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders☆18May 23, 2025Updated 8 months ago
- ☆26Jun 5, 2025Updated 8 months ago
- CS194-196 Course Project☆14Feb 20, 2025Updated 11 months ago
- The Unreliability of Explanations in Few-shot Prompting for Textual Reasoning (NeurIPS 2022)☆16Feb 11, 2023Updated 3 years ago
- Code for "A Data-Centric Approach To Generate Faithful and High Quality Patient Summaries with Large Language Models"☆17Jul 20, 2025Updated 6 months ago
- ☆21Oct 25, 2024Updated last year
- Detecting Hallucinations in Large Language Model Generation: A Token Probability Approach. This repository includes the implementation of…☆16Jun 1, 2024Updated last year
- The official implementation of Preference Data Reward-Augmentation.☆18May 1, 2025Updated 9 months ago
- This is the official implementation of the paper “Griffin: Towards a Graph-Centric Relational Database Foundation Model.”☆31Sep 25, 2025Updated 4 months ago
- ☆33Jul 9, 2025Updated 7 months ago
- Algorithms for approximate attention in LLMs☆21Apr 14, 2025Updated 10 months ago
- [ICLR 2026] RewardMap: Tackling Sparse Rewards in Fine-grained Visual Reasoning via Multi-Stage Reinforcement Learning☆38Feb 3, 2026Updated 2 weeks ago
- ☆16Jul 23, 2024Updated last year
- Official implementation of Scaling Laws in Patchification: An Image Is Worth 50,176 Tokens And More☆24Feb 25, 2025Updated 11 months ago
- ☆17Dec 21, 2023Updated 2 years ago
- [ACL 2025 Findings] Official implementation of the paper "Unveiling the Key Factors for Distilling Chain-of-Thought Reasoning".☆20Feb 26, 2025Updated 11 months ago
- Code repo for "Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning"☆32Jul 25, 2025Updated 6 months ago
- The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity".☆30Nov 12, 2024Updated last year
- MedAgentBoard: Benchmarking Multi-Agent Collaboration with Conventional Methods for Diverse Medical Tasks☆46Oct 5, 2025Updated 4 months ago
- ☆21Jul 25, 2025Updated 6 months ago
- ☆52Feb 12, 2025Updated last year
- Kinetics: Rethinking Test-Time Scaling Laws☆86Jul 11, 2025Updated 7 months ago
- ☆23Jul 5, 2024Updated last year
- ☆46Jun 24, 2025Updated 7 months ago
- Resa: Transparent Reasoning Models via SAEs☆47Sep 23, 2025Updated 4 months ago
- [arxiv: 2512.19673] Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies☆59Feb 6, 2026Updated last week
- Code and Configs for Asynchronous RLHF: Faster and More Efficient RL for Language Models☆68Apr 26, 2025Updated 9 months ago
- FID computation in Jax/Flax.☆29Jul 17, 2024Updated last year
- A Text2SQL benchmark for evaluation of Large Language Models☆41Feb 8, 2026Updated last week
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆12Jun 28, 2025Updated 7 months ago
- a website for accessing many models through api(deepseek、Qwen、Hunyuan etc.)☆17Jul 12, 2025Updated 7 months ago