fangyuan-ksgk / CoT-Reasoning-without-Prompting
Unofficial Implementation of Chain-of-Thought Reasoning Without Prompting
โ32Updated last year
Alternatives and similar repositories for CoT-Reasoning-without-Prompting:
Users that are interested in CoT-Reasoning-without-Prompting are comparing it to the libraries listed below
- [๐๐๐๐๐ ๐ ๐ข๐ง๐๐ข๐ง๐ ๐ฌ ๐๐๐๐ & ๐๐๐ ๐๐๐๐ ๐๐๐๐๐ ๐๐ซ๐๐ฅ] ๐๐ฏ๐ฉ๐ข๐ฏ๐ค๐ช๐ฏ๐จ ๐๐ข๐ต๐ฉ๐ฆ๐ฎ๐ข๐ต๐ช๐ค๐ข๐ญ ๐๐ฆ๐ข๐ด๐ฐ๐ฏ๐ช๐ฏโฆโ49Updated 11 months ago
- โ59Updated 7 months ago
- โ44Updated 5 months ago
- Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"โ74Updated 10 months ago
- What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspectiveโ63Updated last month
- โ45Updated 2 months ago
- [AAAI 2025 oral] Evaluating Mathematical Reasoning Beyond Accuracyโ60Updated 4 months ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.โ112Updated 3 weeks ago
- [ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationalesโ81Updated 2 months ago
- We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLMs.โ61Updated 5 months ago
- Advancing Language Model Reasoning through Reinforcement Learning and Inference Scalingโ101Updated 2 months ago
- Code for Paper: Teaching Language Models to Critique via Reinforcement Learningโ90Updated this week
- Improving Language Understanding from Screenshots. Paper: https://arxiv.org/abs/2402.14073โ28Updated 9 months ago
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"โ38Updated last year
- [NeurIPS 2024 Spotlight] Code and data for the paper "Finding Transformer Circuits with Edge Pruning".โ48Updated last month
- โ21Updated 10 months ago
- Repository for the paper: 500xCompressor: Generalized Prompt Compression for Large Language Modelsโ33Updated 8 months ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modelingโ47Updated 3 months ago
- Large Language Models Can Self-Improve in Long-context Reasoningโ68Updated 4 months ago
- Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement (EMNLP 2024 Main Conference)โ57Updated 5 months ago
- โ91Updated last month
- โ50Updated last week
- [ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correctionโ66Updated 3 weeks ago
- SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Model https://arxiv.org/pdf/2411.02433โ25Updated 4 months ago
- A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Modelsโ46Updated last month
- This is the official implementation of the paper "SยฒR: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning"โ58Updated last month
- The official implementation of paper "Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language Models as Agenโฆโ26Updated last year
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024)โ107Updated last year
- [NeurIPS 2024] OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AIโ99Updated last month
- In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)โ57Updated last year