consciousness-lab / ctm-aiLinks
A platform to develop CTM-motivated AI architecture.
☆15Updated 3 weeks ago
Alternatives and similar repositories for ctm-ai
Users that are interested in ctm-ai are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2025] Implementation for the paper "The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning"☆138Updated last month
- [ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization☆93Updated last year
- ☆57Updated 5 months ago
- ☆55Updated 6 months ago
- RM-R1: Unleashing the Reasoning Potential of Reward Models☆154Updated 5 months ago
- ☆27Updated 8 months ago
- Code accompanying the paper "Noise Contrastive Alignment of Language Models with Explicit Rewards" (NeurIPS 2024)☆58Updated last year
- A Sober Look at Language Model Reasoning☆89Updated last month
- [ICLR 25 Oral] RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style☆72Updated 5 months ago
- A Survey of Direct Preference Optimization (DPO)☆86Updated 5 months ago
- Repo of paper "Free Process Rewards without Process Labels"☆168Updated 9 months ago
- Code for the ICML 2024 paper "Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment"☆78Updated 6 months ago
- [ICML'25] Our study systematically investigates massive values in LLMs' attention mechanisms. First, we observe massive values are concen…☆86Updated 6 months ago
- AnchorAttention: Improved attention for LLMs long-context training☆213Updated 11 months ago
- Official repository for paper "DeepCritic: Deliberate Critique with Large Language Models"☆40Updated 5 months ago
- [NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct☆192Updated 11 months ago
- Listing some diffusion papers in NLP domain I have read, text generation is main, table will continue to be updated.☆69Updated 8 months ago
- Official PyTorch implementation for "Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data" (ICLR…☆72Updated 6 months ago
- Principled Data Selection for Alignment: The Hidden Risks of Difficult Examples☆44Updated 5 months ago
- A repo for open research on building large reasoning models☆121Updated last week
- Auto get diffusion nlp papers in Axriv. More papers Information can be found in another repository "Diffusion-LM-Papers".☆244Updated this week
- OpenReivew Submission Visualization (ICLR 2024/2025)☆153Updated last year
- Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied wit…☆147Updated last year
- ☆217Updated 8 months ago
- [2025-TMLR] A Survey on the Honesty of Large Language Models☆63Updated last year
- Laser: Learn to Reason Efficiently with Adaptive Length-based Reward Shaping☆61Updated 6 months ago
- This my attempt to create Self-Correcting-LLM based on the paper Training Language Models to Self-Correct via Reinforcement Learning by g…☆37Updated 5 months ago
- [NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"☆188Updated 9 months ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆134Updated 9 months ago
- ☆50Updated 2 months ago