consciousness-lab / ctm-ai
A platform to develop CTM-motivated AI architecture.
☆12Updated this week
Alternatives and similar repositories for ctm-ai:
Users that are interested in ctm-ai are comparing it to the libraries listed below
- Code for the ICML 2024 paper "Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment"☆62Updated 3 months ago
- [NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"☆138Updated 3 weeks ago
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision☆119Updated 6 months ago
- Code for "Towards Revealing the Mystery behind Chain of Thought: a Theoretical Perspective"☆19Updated last year
- Code accompanying the paper "Noise Contrastive Alignment of Language Models with Explicit Rewards" (NeurIPS 2024)☆51Updated 4 months ago
- [ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models☆120Updated 2 weeks ago
- Reference implementation for Token-level Direct Preference Optimization(TDPO)☆130Updated last month
- Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024)☆61Updated 10 months ago
- [NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct☆167Updated 2 months ago
- Auto get diffusion nlp papers in Axriv. More papers Information can be found in another repository "Diffusion-LM-Papers".☆106Updated this week
- ☆129Updated this week
- A Survey on the Honesty of Large Language Models☆56Updated 3 months ago
- Lightweight Adapting for Black-Box Large Language Models☆22Updated last year
- Repo of paper "Free Process Rewards without Process Labels"☆138Updated 2 weeks ago
- Official PyTorch implementation for "Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data"☆35Updated last month
- Official PyTorch implementation for ICLR2025 paper "Scaling up Masked Diffusion Models on Text"☆147Updated 3 months ago
- ☆61Updated 4 months ago
- Official Repository for The Paper: Safety Alignment Should Be Made More Than Just a Few Tokens Deep☆83Updated 8 months ago
- In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)☆57Updated last year
- [ICLR 25 Oral] RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style☆28Updated this week
- Implementation of the MATRIX framework (ICML 2024)☆48Updated 10 months ago
- GenRM-CoT: Data release for verification rationales☆53Updated 5 months ago
- The official code repository for PRMBench.☆68Updated last month
- [ICLR2025 Spotlight] Agent Trajectory Synthesis via Guiding Replay with Web Tutorials☆27Updated last month
- AnchorAttention: Improved attention for LLMs long-context training☆206Updated 2 months ago
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".☆52Updated 4 months ago
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.☆67Updated last week
- The code for paper Interpreting Key Mechanisms of Factual Recall in Transformer-Based Language Models.☆13Updated 11 months ago
- ☆25Updated 10 months ago
- The repository of the project "Fine-tuning Large Language Models with Sequential Instructions", code base comes from open-instruct and LA…☆29Updated 4 months ago