sail-sg / GDPO
Graph Diffusion Policy Optimization
☆34Updated last year
Alternatives and similar repositories for GDPO:
Users that are interested in GDPO are comparing it to the libraries listed below
- Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"☆39Updated this week
- Code for the tutorial/review paper for RL-based-fine-tuniing. In this code, we especially focus on the design of biological sequences li…☆125Updated 7 months ago
- [NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"☆144Updated last month
- ☆59Updated 10 months ago
- A general framework for inference-time scaling and steering of diffusion models with arbitrary rewards.☆124Updated 2 months ago
- SEIKO is a novel reinforcement learning method to efficiently fine-tune diffusion models in an online setting. Our methods outperform all…☆21Updated 9 months ago
- OpenReivew Submission Visualization (ICLR 2024/2025)☆152Updated 6 months ago
- ☆18Updated 3 weeks ago
- ☆40Updated 2 months ago
- The code of RouterDC☆57Updated this week
- [ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight)☆64Updated 6 months ago
- [NAACL 25 main] Awesome LLM Causal Reasoning is a collection of LLM-based casual reasoning works, including papers, codes and datasets.☆53Updated last month
- What Makes a Reward Model a Good Teacher? An Optimization Perspective☆22Updated last week
- [ICML 2024 Oral] Official code repository for MLLM-as-a-Judge.☆67Updated 2 months ago
- [ICML 2024] Code for the paper "Confronting Reward Overoptimization for Diffusion Models: A Perspective of Inductive and Primacy Biases"☆35Updated 9 months ago
- Code repository for Trajectory Flow Matching☆60Updated 5 months ago
- Auto get diffusion nlp papers in Axriv. More papers Information can be found in another repository "Diffusion-LM-Papers".☆110Updated this week
- ☆54Updated 5 months ago
- Auto Interpretation Pipeline and many other functionalities for Multimodal SAE Analysis.☆126Updated 2 months ago
- Official source code for "Graph Neural Networks for Learning Equivariant Representations of Neural Networks". In ICLR 2024 (oral).☆79Updated 8 months ago
- [ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models☆154Updated last month
- ☆31Updated 3 months ago
- ☆16Updated last week
- Improving Your Model Ranking on Chatbot Arena by Vote Rigging☆20Updated last month
- [NeurIPS 2024] Official implementation for paper "Can Graph Learning Improve Planning in LLM-based Agents?"☆120Updated 4 months ago
- It is a comprehensive resource hub compiling all LLM papers accepted at the International Conference on Learning Representations (ICLR) i…☆57Updated last year
- Official Code for Local Search GFlowNets (ICLR 2024 Spotlight)☆17Updated last month
- ☆144Updated 7 months ago
- SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator☆69Updated 3 months ago
- ☆52Updated 5 months ago