sail-sg / GDPO
Graph Diffusion Policy Optimization
☆25Updated 8 months ago
Related projects ⓘ
Alternatives and complementary repositories for GDPO
- ☆15Updated last week
- Official Code for paper "Towards Efficient and Effective Unlearning of Large Language Models for Recommendation" (Frontiers of Computer S…☆34Updated 4 months ago
- "Exploring Concept Depth: How Large Language Models Acquire Knowledge at Different Layers?"☆58Updated last month
- Intriguing Properties of Data Attribution on Diffusion Models (ICLR 2024)☆23Updated 9 months ago
- The official implementation of Self-Exploring Language Models (SELM)☆55Updated 5 months ago
- [SafeGenAi @ NeurIPS 2024] Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates☆60Updated 3 weeks ago
- ☆90Updated 4 months ago
- [ATTRIB @ NeurIPS 2024] When Attention Sink Emerges in Language Models: An Empirical View☆29Updated last month
- [IJCAI 2023] Black-box Prompt Tuning for Vision-Language Model as a Service☆15Updated last year
- What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective☆40Updated 3 weeks ago
- Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language Models☆70Updated 2 months ago
- [ICLR24 (Spotlight)] "SalUn: Empowering Machine Unlearning via Gradient-based Weight Saliency in Both Image Classification and Generation…☆101Updated 2 weeks ago
- Implementation of PaCE: Parsimonious Concept Engineering for Large Language Models (NeurIPS 2024)☆26Updated 2 weeks ago
- ☆78Updated last week
- [NeurIPS 2024] A task generation and model evaluation system for multimodal language models.☆56Updated last month
- ☆16Updated 3 months ago
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models☆103Updated 6 months ago
- The code of RouterDC☆33Updated last month
- This is the official implementation for the paper: Ferret: Federated Full-Parameter Tuning at Scale for Large Language Models☆10Updated 2 months ago
- AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.☆53Updated 3 weeks ago
- [NeurIPS 2024] GTBench: Uncovering the Strategic Reasoning Limitations of LLMs via Game-Theoretic Evaluations☆50Updated 2 months ago
- [ICML 2024 Oral] A framework for society simulation that supports complex simulation, for example: multi-scene.☆52Updated 3 months ago
- Code for the ICML 2024 paper "Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment"☆48Updated 2 weeks ago
- Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models."☆33Updated last month
- Official Pytorch implementation of "Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations"☆34Updated this week
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆63Updated last month
- ☆16Updated last month
- [Arxiv 2024] Adversarial attacks on multimodal agents☆39Updated 4 months ago
- Official implementation of MAIA, A Multimodal Automated Interpretability Agent☆62Updated 3 months ago
- [ICML 2024 Oral] Official code repository for MLLM-as-a-Judge.☆55Updated 3 months ago