GuoqingWang1 / IGPOView external linksLinks
☆32Feb 1, 2026Updated 2 weeks ago
Alternatives and similar repositories for IGPO
Users that are interested in IGPO are comparing it to the libraries listed below
Sorting:
- [ACL 2025 (Findings)] DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling☆21Dec 16, 2024Updated last year
- [ICLR 2026] Adaptive Social Learning via Mode Policy Optimization for Language Agents☆48Feb 2, 2026Updated last week
- The code and datasets of our ACM MM 2024 paper "Hallu-PI: Evaluating Hallucination in Multi-modal Large Language Models within Perturbed …☆11Sep 27, 2024Updated last year
- [SIGGRAPH Asia 2025] CHARM: Control-point-based 3D Anime Hairstyle Auto-Regressive Modeling☆42Sep 26, 2025Updated 4 months ago
- 🕵 Code for our EMNLP 2025 Main paper: "FlashAdventure: A Benchmark for GUI Agents Solving Full Story Arcs in Diverse Adventure Games"☆24Dec 14, 2025Updated 2 months ago
- ☆26Aug 21, 2025Updated 5 months ago
- A unified robotic manipulation learning framework☆21Sep 4, 2025Updated 5 months ago
- [ICLR 2025] Large (Vision) Language Models are Unsupervised In-Context Learners☆22Jun 6, 2025Updated 8 months ago
- A Comprehensive Dataset for Advanced Image Generation and Editing}☆31Oct 2, 2025Updated 4 months ago
- Audio Jailbreak: An Open Comprehensive Benchmark for Jailbreaking Large Audio-Language Models☆28Oct 6, 2025Updated 4 months ago
- [EMNLP 2025] Code for paper "Table-R1: Inference-Time Scaling for Table Reasoning"☆29Jun 3, 2025Updated 8 months ago
- ☆21Feb 15, 2024Updated 2 years ago
- Code, Data and Model for Paper "Learning from Peers in Reasoning Models"☆27May 13, 2025Updated 9 months ago
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".☆26Aug 9, 2025Updated 6 months ago
- MetaAgent: Toward Self-Evolving Agent via Tool Meta-Learning☆42Sep 3, 2025Updated 5 months ago
- Reproducible Language Agent Research☆33Jun 25, 2025Updated 7 months ago
- Unleashing the Power of Cognitive Dynamics on Large Language Models☆63Sep 24, 2024Updated last year
- A Text2SQL benchmark for evaluation of Large Language Models☆41Feb 8, 2026Updated last week
- [EMNLP 2025] Verification Engineering for RL in Instruction Following☆50Jan 5, 2026Updated last month
- [ICCV 2025] MRGen: Segmentation Data Engine for Underrepresented MRI Modalities☆38Sep 26, 2025Updated 4 months ago
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆13Jun 28, 2025Updated 7 months ago
- Self-Knowledge Guided Retrieval Augmentation for Large Language Models (EMNLP Findings 2023)☆28Dec 8, 2023Updated 2 years ago
- Official Code Release for "Training a Generally Curious Agent"☆44May 18, 2025Updated 8 months ago
- [ICCV 2025] Dynamic-VLM☆28Dec 16, 2024Updated last year
- The Code and Script of "David's Slingshot: A Strategic Coordination Framework of Small LLMs Matches Large LLMs in Data Synthesis"☆35Jun 13, 2025Updated 8 months ago
- Official PyTorch implementation for "MMS-LLaMA: Efficient LLM-based Audio-Visual Speech Recognition with Minimal Multimodal Speech Tokens…☆45Jun 12, 2025Updated 8 months ago
- [NeurIPS ENLSP Workshop'24] CSKV: Training-Efficient Channel Shrinking for KV Cache in Long-Context Scenarios☆16Oct 18, 2024Updated last year
- Sotopia-RL: Reward Design for Social Intelligence☆46Jan 29, 2026Updated 2 weeks ago
- ☆18Jun 10, 2025Updated 8 months ago
- (ACL-2025 main conference) Dolphin: Moving Towards Closed-loop Auto-research through Thinking, Practice, and Feedback☆38Jun 24, 2025Updated 7 months ago
- [ACL 2025] Exploring Compositional Generalization of Multimodal LLMs for Medical Imaging☆38Jun 4, 2025Updated 8 months ago
- Large-scale semi-supervised framework with 1B+ labeled masks from 48K+ datasets with test-time adaptation to new domains (ICCV25).☆43Dec 28, 2025Updated last month
- [ACL 2025 Main] Official Repository for "Evaluating Language Models as Synthetic Data Generators"☆40Dec 13, 2024Updated last year
- Symphony — A decentralized multi-agent framework that enables intelligent agents to collaborate seamlessly across heterogeneous edge devi…☆30Oct 30, 2025Updated 3 months ago
- papers about recommender system.☆10May 18, 2021Updated 4 years ago
- The official implement of paper 《DaMo: Data Mixing Optimizer in Fine-tuning Multimodal LLMs for Mobile Phone Agents》☆28Oct 23, 2025Updated 3 months ago
- ☆26Feb 6, 2026Updated last week
- ☆11Jun 22, 2025Updated 7 months ago
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year