[ICLR 2025] Language Imbalance Driven Rewarding for Multilingual Self-improving
☆24Aug 25, 2025Updated 6 months ago
Alternatives and similar repositories for Language-Imbalance-Driven-Rewarding
Users that are interested in Language-Imbalance-Driven-Rewarding are comparing it to the libraries listed below
Sorting:
- The source code of [WWW 2025] MoDiCF☆12Jul 12, 2025Updated 7 months ago
- Marathon: A Multiple-choice Long Context Evaluation Benchmark for Large Language Models.☆10May 16, 2024Updated last year
- [NeurIPS25] RULE: Reinforcement UnLEarning Achieves Forge-retain Pareto Optimality☆19Oct 22, 2025Updated 4 months ago
- (ACL 2025) 🔥🔥🔥Code for "Empowering Multimodal Large Language Models with Evol-Instruct"☆20May 15, 2025Updated 9 months ago
- Description for MV-MATH☆15Jul 20, 2025Updated 7 months ago
- Kernel Herding for probability density estimation☆14Feb 23, 2016Updated 10 years ago
- Code for "A Data-Centric Approach To Generate Faithful and High Quality Patient Summaries with Large Language Models"☆17Jul 20, 2025Updated 7 months ago
- DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails☆31Feb 26, 2025Updated last year
- [ICLR 2026] Adaptive Social Learning via Mode Policy Optimization for Language Agents☆48Feb 2, 2026Updated last month
- ☆14Apr 21, 2023Updated 2 years ago
- An evaluation suite for Retrieval-Augmented Generation (RAG).☆23Apr 26, 2025Updated 10 months ago
- [COLING 2024 (Oral)] PromISe:Releasing the Capabilities of LLMs with Prompt Introspective Search☆23Aug 26, 2024Updated last year
- [EMNLP 2025] TokenSkip: Controllable Chain-of-Thought Compression in LLMs☆202Nov 30, 2025Updated 3 months ago
- ☆18May 5, 2021Updated 4 years ago
- [EMNLP'24 (Main)] DRPO(Dynamic Rewarding with Prompt Optimization) is a tuning-free approach for self-alignment. DRPO leverages a search-…☆24Nov 17, 2024Updated last year
- [ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Models☆60Jul 23, 2024Updated last year
- A better Alpaca Model Trained with Less Data (only 9k instructions of the original set)☆24Jul 26, 2024Updated last year
- ☆26May 29, 2022Updated 3 years ago
- official implementation of ICLR'2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and…☆71Apr 2, 2025Updated 11 months ago
- OpenS2S : Advancing Fully Open-Source End-to-End Empathetic Large Speech Language Model☆112Jul 17, 2025Updated 7 months ago
- MICCAI 2024 code for the paper: EchoNet-Synthetic: Privacy-preserving Video Generation for Safe Medical Data Sharing. EchoNet-Synthetic i…☆36Jun 16, 2025Updated 8 months ago
- [WSDM 2025] Source code for "Spectrum-based Modality Representation Fusion Graph Convolutional Network for Multimodal Recommendation".☆36Dec 22, 2024Updated last year
- 第二届“泰迪杯”数据分析职业技能大赛A题☆10Sep 15, 2020Updated 5 years ago
- TadGAN for T.F 2.0☆29Mar 22, 2022Updated 3 years ago
- Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models☆41Sep 30, 2024Updated last year
- [COLM 2025] SEAL: Steerable Reasoning Calibration of Large Language Models for Free☆54Apr 6, 2025Updated 10 months ago
- Repository of IPBench☆19Jan 4, 2026Updated 2 months ago
- [ICLR 2025] Breaking Mental Set to Improve Reasoning through Diverse Multi-Agent Debate☆17Apr 22, 2025Updated 10 months ago
- ☆31Mar 24, 2023Updated 2 years ago
- 这是我的博客《不用框架,使用Python搭建基于numpy的卷积神经网络来进行cifar-10分类的深度学习系统》的代码实现。☆10Jul 1, 2019Updated 6 years ago
- Classification of human emotion using multi-modal models☆12Jun 27, 2020Updated 5 years ago
- 第八届“泰迪杯”数据挖掘挑战赛的一点心得☆10Nov 26, 2020Updated 5 years ago
- The implement of ACL2024: "MAPO: Advancing Multilingual Reasoning through Multilingual Alignment-as-Preference Optimization"☆43Jun 15, 2024Updated last year
- ☆13Jan 14, 2026Updated last month
- Source code related to the research paper entitled RVENet: A Large Echocardiographic Dataset for the Deep Learning-Based Assessment of Ri…☆12Mar 10, 2024Updated last year
- CDbw Index For Cluster Validation☆10Mar 26, 2019Updated 6 years ago
- [NeurIPS 2024] How do Large Language Models Handle Multilingualism?☆51Nov 8, 2024Updated last year
- Evaluation Pipeline for medical tasks.☆12Feb 13, 2026Updated 2 weeks ago
- This is the official repo for GraphRAG-Bench: Challenging Domain-Specific Reasoning for Evaluating Graph Retrieval-Augmented Generation☆66Aug 1, 2025Updated 7 months ago