DAMO-NLP-SG / LLM-Multilingual-Knowledge-Boundaries
[ACL 2025] Analyzing LLMs' Multilingual Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations
☆13, updated 3 months ago
Alternatives and similar repositories for LLM-Multilingual-Knowledge-Boundaries
Users who are interested in LLM-Multilingual-Knowledge-Boundaries are comparing it to the repositories listed below.
- [EMNLP 2024 Findings] ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs ☆29, updated 4 months ago
- Code for "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing" ☆18, updated 5 months ago
- [ACL 2025] Are Your LLMs Capable of Stable Reasoning? ☆30, updated last month
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment ☆16, updated 9 months ago
- ☆18, updated 2 months ago
- Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025 ☆31, updated 4 months ago
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains ☆48, updated 4 months ago
- ☆45, updated last week
- Codebase for Instruction Following without Instruction Tuning ☆35, updated last year
- The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions". ☆13, updated 3 weeks ago
- Exploration of automated dataset selection approaches at large scales. ☆47, updated 6 months ago
- Sotopia-RL: Reward Design for Social Intelligence ☆39, updated last month
- Official Implementation of our paper "THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning". ☆23, updated last week
- ☆14, updated last year
- Control LLM ☆19, updated 5 months ago
- ☆20, updated last year
- ☆19, updated 6 months ago
- [EMNLP 2024] A Peek into Token Bias: Large Language Models Are Not Yet Genuine Reasoners ☆25, updated 9 months ago
- ☆23, updated last month
- ☆16, updated last year
- ☆22, updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆60, updated last year
- ☆16, updated 3 months ago
- Aioli: A unified optimization framework for language model data mixing ☆27, updated 8 months ago
- SSRL: Self-Search Reinforcement Learning ☆144, updated last month
- A holistic benchmark for LLM abstention ☆52, updated 3 weeks ago
- ☆27, updated last year
- JudgeLRM: Large Reasoning Models as a Judge ☆38, updated last week
- A Recipe for Building LLM Reasoners to Solve Complex Instructions ☆22, updated last month
- ACL24 ☆10, updated last year