MaHuanAAA / logtoku
☆35 · Updated 5 months ago
Alternatives and similar repositories for logtoku
Users who are interested in logtoku are comparing it to the repositories listed below.
- ☆29 · Updated last year
- [NeurIPS 2024] "Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales?" ☆38 · Updated 6 months ago
- [ACL 2024] Shifting Attention to Relevance: Towards the Predictive Uncertainty Quantification of Free-Form Large Language Models ☆60 · Updated last year
- source code for NeurIPS'24 paper "HaloScope: Harnessing Unlabeled LLM Generations for Hallucination Detection" ☆64 · Updated 9 months ago
- AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024. ☆100 · Updated last year
- Representation Surgery for Multi-Task Model Merging. ICML, 2024. ☆47 · Updated last year
- [NeurIPS25 Spotlight] EMPO, A Fully Unsupervised RLVR Method ☆92 · Updated 2 months ago
- Analyzing and Reducing Catastrophic Forgetting in Parameter Efficient Tuning ☆36 · Updated last year
- [ICML2025] Test-Time Learning for Large Language Models ☆39 · Updated 5 months ago
- [ACL'25] Mosaic-IT: Cost-Free Compositional Data Synthesis for Instruction Tuning ☆20 · Updated 4 months ago
- code repo for ICLR 2024 paper "Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs" ☆143 · Updated last year
- Official code for the paper: DRA-GRPO: Exploring Diversity-Aware Reward Adjustment for R1-Zero-Like Training of Large Language Models ☆21 · Updated 3 weeks ago
- Code for paper "Merging Multi-Task Models via Weight-Ensembling Mixture of Experts" ☆30 · Updated last year
- [NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging ☆76 · Updated 11 months ago
- A Sober Look at Language Model Reasoning ☆92 · Updated 2 months ago
- Principled Data Selection for Alignment: The Hidden Risks of Difficult Examples ☆44 · Updated 6 months ago
- ☆25 · Updated 9 months ago
- Official repository for Beyond Binary Rewards: Training LMs to Reason about Their Uncertainty ☆50 · Updated 5 months ago
- [ICLR 2025] Code&Data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization" ☆13 · Updated last year
- ☆141 · Updated 10 months ago
- [ICML 2024 Oral] Official code repository for MLLM-as-a-Judge. ☆89 · Updated 11 months ago
- [ICLR 2025] Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention Causality ☆60 · Updated 6 months ago
- Accepted LLM Papers in NeurIPS 2024 ☆37 · Updated last year
- This repo contains the source code for reproducing the experimental results in the semantic density paper (NeurIPS 2024) ☆18 · Updated 4 months ago
- Code for paper "Parameter Efficient Multi-task Model Fusion with Partial Linearization" ☆24 · Updated last year
- [NeurIPS 2024] Code and Data Repo for Paper "Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning" ☆27 · Updated last year
- [ICML'25] Our study systematically investigates massive values in LLMs' attention mechanisms. First, we observe massive values are concen… ☆85 · Updated 7 months ago
- [ICML 2025] "From Passive to Active Reasoning: Can Large Language Models Ask the Right Questions under Incomplete Information?" ☆49 · Updated 3 months ago
- FusionBench: A Comprehensive Benchmark/Toolkit of Deep Model Fusion ☆203 · Updated this week
- Code for Reducing Hallucinations in Vision-Language Models via Latent Space Steering ☆102 · Updated last year