BlueWhaleLab / DCScore
☆11 Updated last month
Alternatives and similar repositories for DCScore
Users who are interested in DCScore are comparing it to the repositories listed below
- [EMNLP 2024 Findings] ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs ☆27 Updated last month
- MathFusion: Enhancing Mathematical Problem-solving of LLM through Instruction Fusion (ACL 2025) ☆27 Updated this week
- Code for paper "W-RAG: Weakly Supervised Dense Retrieval in RAG for Open-domain Question Answering" ☆13 Updated 2 months ago
- Aioli: A unified optimization framework for language model data mixing ☆27 Updated 6 months ago
- An official implementation of "Catastrophic Failure of LLM Unlearning via Quantization" (ICLR 2025) ☆27 Updated 4 months ago
- Official implementation of ICML 2025 paper "Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment" (https:… ☆25 Updated 2 months ago
- Code for "Merging Text Transformers from Different Initializations" ☆20 Updated 5 months ago
- ☆18 Updated 4 months ago
- A trainable user simulator ☆34 Updated 2 weeks ago
- PyTorch codes for the paper "An Empirical Study of Multimodal Model Merging" ☆37 Updated last year
- [TMLR 2024] Official implementation of "Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics" ☆20 Updated last year
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch ☆30 Updated 2 weeks ago
- The source code for running LLMs on the AAAR-1.0 benchmark. ☆16 Updated 3 months ago
- [NAACL 2025] Source code for MMEvalPro, a more trustworthy and efficient benchmark for evaluating LMMs ☆24 Updated 9 months ago
- Code for paper: "LASeR: Learning to Adaptively Select Reward Models with Multi-Arm Bandits" ☆13 Updated 9 months ago
- Tasks for describing differences between text distributions. ☆16 Updated 11 months ago
- Applies ROME and MEMIT on Mamba-S4 models ☆14 Updated last year
- This is the official repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data" ☆17 Updated last year
- ☆55 Updated last year
- Official implementation for "Law of the Weakest Link: Cross capabilities of Large Language Models" ☆42 Updated 9 months ago
- This repository contains the code and data for the paper "VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception o… ☆23 Updated last week
- [ACL 2024] Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language Models ☆21 Updated last year
- ☆15 Updated 4 months ago
- Minimum Description Length probing for neural network representations ☆18 Updated 5 months ago
- Code for "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing" ☆16 Updated 3 months ago
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts" ☆15 Updated 4 months ago
- ☆19 Updated 4 months ago
- A testbed for agents and environments that can automatically improve models through data generation. ☆24 Updated 4 months ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models… ☆35 Updated last year
- implementation of dualformer ☆18 Updated 4 months ago