ShilinSun / mxai_reviewLinks

☆10

Alternatives and similar repositories for mxai_review

Users that are interested in mxai_review are comparing it to the libraries listed below

Sorting:

wangxu0820 / NegativePrompt
The official GitHub page for paper "NegativePrompt: Leveraging Psychology for Large Language Models Enhancement via Negative Emotional St…
☆22Updated last year
Zhiyuan-Li-John / MuCR
MuCR is a benchmark designed to evaluate Multimodal Large Language Models' (MLLMs) ability to discern causal links across modalities
☆15Updated last month
luka-group / vlm-knowledge-conflict
Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models."
☆42Updated 8 months ago
HITsz-TMG / VisionGraph
The benchmark and datasets of the ICML 2024 paper "VisionGraph: Leveraging Large Multimodal Models for Graph Theory Problems in Visual C…
☆14Updated last year
d-ailin / CLIP-Guided-Decoding
☆17Updated 10 months ago
UNITES-Lab / Flex-MoE
[NeurIPS 2024 Spotlight] Code for the paper "Flex-MoE: Modeling Arbitrary Modality Combination via the Flexible Mixture-of-Experts"
☆55Updated 2 weeks ago
waltonfuture / RL-with-Cold-Start
SFT+RL boosts multimodal reasoning
☆14Updated this week
Alsace08 / OOD-Math-Reasoning
[NeurIPS 2024] Code and Data Repo for Paper "Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning"
☆26Updated last year
reml-group / DoG
☆20Updated 2 months ago
tmlr-group / NoisyRationales
[NeurIPS 2024] "Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales?"
☆34Updated 5 months ago
THUDM / MoELoRA_Riemannian
Source code of paper: A Stronger Mixture of Low-Rank Experts for Fine-Tuning Foundation Models. (ICML 2025)
☆24Updated 2 months ago
SophieZheng998 / ALI-Agent
Official implementation for "ALI-Agent: Assessing LLMs'Alignment with Human Values via Agent-based Evaluation"
☆18Updated last month
tianyi-lab / Mosaic-IT
[ACL'25] Mosaic-IT: Cost-Free Compositional Data Synthesis for Instruction Tuning
☆19Updated this week
NUS-HPC-AI-Lab / GEOM
Pytorch implementation of ICML-2024 "Navigating Complexity: Toward Lossless Graph Condensation via Expanding Window Matching"
☆24Updated last year
lixinustc / GraphAdapter
The efficient tuning method for VLMs
☆80Updated last year
ruthless-man / Awesome-Learn-from-Model
Awesome Learn From Model Beyond Fine-Tuning: A Survey
☆67Updated 6 months ago
Chengsong-Huang / Self-Calibration
codes for Efficient Test-Time Scaling via Self-Calibration
☆14Updated 3 months ago
WisdomShell / RewardAnything
RewardAnything: Generalizable Principle-Following Reward Models
☆22Updated 2 weeks ago
OpenCausaLab / CELLO
☆21Updated 7 months ago
zzwjames / FailureLLMUnlearning
An official implementation of "Catastrophic Failure of LLM Unlearning via Quantization" (ICLR 2025)
☆27Updated 4 months ago
waltonfuture / Diff-eRank
[NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models
☆46Updated last month
StevenZHB / CoT_Causal_Analysis
Repository of paper "How Likely Do LLMs with CoT Mimic Human Reasoning?"
☆22Updated 4 months ago
chendl02 / Awesome-LLM-Causal-Reasoning
[NAACL 25 main] Awesome LLM Causal Reasoning is a collection of LLM-based casual reasoning works, including papers, codes and datasets.
☆66Updated 4 months ago
aeroplanepaper / GRPO-LEAD
☆18Updated last month
hkust-nlp / Activation_Decoding
In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)
☆59Updated last year
QingyangZhang / EMPO
EMPO, A Fully Unsupervised RLVR Method
☆40Updated 2 weeks ago
Raibows / CREAM
Code for "CREAM: Consistency Regularized Self-Rewarding Language Models", ICLR 2025.
☆22Updated 4 months ago
lyan62 / FoodieQA
Official Repo for FoodieQA paper (EMNLP 2024)
☆16Updated this week
maxxu05 / openreview_summarizereviews
Summarizing Mean Review Score for All Submissions for a Conference hosted on Openreview
☆22Updated last year
tsinghua-fib-lab / SmartAgent
The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World".
☆27Updated 3 months ago