WebPAI / MRWeb
☆35 · Updated 10 months ago
Alternatives and similar repositories for MRWeb
Users who are interested in MRWeb are comparing it to the libraries listed below.
- basically all the things I used for this article ☆25 · Updated last year
- ☆33 · Updated 11 months ago
- ☆40 · Updated last year
- MTTM: Metamorphic Testing for Textual Content Moderation Software ☆32 · Updated 2 years ago
- Multilingual safety benchmark for Large Language Models ☆54 · Updated last year
- Code and data for the paper: On the Humanity of Conversational AI: Evaluating the Psychological Portrayal of LLMs ☆128 · Updated 2 weeks ago
- Code and data for the paper: Apathetic or Empathetic? Evaluating LLMs' Emotional Alignments with Humans ☆119 · Updated 2 weeks ago
- ☆60 · Updated last year
- [ICLR 2025] PAD: Personalized Alignment of LLMs at Decoding-Time ☆18 · Updated 10 months ago
- [ICLR 2025] ChartMimic: Evaluating LMM’s Cross-Modal Reasoning Capability via Chart-to-Code Generation ☆131 · Updated last month
- ☆38 · Updated last year
- [ICLR 2026] "VideoReasonBench: Can MLLMs Perform Vision-Centric Complex Video Reasoning?", Yuanxin Liu, Kun Ouyang, Haoning Wu, Yi Liu, L… ☆35 · Updated last week
- [2025-TMLR] A Survey on the Honesty of Large Language Models ☆64 · Updated last year
- Code for ACL 2024 paper "Soft Self-Consistency Improves Language Model Agents" ☆25 · Updated last year
- ☆17 · Updated 3 months ago
- [ACL'25 Main] ChartCoder: Advancing Multimodal Large Language Model for Chart-to-Code Generation ☆77 · Updated 2 months ago
- [EMNLP 2024] "ESC-Eval: Evaluating Emotion Support Conversations in Large Language Models" ☆26 · Updated last year
- Official repository of the video reasoning benchmark MMR-V. Can Your MLLMs "Think with Video"? ☆38 · Updated 7 months ago
- Assessing Context-Aware Creative Intelligence in MLLMs ☆23 · Updated 6 months ago
- [ICLR2026] Laser: Learn to Reason Efficiently with Adaptive Length-based Reward Shaping ☆62 · Updated 8 months ago
- Code and data for the paper: Competing Large Language Models in Multi-Agent Gaming Environments ☆95 · Updated last week
- [ICCV 2025] The official code of the paper "Deciphering Cross-Modal Alignment in Large Vision-Language Models with Modality Integration R… ☆110 · Updated 6 months ago
- ☆20 · Updated 3 months ago
- Game-RL: Synthesizing Multimodal Verifiable Game Data to Boost VLMs' General Reasoning ☆130 · Updated last week
- Code for Research Project TLDR ☆25 · Updated 6 months ago
- Recent papers on (1) Psychology of LLMs; (2) Biases in LLMs. ☆50 · Updated 2 years ago
- [ACL24] EmoBench: Evaluating the Emotional Intelligence of Large Language Models ☆108 · Updated 8 months ago
- MMoE: Multimodal Mixture-of-Experts (EMNLP 2024) ☆14 · Updated last year
- ☆88 · Updated last year
- Code and Data for EMNLP 2024 Paper "Neeko: Leveraging Dynamic LoRA for Efficient Multi-Character Role-Playing Agent" ☆136 · Updated 6 months ago