luka-group / CoINLinks
☆12Updated last year
Alternatives and similar repositories for CoIN
Users that are interested in CoIN are comparing it to the libraries listed below
Sorting:
- Code for ProTrix: Building Models for Planning and Reasoning over Tables with Sentence Context☆18Updated last year
- Repo for paper "CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models".☆12Updated last year
- ☆76Updated last year
- This repository provides the data and the codes used in the AAAI'24 paper, COOPER: Coordinating Specialized Agents towards a Complex Dial…☆25Updated last year
- ☆31Updated 10 months ago
- ☆26Updated last year
- Visual and Embodied Concepts evaluation benchmark☆21Updated 2 years ago
- The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism☆30Updated last year
- RapidIn: Scalable Influence Estimation for Large Language Models (LLMs). The implementation for paper "Token-wise Influential Training Da…☆21Updated 7 months ago
- ☆54Updated last year
- BeHonest: Benchmarking Honesty in Large Language Models☆34Updated last year
- ☆51Updated last year
- [ICML'2024] Can AI Assistants Know What They Don't Know?☆85Updated last year
- One Network, Many Masks: Towards More Parameter-Efficient Transfer Learning☆40Updated 2 years ago
- Evaluating the Ripple Effects of Knowledge Editing in Language Models☆55Updated last year
- Pythonic wrappers for Cider/CiderD evaluation metrics. Provides CIDEr as well as CIDEr-D (CIDEr Defended) which is more robust to gaming …☆13Updated 2 weeks ago
- [2025-TMLR] A Survey on the Honesty of Large Language Models☆63Updated last year
- [ICLR 25 Oral] RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style☆72Updated 5 months ago
- [EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions☆118Updated last year
- The rule-based evaluation subset and code implementation of Omni-MATH☆25Updated 11 months ago
- Public code repo for COLING 2025 paper "Aligning LLMs with Individual Preferences via Interaction"☆40Updated 8 months ago
- Official Repository for paper "Ontology-Free General-Domain Knowledge Graph-to-Text Generation Dataset Synthesis using Large Language Mod…☆13Updated last year
- ☆28Updated last year
- ☆19Updated 2 years ago
- [ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts"☆79Updated last year
- Resources for Retrieval Augmentation for Commonsense Reasoning: A Unified Approach. EMNLP 2022.☆23Updated 3 years ago
- [NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering☆67Updated last year
- [ACL 2025] A Neural-Symbolic Self-Training Framework☆117Updated 6 months ago
- DICE: Detecting In-distribution Data Contamination with LLM's Internal State☆11Updated last year
- ☆20Updated last year