[COLM'24] How Easily do Irrelevant Inputs Skew the Responses of Large Language Models?
☆22Oct 13, 2024Updated last year
Alternatives and similar repositories for LLM-Robustness-to-Irrelevant-Information
Users that are interested in LLM-Robustness-to-Irrelevant-Information are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NAACL'25] "Revealing the Barriers of Language Agents in Planning"☆13Jun 22, 2025Updated 9 months ago
- [NeurIPS'25 Spotlight] ARM: Adaptive Reasoning Model☆65Apr 6, 2026Updated last week
- [EMNLP 2024 Tutorial] Language Agents: Foundations, Prospects, and Risks☆10Nov 27, 2024Updated last year
- MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following☆16Oct 31, 2024Updated last year
- [ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts"☆80Apr 12, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Source code for our paper: "Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction A…☆49Jan 28, 2024Updated 2 years ago
- The implementation for our paper, "Improving Simultaneous Machine Translation with Monolingual Data," accepted to AAAI 2023. 🎉☆12Jul 19, 2023Updated 2 years ago
- Resources for our AAAI 2022 paper: "Unsupervised Editing for Counterfactual Stories".☆12Oct 25, 2022Updated 3 years ago
- ☆34Dec 11, 2024Updated last year
- ☆11May 24, 2024Updated last year
- Repository for Skill Set Optimization☆14Jul 26, 2024Updated last year
- ☆10Mar 19, 2024Updated 2 years ago
- A tool library for riichi mahjong written in Rust, made mostly to be used as a WASM component.☆13Aug 29, 2025Updated 7 months ago
- Multimodal RAG using LlamaIndex, Qdrant, llama.cpp for document QA with local VisonLLM and embedding models☆18Nov 8, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Code for the "Long Context Needs Some R&R" paper.☆12Mar 11, 2024Updated 2 years ago
- Resources for our ACL 2023 paper: Distilling Script Knowledge from Large Language Models for Constrained Language Planning☆36Aug 19, 2023Updated 2 years ago
- Interface for GenAI-Arena [NeurIPS24]☆17Feb 27, 2024Updated 2 years ago
- Code for EMNLP'20 paper "When Hearst Is not Enough: Improving Hypernymy Detection from Corpus with Distributional Models"☆11Nov 10, 2020Updated 5 years ago
- EANN(Pytorch)☆10Mar 12, 2022Updated 4 years ago
- This repo is to demo the concept of lossless compression with Transformers as encoder and decoder.☆14May 2, 2024Updated last year
- STAR: Similarity-guided Teacher-Assisted Refinement for Super-Tiny Function Calling Models☆44Mar 23, 2026Updated 3 weeks ago
- True Few-Shot BioIE: Benchmarking GPT-3 In-Context and Small PLM Fine-Tuning☆12Jul 6, 2022Updated 3 years ago
- Code, datasets, models for the paper "Automatic Evaluation of Attribution by Large Language Models"☆56Jul 3, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Code and data for paper "Large language models can rate news outlet credibility"☆13Aug 10, 2024Updated last year
- pytorch-TripletSemiHardLoss☆10Jan 12, 2022Updated 4 years ago
- Code for Engel, Grossmann & Ockenfels☆17Jan 2, 2026Updated 3 months ago
- Code for the paper "CoS: Enhancing Personalization and Mitigating Bias with Context Steering"☆20Dec 13, 2024Updated last year
- Portable TCP/UDP/ICMP traceroute tool, written in Python☆17Apr 18, 2020Updated 6 years ago
- Evaluating and improving the faithfulness of the interpretations offered by Neural Module Networks☆13Jun 12, 2023Updated 2 years ago
- ☆11Mar 13, 2023Updated 3 years ago
- ☆10Apr 24, 2022Updated 3 years ago
- ROUGE for multilingual Summarization☆25Oct 11, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- 多语言降噪预训练模型MBart的 中文生成任务☆11May 27, 2021Updated 4 years ago
- We introduce Reasoning via Video, a new paradigm that uses maze-solving video generation to probe multimodal reasoning; our VR-Bench show…☆58Feb 4, 2026Updated 2 months ago
- ☆20Nov 4, 2023Updated 2 years ago
- ☆10Jun 21, 2021Updated 4 years ago
- A dataset and CLIP baseline for unrepresentative news thumbnail detection (ACL 2022 workshop)☆12May 26, 2022Updated 3 years ago
- ☆22May 7, 2025Updated 11 months ago
- Analyzing Latent Concept in Pre-trained Transformer Models☆12Jul 18, 2022Updated 3 years ago