Implementation of "ACL'24: When Do LLMs Need Retrieval Augmentation? Mitigating LLMs’ Overconfidence Helps Retrieval Augmentation"
☆24Jul 19, 2024Updated last year
Alternatives and similar repositories for When-to-Retrieve
Users that are interested in When-to-Retrieve are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Source code for Truth-Aware Context Selection: Mitigating the Hallucinations of Large Language Models Being Misled by Untruthful Contexts☆17Sep 2, 2024Updated last year
- [Findings of EMNLP'2024] Unified Active Retrieval for Retrieval Augmented Generation☆23Sep 30, 2024Updated last year
- The repository for the paper "Is Killed More Significant than Fled? A Contextual Model for Salient Event Detection"☆10Jul 5, 2022Updated 3 years ago
- Extended Inductive Reasoning for Personalized Preference Inference from Behavioral Signals☆11Jan 8, 2026Updated 4 months ago
- autogen 中文文档☆10Nov 7, 2023Updated 2 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Code for paper Towards Mitigating LLM Hallucination via Self Reflection☆30Oct 9, 2023Updated 2 years ago
- Backdooring Neural Code Search☆14Sep 8, 2023Updated 2 years ago
- Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere…☆22May 9, 2025Updated last year
- ☆79May 22, 2024Updated last year
- Repository for the paper "RTP-LX: Can LLMs Evaluate Toxicity in Multilingual Scenarios?"☆29May 1, 2025Updated last year
- Small Models, Big Insights: Leveraging Slim Proxy Models To Decide When and What to Retrieve for LLMs (ACL 2024)☆73May 5, 2025Updated last year
- SimKO: Simple Pass@K Policy Optimization☆31Oct 24, 2025Updated 6 months ago
- Reproducible and flexible LLM evaluations for scientific reasoning.☆28Jul 23, 2025Updated 9 months ago
- [SIGIR'24] Generative Retrieval as Multi-Vector Dense Retrieval☆36Oct 18, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆14Aug 18, 2022Updated 3 years ago
- Code for the article "Shortcutted Commonsense: Data Spuriousness in Deep Learning of Commonsense Reasoning", Outstanding Paper at EMNLP20…☆10Nov 7, 2021Updated 4 years ago
- The official GitHub page for paper "NegativePrompt: Leveraging Psychology for Large Language Models Enhancement via Negative Emotional St…☆25May 10, 2024Updated last year
- Confidence Regulation Neurons in Language Models (NeurIPS 2024)☆15Feb 1, 2025Updated last year
- ☆10Oct 17, 2022Updated 3 years ago
- ☆26Oct 9, 2025Updated 6 months ago
- Implementation of ReWiND, "Language-Guided Rewards Teach Robot Policies without New Demonstrations", from USC / Amazon Robotics☆35Aug 15, 2025Updated 8 months ago
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models☆100Oct 19, 2023Updated 2 years ago
- ☆12Mar 7, 2021Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Website for HKU NLP group (under construction)☆14Mar 20, 2026Updated last month
- ☆13Jul 14, 2024Updated last year
- Federated learning using a mixture of experts☆17Feb 16, 2021Updated 5 years ago
- coded with and corrected by Google Anti-Gravity☆13Nov 23, 2025Updated 5 months ago
- 识别工厂中托盘和托盘上的孔☆14Sep 11, 2023Updated 2 years ago
- ☆27Oct 7, 2025Updated 7 months ago
- [EMNLP2023]: MIRACLE: Towards Personalized Dialogue Generation with Latent-Space Multiple Personal Attribute Control☆12Nov 11, 2023Updated 2 years ago
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆29Apr 20, 2026Updated 2 weeks ago
- Code for the ACL2023 paper: CAT: A Contextualized Conceptualization and Instantiation Framework for Commonsense Reasoning (https://aclant…☆11May 9, 2023Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Implementation of AdaCQR(COLING 2025)☆15Dec 30, 2024Updated last year
- Code for the CIKM 2019 Paper: How Does BERT Answer Questions? A Layer-Wise Analysis of Transformer Representations☆32Jun 12, 2023Updated 2 years ago
- 强化学习课程,主要是如 何用强化学习解决问题☆15Dec 10, 2024Updated last year
- [EuroSys'25] Mist: Efficient Distributed Training of Large Language Models via Memory-Parallelism Co-Optimization☆23Apr 13, 2026Updated 3 weeks ago
- MULTITQ is a large-scale dataset featuring ample relevant facts and multiple temporal granularities.☆25Apr 28, 2026Updated last week
- ThetaEvolve: Test-time Learning on Open Problems, enabling RL training on AlphaEvolve/OpenEvolve and emphasizing scaling test-time comput…☆147Feb 27, 2026Updated 2 months ago
- [ICML 2025] Generalization Principles for Inference over Text-Attributed Graphs with Large Language☆22Jul 15, 2025Updated 9 months ago