amazon-science / factual-confidence-of-llmsView external linksLinks
Code for paper "Factual Confidence of LLMs: on Reliability and Robustness of Current Estimators"
☆16Dec 4, 2024Updated last year
Alternatives and similar repositories for factual-confidence-of-llms
Users that are interested in factual-confidence-of-llms are comparing it to the libraries listed below
Sorting:
- The source code for running LLMs on the AAAR-1.0 benchmark.☆18Apr 5, 2025Updated 10 months ago
- Fine-tuning large language models with huggingface transformers and deepspeed☆31Dec 11, 2023Updated 2 years ago
- code repo for ICLR 2024 paper "Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs"☆144Mar 14, 2024Updated last year
- Collections of IR Research☆37May 18, 2025Updated 9 months ago
- ☆35May 18, 2023Updated 2 years ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆40Feb 5, 2024Updated 2 years ago
- Repository of IPBench☆19Jan 4, 2026Updated last month
- FinanceGPT-B☆10Mar 26, 2024Updated last year
- ☆11Jul 17, 2023Updated 2 years ago
- Code for experiments on self-prediction as a way to measure introspection in LLMs☆16Dec 10, 2024Updated last year
- [KDD'23] This is the code repo for our KDD'23 paper "DyGen: Learning from Noisy Labels via Dynamics-Enhanced Generative Modeling".☆11Jun 14, 2023Updated 2 years ago
- Evaluation Pipeline for medical tasks.☆12Updated this week
- ☆11Jun 7, 2023Updated 2 years ago
- ☆12Sep 22, 2024Updated last year
- [NeurIPS 2025] Official code for "Tropical Attention: Neural Algorithmic Reasoning for Combinatorial Algorithms"☆23Oct 23, 2025Updated 3 months ago
- [ICLR 2024] This is the official implementation for the paper: "Beyond imitation: Leveraging fine-grained quality signals for alignment"☆10May 5, 2024Updated last year
- The main controller for services in the cs-insights project through docker-compose.☆13Aug 25, 2023Updated 2 years ago
- MS Marco Entity Annotations Disambiguation☆13May 19, 2023Updated 2 years ago
- GBM implementation on Legate☆14Jan 28, 2026Updated 3 weeks ago
- EventEA: Benchmarking Entity Alignment for Event-centric Knowledge Graphs☆11May 8, 2022Updated 3 years ago
- ☆12Nov 9, 2018Updated 7 years ago
- ☆14Jan 24, 2025Updated last year
- Transformer + GAT for RNA chemical reactivity prediction| Stanford Ribonanza☆11Jan 28, 2026Updated 2 weeks ago
- Reproduction Code for Paper "Investigating Multi-Hop Factual Shortcuts in Knowledge Editing of Large Language Models"☆13Jun 1, 2024Updated last year
- 2020厦门国际银行数创金融杯建模大赛-优胜奖方案☆11Feb 2, 2021Updated 5 years ago
- [NeurIPS 2025@FoRLM] R1-Compress: Long Chain-of-Thought Compression via Chunk Compression and Search☆17Jan 24, 2026Updated 3 weeks ago
- The official code of TACL 2022, "Break, Perturb, Build: Automatic Perturbation of Reasoning Paths Through Question Decomposition".☆11Oct 18, 2021Updated 4 years ago
- Code the ICML 2024 paper: "Variance-reduced Zeroth-Order Methods for Fine-Tuning Language Models"☆11Jun 25, 2024Updated last year
- opentqa is a open framework of the textbook question answering, which includes xtqa, mcan, cmr, mfb, mutan.☆11Mar 27, 2021Updated 4 years ago
- EA-HAS-Bench: Energy-Aware Hyperparameter and Architecture Search Benchmark (ICLR Spotlight 2023)☆18Dec 8, 2024Updated last year
- Generating Summaries with Controllable Readability Levels (EMNLP 2023)☆14Aug 6, 2025Updated 6 months ago
- This is the implementation of the 4th place solution (yu4u's part) for RSNA 2024 Lumbar Spine Degenerative Classification at Kaggle.☆10Oct 11, 2024Updated last year
- Code for our paper: "Building A Coding Assistant via Retrieval-Augmented Language Models"☆10Nov 2, 2024Updated last year
- CodeQUEST is a generalizable framework which leverages LLMs to iteratively evaluate and enhance code quality across multiple dimensions f…☆16Feb 11, 2026Updated last week
- Persian Ezafe Recognition Using Transformers and Its Role in Part-Of-Speech Tagging☆10Nov 15, 2021Updated 4 years ago
- Joint Metric Learning Network for 2D Sketch-based 3D Shape Retrieval☆11Oct 19, 2020Updated 5 years ago
- Code for the paper "FinRLlama: A Solution to LLM-Engineered Signals Challenge at FinRL Contest 2024"☆13Feb 14, 2025Updated last year
- ☆11Sep 27, 2022Updated 3 years ago
- Toward Practical Entity Alignment Method Design: Insights from New Highly Heterogeneous Knowledge Graph Datasets☆17Feb 18, 2025Updated last year