☆39Jan 17, 2025Updated last year
Alternatives and similar repositories for StanfordClashEval
Users that are interested in StanfordClashEval are comparing it to the libraries listed below.
- ☆14Oct 17, 2024Updated last year
- [ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts"☆80Apr 12, 2024Updated 2 years ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆153Sep 21, 2024Updated last year
- Fast Memorization of Prompt Improves Context Awareness of Large Language Models (Findings of EMNLP 2024)☆23Oct 22, 2024Updated last year
- Code for "Knowledge Card: Filling LLMs' Knowledge Gaps with Plug-in Specialized Language Models", ICLR 2024 Oral.☆21Feb 4, 2026Updated 2 months ago
- Official Code For EMNLP2025 Findings: {DLPO : Towards a Robust, Efficient, and Generalizable Prompt Optimization Framework from a Deep-Le…☆10Dec 25, 2025Updated 3 months ago
- [KDD24-ADS] R-Eval: A Unified Toolkit for Evaluating Domain Knowledge of Retrieval Augmented Large Language Models☆11Apr 9, 2024Updated 2 years ago
- Accompanying code for "Boosted Prompt Ensembles for Large Language Models"☆30Apr 13, 2023Updated 3 years ago
- Leaderboard of Frontier Models for Program Repair https://repairbench.github.io/☆11Oct 26, 2025Updated 5 months ago
- vLLM client with minimal dependencies☆15Feb 28, 2024Updated 2 years ago
- Cross-domain word representation learning☆10May 23, 2015Updated 10 years ago
- (NeurIPS 2024) One-shot Federated Learning via Synthetic Distiller-Distillate Communication☆19Mar 11, 2025Updated last year
- Radiology Language Evaluations☆11Nov 17, 2023Updated 2 years ago
- [CVPR2024] LeGO: Leveraging a Surface Deformation Network for Animatable Stylized Face Generation with One Example☆13Jun 3, 2024Updated last year
- Danmuku dataset☆12Jul 7, 2023Updated 2 years ago
- [ICLR 2025] BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval☆197Sep 13, 2025Updated 7 months ago
- STAR: Similarity-guided Teacher-Assisted Refinement for Super-Tiny Function Calling Models☆44Mar 23, 2026Updated 3 weeks ago
- Benchmark of crystal structure prediction algorithms☆15Jun 9, 2025Updated 10 months ago
- ☆12Mar 7, 2024Updated 2 years ago
- CodeRosetta: Pushing the Boundaries of Unsupervised Code Translation for Parallel Programming☆11Nov 18, 2024Updated last year
- ☆11May 17, 2024Updated last year
- explainable-machine-translation-metrics☆12Jul 15, 2022Updated 3 years ago
- [EMNLP 2024 Findings] ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs☆29May 22, 2025Updated 10 months ago
- ☆12Apr 25, 2025Updated 11 months ago
- ☆13Apr 3, 2026Updated 2 weeks ago
- Official Repo of Paper "Eliminating Position Bias of Language Models: A Mechanistic Approach"☆22Jun 13, 2025Updated 10 months ago
- [ICLR-2026] Official Implementation of our paper "THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning".☆32Feb 26, 2026Updated last month
- Official code for "Evaluations of Machine Learning Privacy Defenses are Misleading" (https://arxiv.org/abs/2404.17399)☆12Apr 29, 2024Updated last year
- Unofficial implementation of the Ask-LLM paper 'How to Train Data-Efficient LLMs', arXiv:2402.09668.☆12Jun 19, 2024Updated last year
- ☆14Oct 12, 2024Updated last year
- Test equality between a black-box LLM API and a reference distribution☆13Oct 29, 2024Updated last year
- Project for restoring beautiful K-pop Idols Images to high quality.☆14Mar 19, 2023Updated 3 years ago
- ☆15Jul 24, 2022Updated 3 years ago
- [ICLR 2026] Official Implementation of ProxyThinker: Test-Time Guidance through Small Visual Reasoners.☆21Sep 24, 2025Updated 6 months ago
- Fine tuning of the Retrieval-Augmented Generation (RAG) with a custom knowledge source.☆13Feb 10, 2021Updated 5 years ago
- ☆13Feb 8, 2025Updated last year
- (NeurIPS 2025 🔥) Official implementation for "Efficient Multi-modal Large Language Models via Progressive Consistency Distillation"☆48Feb 11, 2026Updated 2 months ago
- ☆17Oct 2, 2024Updated last year
- Open Source Replication of Anthropic's Alignment Faking Paper☆56Apr 4, 2025Updated last year