☆38Jan 17, 2025Updated last year
Alternatives and similar repositories for StanfordClashEval
Users who are interested in StanfordClashEval are comparing it to the libraries listed below.
- [ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts"☆80Apr 12, 2024Updated 2 years ago
- Open sourced result for The Agent Company☆21Nov 11, 2025Updated 6 months ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆157Sep 21, 2024Updated last year
- Official Code For EMNLP2025 Findings: {DLPO : Towards a Robust, Efficient, and Generalizable Prompt Optimization Framework from a Deep-Le…☆10Dec 25, 2025Updated 4 months ago
- Interactive notebooks with tutorials for the agentpy package.☆15Feb 1, 2022Updated 4 years ago
- [KDD24-ADS] R-Eval: A Unified Toolkit for Evaluating Domain Knowledge of Retrieval Augmented Large Language Models☆11Apr 9, 2024Updated 2 years ago
- The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity".☆30Nov 12, 2024Updated last year
- Accompanying code for "Boosted Prompt Ensembles for Large Language Models"☆30Apr 13, 2023Updated 3 years ago
- Leaderboard of Frontier Models for Program Repair https://repairbench.github.io/☆11Oct 26, 2025Updated 6 months ago
- vLLM client with minimal dependencies☆15Feb 28, 2024Updated 2 years ago
- ☆18May 14, 2025Updated last year
- Cross-domain word representation learning☆10May 23, 2015Updated 10 years ago
- (NeurIPS 2024) One-shot Federated Learning via Synthetic Distiller-Distillate Communication☆19Mar 11, 2025Updated last year
- Code accompanying "How I learned to start worrying about prompt formatting".☆118Jun 8, 2025Updated 11 months ago
- Radiology Language Evaluations☆11Nov 17, 2023Updated 2 years ago
- [ICLR 2025] BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval☆200Sep 13, 2025Updated 8 months ago
- LLM Beam Search Example Implementation☆13May 3, 2024Updated 2 years ago
- ☆12Mar 7, 2024Updated 2 years ago
- CodeRosetta: Pushing the Boundaries of Unsupervised Code Translation for Parallel Programming☆11Nov 18, 2024Updated last year
- Official Repository for the ICLR 2022 paper "Generalization of Neural Combinatorial Solvers through the Lens of Adversarial Robustness"☆13Nov 20, 2022Updated 3 years ago
- STAR: Similarity-guided Teacher-Assisted Refinement for Super-Tiny Function Calling Models☆47Apr 23, 2026Updated 3 weeks ago
- ☆11May 17, 2024Updated 2 years ago
- explainable-machine-translation-metrics☆12Jul 15, 2022Updated 3 years ago
- ☆80Nov 19, 2024Updated last year
- [EMNLP 2024 Findings] ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs☆29May 22, 2025Updated 11 months ago
- A Workbench for Autograding Retrieve/Generate Systems☆15Jun 30, 2025Updated 10 months ago
- Code for Slow Transition to Low-Dimensional Chaos in Heavy-Tailed Recurrent Neural Networks (NeurIPS 2025)☆20Mar 16, 2026Updated 2 months ago
- ☆13Apr 3, 2026Updated last month
- [ICLR-2026] Official Implementation of our paper "THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning".☆32Feb 26, 2026Updated 2 months ago
- Official code for "Evaluations of Machine Learning Privacy Defenses are Misleading" (https://arxiv.org/abs/2404.17399)☆13Apr 29, 2024Updated 2 years ago
- Official Repo of Paper "Eliminating Position Bias of Language Models: A Mechanistic Approach"☆23Jun 13, 2025Updated 11 months ago
- Unofficial implementation of the Ask-LLM paper 'How to Train Data-Efficient LLMs', arXiv:2402.09668.☆12Jun 19, 2024Updated last year
- ☆14Oct 12, 2024Updated last year
- [ICLR 2025] Code&Data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization"☆15Jun 21, 2024Updated last year
- ☆24Apr 29, 2026Updated 3 weeks ago
- ☆15Jul 24, 2022Updated 3 years ago
- ☆13Feb 8, 2025Updated last year
- [NeurIPS 2025 Spotlight] Implementation of "KLASS: KL-Guided Fast Inference in Masked Diffusion Models"☆31Jan 3, 2026Updated 4 months ago
- (NeurIPS 2025 🔥) Official implementation for "Efficient Multi-modal Large Language Models via Progressive Consistency Distillation"☆48Feb 11, 2026Updated 3 months ago