☆38 · Jan 17, 2025 · Updated last year
Alternatives and similar repositories for StanfordClashEval
Users that are interested in StanfordClashEval are comparing it to the libraries listed below.
- [ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts" ☆83 · Apr 12, 2024 · Updated 2 years ago
- Open sourced result for The Agent Company ☆21 · Updated this week
- Fast Memorization of Prompt Improves Context Awareness of Large Language Models (Findings of EMNLP 2024) ☆23 · Oct 22, 2024 · Updated last year
- Code for "Knowledge Card: Filling LLMs' Knowledge Gaps with Plug-in Specialized Language Models", ICLR 2024 Oral. ☆21 · Feb 4, 2026 · Updated 4 months ago
- HealthFC: Verifying Health Claims with Evidence-Based Medical Fact-Checking ☆13 · Apr 11, 2025 · Updated last year
- The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity". ☆32 · Nov 12, 2024 · Updated last year
- Leaderboard of Frontier Models for Program Repair https://repairbench.github.io/ ☆11 · Oct 26, 2025 · Updated 8 months ago
- Text generation from structured data ☆10 · Dec 2, 2019 · Updated 6 years ago
- ☆20 · May 14, 2025 · Updated last year
- Cross-domain word representation learning ☆10 · May 23, 2015 · Updated 11 years ago
- Code accompanying "How I learned to start worrying about prompt formatting". ☆118 · Jun 8, 2025 · Updated last year
- Danmuku dataset ☆12 · Jul 7, 2023 · Updated 2 years ago
- [ICLR 2025] BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval ☆204 · Sep 13, 2025 · Updated 9 months ago
- Benchmark of crystal structure prediction algorithms ☆15 · Jun 9, 2025 · Updated last year
- ☆12 · Mar 7, 2024 · Updated 2 years ago
- CodeRosetta: Pushing the Boundaries of Unsupervised Code Translation for Parallel Programming ☆11 · Nov 18, 2024 · Updated last year
- ☆11 · May 17, 2024 · Updated 2 years ago
- Final project for the class "Deep Learning Systems Algorithms and Implementation" from CMU, where we try to make needle work with Apple M… ☆10 · Jan 8, 2023 · Updated 3 years ago
- [EMNLP 2024 Findings] ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs ☆29 · May 22, 2025 · Updated last year
- A Workbench for Autograding Retrieve/Generate Systems ☆15 · Jun 30, 2025 · Updated last year
- ☆13 · Apr 3, 2026 · Updated 2 months ago
- ☆12 · Apr 25, 2025 · Updated last year
- Hands and face detection using TensorFlow.js ☆13 · Nov 16, 2023 · Updated 2 years ago
- [ICLR-2026] Official Implementation of our paper "THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning". ☆32 · Feb 26, 2026 · Updated 4 months ago
- Official code for "Evaluations of Machine Learning Privacy Defenses are Misleading" (https://arxiv.org/abs/2404.17399) ☆13 · Apr 29, 2024 · Updated 2 years ago
- Unofficial implementation of the Ask-LLM paper 'How to Train Data-Efficient LLMs', arXiv:2402.09668. ☆12 · Jun 19, 2024 · Updated 2 years ago
- [ICLR 2025] Code&Data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization" ☆15 · Jun 21, 2024 · Updated 2 years ago
- ☆24 · Apr 29, 2026 · Updated 2 months ago
- ☆15 · Jul 24, 2022 · Updated 3 years ago
- A version of the SUSTAIN model of category learning (Love, Medin, & Gureckis, 2004) implemented in Python ☆20 · Apr 20, 2016 · Updated 10 years ago
- (NeurIPS 2025 🔥) Official implementation for "Efficient Multi-modal Large Language Models via Progressive Consistency Distillation" ☆50 · Feb 11, 2026 · Updated 4 months ago
- ☆18 · Oct 2, 2024 · Updated last year
- Test equality between a black-box LLM API and a reference distribution ☆18 · Oct 29, 2024 · Updated last year
- Corresponding code to "Improving Robustness of ML Classifiers against Realizable Evasion Attacks Using Conserved Features" @ USENIX Secur… ☆11 · Aug 5, 2019 · Updated 6 years ago
- Code for COLING 2020 paper "Improving Document-level Sentiment Analysis with User and Product Context" ☆11 · Apr 13, 2022 · Updated 4 years ago
- SemEval2026 Task 3 DimABSA ☆33 · Jun 8, 2026 · Updated 3 weeks ago
- [IJCAI 2024] FactCHD: Benchmarking Fact-Conflicting Hallucination Detection ☆91 · Apr 28, 2024 · Updated 2 years ago
- A repository for the EMNLP 2021 paper "Is Information Density Uniform in Task-Oriented Dialogues?" and for the CoNLL 2021 paper "Analysin… ☆10 · Jun 17, 2024 · Updated 2 years ago
- A Text-To-Speech Model Developed Using 🐸STT ☆13 · Jun 22, 2022 · Updated 4 years ago