☆38 · Jan 17, 2025 · Updated last year
Alternatives and similar repositories for StanfordClashEval
Users that are interested in StanfordClashEval are comparing it to the libraries listed below.
- ☆14 · Oct 17, 2024 · Updated last year
- [ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts" ☆82 · Apr 12, 2024 · Updated 2 years ago
- Open sourced result for The Agent Company ☆21 · Nov 11, 2025 · Updated 7 months ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey" ☆156 · Sep 21, 2024 · Updated last year
- The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity". ☆31 · Nov 12, 2024 · Updated last year
- Accompanying code for "Boosted Prompt Ensembles for Large Language Models" ☆30 · Apr 13, 2023 · Updated 3 years ago
- Leaderboard of Frontier Models for Program Repair https://repairbench.github.io/ ☆11 · Oct 26, 2025 · Updated 7 months ago
- Text generation from structured data ☆10 · Dec 2, 2019 · Updated 6 years ago
- For building the world's largest dataset of GPU kernels. ☆10 · Updated this week
- (NeurIPS 2024) One-shot Federated Learning via Synthetic Distiller-Distillate Communication ☆20 · Mar 11, 2025 · Updated last year
- Code accompanying "How I learned to start worrying about prompt formatting". ☆118 · Jun 8, 2025 · Updated last year
- [CVPR2024] LeGO: Leveraging a Surface Deformation Network for Animatable Stylized Face Generation with One Example ☆13 · Jun 3, 2024 · Updated 2 years ago
- [ICLR 2025] BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval ☆202 · Sep 13, 2025 · Updated 8 months ago
- LLM Beam Search Example Implementation ☆13 · May 3, 2024 · Updated 2 years ago
- ☆12 · Mar 7, 2024 · Updated 2 years ago
- Official Repository for the ICLR 2022 paper "Generalization of Neural Combinatorial Solvers through the Lens of Adversarial Robustness" ☆13 · Nov 20, 2022 · Updated 3 years ago
- STAR: Similarity-guided Teacher-Assisted Refinement for Super-Tiny Function Calling Models ☆49 · Apr 23, 2026 · Updated last month
- ☆11 · May 17, 2024 · Updated 2 years ago
- [EMNLP 2024 Findings] ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs ☆29 · May 22, 2025 · Updated last year
- ☆13 · Apr 3, 2026 · Updated 2 months ago
- CVPR2022: Learning from Untrimmed Videos: Self-Supervised Video Representation Learning with Hierarchical Consistency ☆18 · Aug 10, 2022 · Updated 3 years ago
- Unofficial implementation of the Ask-LLM paper 'How to Train Data-Efficient LLMs', arXiv:2402.09668. ☆12 · Jun 19, 2024 · Updated last year
- ☆14 · Oct 12, 2024 · Updated last year
- [ICLR 2025] Code & Data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization" ☆15 · Jun 21, 2024 · Updated last year
- ☆24 · Apr 29, 2026 · Updated last month
- Fine tuning of the Retrieval-Augmented Generation (RAG) with a custom knowledge source. ☆13 · Feb 10, 2021 · Updated 5 years ago
- ☆13 · Feb 8, 2025 · Updated last year
- (NeurIPS 2025 🔥) Official implementation for "Efficient Multi-modal Large Language Models via Progressive Consistency Distillation" ☆49 · Feb 11, 2026 · Updated 3 months ago
- Test equality between a black-box LLM API and a reference distribution ☆18 · Oct 29, 2024 · Updated last year
- Open Source Replication of Anthropic's Alignment Faking Paper ☆58 · Apr 4, 2025 · Updated last year
- Corresponding code to "Improving Robustness of ML Classifiers against Realizable Evasion Attacks Using Conserved Features" @ USENIX Secur… ☆11 · Aug 5, 2019 · Updated 6 years ago
- PyTorch code for Improving Commonsense in Vision-Language Models via Knowledge Graph Riddles (DANCE) ☆23 · Nov 29, 2022 · Updated 3 years ago
- Code for COLING 2020 paper "Improving Document-level Sentiment Analysis with User and Product Context" ☆11 · Apr 13, 2022 · Updated 4 years ago
- ☆25 · Jan 11, 2019 · Updated 7 years ago
- A repository for the EMNLP 2021 paper "Is Information Density Uniform in Task-Oriented Dialogues?" and for the CoNLL 2021 paper "Analysin… ☆10 · Jun 17, 2024 · Updated last year
- A Structured Output Benchmark whose 'ground-truth' is actually right ☆19 · Dec 5, 2025 · Updated 6 months ago
- A Recipe for Building LLM Reasoners to Solve Complex Instructions ☆32 · Oct 9, 2025 · Updated 8 months ago
- [ICLR 2025] "Training LMs on Synthetic Edit Sequences Improves Code Synthesis" (Piterbarg, Pinto, Fergus) ☆19 · Feb 11, 2025 · Updated last year
- [ICLR 2026] Official Implementation of ProxyThinker: Test-Time Guidance through Small Visual Reasoners. ☆22 · Sep 24, 2025 · Updated 8 months ago