Code for the paper: Prompts have evil twins (EMNLP 2024)
☆23 · Feb 10, 2025 · Updated last year
Alternatives and similar repositories for evil_twins
Users who are interested in evil_twins compare it to the libraries listed below.
- Official repository for ALT (ALignment with Textual feedback). ☆10 · Jul 25, 2024 · Updated last year
- Machine Learning for Healthcare ☆10 · Mar 28, 2020 · Updated 5 years ago
- 💀 gigasmol: a lightweight wrapper for gigachat api model for seamless use with smolagents. ☆15 · Oct 23, 2025 · Updated 4 months ago
- ☆11 · Jun 18, 2023 · Updated 2 years ago
- Code for the paper "Semi-Conditional Normalizing Flows for Semi-Supervised Learning" ☆11 · Mar 30, 2020 · Updated 5 years ago
- Introduction to Machine Learning using scikit-learn and PyTorch ☆10 · Sep 26, 2019 · Updated 6 years ago
- ☆10 · Jan 28, 2024 · Updated 2 years ago
- UQGAN: A Unified Model for Uncertainty Quantification of Deep Classifiers trained via Conditional GANs ☆11 · Apr 13, 2023 · Updated 2 years ago
- Prompt-Guided Retrieval For Non-Knowledge-Intensive Tasks ☆12 · Sep 1, 2023 · Updated 2 years ago
- Wave - The Software as a Service Starter Kit, designed to help you build the SAAS of your dreams 🚀 💰 ☆12 · Jan 30, 2026 · Updated last month
- The code for paper "ProQA: Structural Prompt-based Pre-training for Unified Question Answering" ☆11 · Feb 7, 2023 · Updated 3 years ago
- Implementation of Stochastic Gradient Descent algorithms in Python (cite https://doi.org/10.1007/s00158-020-02599-z) ☆11 · May 19, 2021 · Updated 4 years ago
- ☆10 · Mar 6, 2022 · Updated 4 years ago
- ☆13 · Feb 14, 2022 · Updated 4 years ago
- The GPT-4 function calls used in everchanging quest for the HF game jam ☆10 · Jul 9, 2023 · Updated 2 years ago
- CloudLLM is a batteries-included Rust toolkit for building intelligent agents with LLM integration, multi-protocol tool support, and mult… ☆16 · Feb 26, 2026 · Updated last week
- Risky Object Localization (ROL) in a Driving Scene Dataset ☆15 · Dec 24, 2023 · Updated 2 years ago
- Active Learning Helps Pretrained Models Learn the Intended Task (https://arxiv.org/abs/2204.08491) by Alex Tamkin, Dat Nguyen, Salil Desh… ☆11 · Nov 22, 2022 · Updated 3 years ago
- Bayesian scaling laws for in-context learning. ☆15 · Mar 12, 2025 · Updated 11 months ago
- ☆12 · Sep 24, 2024 · Updated last year
- Large-scale text embedding model ☆38 · Sep 6, 2025 · Updated 6 months ago
- NOMU: Neural Optimization-based Model Uncertainty ☆10 · Feb 17, 2023 · Updated 3 years ago
- PRD: Peer Rank and Discussion Improve Large Language Model based Evaluations ☆12 · Apr 21, 2024 · Updated last year
- Causal Feature Selection Tutorial for AMIA2018 ☆12 · Nov 3, 2018 · Updated 7 years ago
- A tiny server to run local inference on MLX model in the style of OpenAI ☆13 · Jan 31, 2024 · Updated 2 years ago
- Supporting material for https://arxiv.org/abs/1907.04769 ☆12 · Sep 20, 2021 · Updated 4 years ago
- Code for the "Long Context Needs Some R&R" paper. ☆12 · Mar 11, 2024 · Updated last year
- ☆12 · Oct 28, 2022 · Updated 3 years ago
- ☆11 · Jan 24, 2022 · Updated 4 years ago
- Implementation of Wasserstein Natural Policy Gradients and Wasserstein Natural Evolution Strategies ☆13 · Mar 9, 2021 · Updated 5 years ago
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions". ☆16 · Feb 9, 2026 · Updated last month
- PyTorch code for "Learning Guidance Rewards with Trajectory-space Smoothing" (NeurIPS 2020) ☆12 · Jul 7, 2021 · Updated 4 years ago
- Code Release for Task Agnostic Dynamics Priors for Deep Reinforcement Learning ☆12 · Jun 13, 2019 · Updated 6 years ago
- [ICLR 2022] "Bayesian Modeling and Uncertainty Quantification for Learning to Optimize: What, Why, and How" by Yuning You, Yue Cao, Tianl… ☆14 · Aug 19, 2022 · Updated 3 years ago
- [ICML 2025] Official repository for paper "OR-Bench: An Over-Refusal Benchmark for Large Language Models" ☆23 · Mar 4, 2025 · Updated last year
- ☆10 · Mar 6, 2024 · Updated 2 years ago
- Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative Decoding. ☆13 · Nov 19, 2024 · Updated last year
- ☆27 · Jan 4, 2026 · Updated 2 months ago
- Active learning in NLP ☆14 · Dec 14, 2022 · Updated 3 years ago