A certifier for bias in LLMs
☆25Apr 11, 2025Updated 11 months ago
Alternatives and similar repositories for LLMCert-B
Users who are interested in LLMCert-B are comparing it to the libraries listed below
- Counterexample-Guided Learning of Monotonic Networks☆18May 19, 2022Updated 3 years ago
- ☆15Feb 4, 2020Updated 6 years ago
- An extension to the Java type system to catch badly-behaving builder patterns☆11Feb 13, 2023Updated 3 years ago
- ☆12Apr 25, 2025Updated 10 months ago
- A SAT Solver based on CDCL (Conflict Driven Clause Learning) implemented in Python☆23Jan 1, 2021Updated 5 years ago
- Test equality between a black-box LLM API and a reference distribution☆12Oct 29, 2024Updated last year
- ☆15Jul 24, 2022Updated 3 years ago
- ☆14Oct 17, 2024Updated last year
- Corresponding code to "Improving Robustness of ML Classifiers against Realizable Evasion Attacks Using Conserved Features" @ USENIX Secur…☆11Aug 5, 2019Updated 6 years ago
- 🔮Reasoning for Safer Code Generation; 🥇Winner Solution of Amazon Nova AI Challenge 2025☆36Aug 24, 2025Updated 6 months ago
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆31Oct 9, 2025Updated 5 months ago
- Corresponding code to "FACESEC: A Fine-grained Robustness Evaluation Framework for Face Recognition Systems" @ CVPR 2021☆13Jun 22, 2021Updated 4 years ago
- ☆14Jun 6, 2023Updated 2 years ago
- Code for the paper "Evading Black-box Classifiers Without Breaking Eggs" [SaTML 2024]☆21Apr 15, 2024Updated last year
- Private Adaptive Optimization with Side Information (ICML '22)☆16Jun 23, 2022Updated 3 years ago
- Official codebase for permutation self-consistency.☆18Feb 11, 2024Updated 2 years ago
- ☆20Mar 19, 2023Updated 3 years ago
- This repo contains the source code for reproducing the experimental results in semantic density paper (Neurips 2024)☆19Sep 28, 2025Updated 5 months ago
- ☆17Aug 2, 2023Updated 2 years ago
- UnQovering Stereotyping Biases via Underspecified Questions - EMNLP 2020 (Findings)☆21Jul 6, 2021Updated 4 years ago
- Code for the paper "Spectral Editing of Activations for Large Language Model Alignments"☆29Dec 20, 2024Updated last year
- [EMNLP'24] Code and data for paper "Ladder: A Model-Agnostic Framework Boosting LLM-based Machine Translation to the Next Level"☆24Jun 29, 2024Updated last year
- ELINA: ETH LIbrary for Numerical Analysis☆137Apr 7, 2023Updated 2 years ago
- MCP server integrating GEPA (Genetic-Evolutionary Prompt Architecture) for automatic prompt optimization with Claude Desktop☆47Nov 10, 2025Updated 4 months ago
- Code for NeurIPS 2024 Paper - Superposed Decoding: Multiple Generations from a Single Autoregressive Inference Pass☆21Aug 22, 2024Updated last year
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning☆99Dec 1, 2025Updated 3 months ago
- A new algorithm that formulates jailbreaking as a reasoning problem.☆26Jul 2, 2025Updated 8 months ago
- ☆27Sep 15, 2024Updated last year
- ☆22Sep 13, 2021Updated 4 years ago
- TensorFlow implementation of Meta Adversarial Training for Adversarial Patch Attacks on Tiny ImageNet.☆26Jan 28, 2021Updated 5 years ago
- ☆40Aug 20, 2025Updated 7 months ago
- Experimental translation of LLVM to SMT.☆59Apr 8, 2020Updated 5 years ago
- ☆21Oct 23, 2024Updated last year
- EMNLP 2022: "MABEL: Attenuating Gender Bias using Textual Entailment Data" https://arxiv.org/abs/2210.14975☆38Dec 14, 2023Updated 2 years ago
- Fast Memorization of Prompt Improves Context Awareness of Large Language Models (Findings of EMNLP 2024)☆23Oct 22, 2024Updated last year
- Interpreting the latent space representations of attention head outputs for LLMs☆39Aug 13, 2024Updated last year
- ☆11Jan 3, 2024Updated 2 years ago
- ☆36Feb 12, 2025Updated last year
- My CS305 Computer Networking Lab Assignments☆30Nov 14, 2018Updated 7 years ago