A certifier for bias in LLMs
☆25 · Apr 11, 2025 · Updated last year
Alternatives and similar repositories for LLMCert-B
Users that are interested in LLMCert-B are comparing it to the libraries listed below.
- Generic interface for hooking up to any Interactive Theorem Prover (ITP) and collecting data for training ML models for AI in formal theo… ☆19 · May 31, 2026 · Updated last week
- ☆15 · Jun 25, 2025 · Updated 11 months ago
- ☆12 · Apr 25, 2025 · Updated last year
- ☆15 · Jul 24, 2022 · Updated 3 years ago
- ☆14 · Oct 17, 2024 · Updated last year
- Test equality between a black-box LLM API and a reference distribution ☆18 · Oct 29, 2024 · Updated last year
- Corresponding code to "Improving Robustness of ML Classifiers against Realizable Evasion Attacks Using Conserved Features" @ USENIX Secur… ☆11 · Aug 5, 2019 · Updated 6 years ago
- A Recipe for Building LLM Reasoners to Solve Complex Instructions ☆32 · Oct 9, 2025 · Updated 8 months ago
- Corresponding code to "FACESEC: A Fine-grained Robustness Evaluation Framework for Face Recognition Systems" @ CVPR 2021 ☆13 · Jun 22, 2021 · Updated 4 years ago
- ☆14 · Jun 6, 2023 · Updated 3 years ago
- ☆13 · Oct 14, 2020 · Updated 5 years ago
- A very limited implementation of arXiv:1904.00759 ☆13 · Dec 2, 2019 · Updated 6 years ago
- COPRA: An in-COntext PRoof Agent which uses LLMs like GPTs to prove theorems in formal languages. ☆74 · May 31, 2026 · Updated last week
- Remake for SUSTech_CS305 codes in 2019 Fall. ☆15 · Jun 26, 2022 · Updated 3 years ago
- ☆19 · Mar 19, 2023 · Updated 3 years ago
- This repo contains the source code for reproducing the experimental results in semantic density paper (Neurips 2024) ☆20 · Sep 28, 2025 · Updated 8 months ago
- Official codebase for permutation self-consistency. ☆19 · Feb 11, 2024 · Updated 2 years ago
- UnQovering Stereotyping Biases via Underspecified Questions - EMNLP 2020 (Findings) ☆21 · Jul 6, 2021 · Updated 4 years ago
- Code for the paper "Spectral Editing of Activations for Large Language Model Alignments" ☆31 · Dec 20, 2024 · Updated last year
- [EMNLP'24] Code and data for paper "Ladder: A Model-Agnostic Framework Boosting LLM-based Machine Translation to the Next Level" ☆23 · Jun 29, 2024 · Updated last year
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning ☆105 · Dec 1, 2025 · Updated 6 months ago
- Code for NeurIPS 2024 Paper - Superposed Decoding: Multiple Generations from a Single Autoregressive Inference Pass ☆21 · Aug 22, 2024 · Updated last year
- A new algorithm that formulates jailbreaking as a reasoning problem. ☆26 · Jul 2, 2025 · Updated 11 months ago
- ☆30 · Sep 15, 2024 · Updated last year
- ☆21 · Sep 13, 2021 · Updated 4 years ago
- Tensorflow implementation of Meta Adversarial Training for Adversarial Patch Attacks on Tiny ImageNet. ☆26 · Jan 28, 2021 · Updated 5 years ago
- ☆42 · Aug 20, 2025 · Updated 9 months ago
- Code to generate NeuralExecs (prompt injection for LLMs) ☆27 · Oct 5, 2025 · Updated 8 months ago
- Application of CollaGAN (Collaborative GAN) for MRI Image Imputation ☆28 · Dec 8, 2019 · Updated 6 years ago
- ☆21 · Oct 23, 2024 · Updated last year
- ☆33 · Feb 10, 2025 · Updated last year
- EMNLP 2022: "MABEL: Attenuating Gender Bias using Textual Entailment Data" https://arxiv.org/abs/2210.14975 ☆38 · Dec 14, 2023 · Updated 2 years ago
- Fine-tuning base models to build robust task-specific models ☆36 · Apr 11, 2024 · Updated 2 years ago
- Dataset associated with "BOLD: Dataset and Metrics for Measuring Biases in Open-Ended Language Generation" paper ☆88 · Mar 2, 2021 · Updated 5 years ago
- Fast Memorization of Prompt Improves Context Awareness of Large Language Models (Findings of EMNLP 2024) ☆23 · Oct 22, 2024 · Updated last year
- Interpreting the latent space representations of attention head outputs for LLMs ☆39 · Aug 13, 2024 · Updated last year
- ☆37 · Feb 12, 2025 · Updated last year
- Open Source Replication of Anthropic's Alignment Faking Paper ☆58 · Apr 4, 2025 · Updated last year
- Accompanying code for "Boosted Prompt Ensembles for Large Language Models" ☆30 · Apr 13, 2023 · Updated 3 years ago