uiuc-focal-lab / LLMCert-B
A certifier for bias in LLMs
☆23Updated last month
Alternatives and similar repositories for LLMCert-B
Users who are interested in LLMCert-B are comparing it to the repositories listed below
- Making code editing up to 7.7x faster using multi-layer speculation☆21Updated 3 months ago
- XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts☆31Updated 11 months ago
- [LREC-COLING'24] HumanEval-XL: A Multilingual Code Generation Benchmark for Cross-lingual Natural Language Generalization☆39Updated 3 months ago
- The repository for the paper "DebugBench: Evaluating Debugging Capability of Large Language Models".☆77Updated 10 months ago
- CRUXEval: Code Reasoning, Understanding, and Execution Evaluation☆141Updated 7 months ago
- CodeGuard+: Constrained Decoding for Secure Code Generation☆11Updated 10 months ago
- https://albertqjiang.github.io/Portal-to-ISAbelle/☆56Updated last year
- Official repo for "ProSec: Fortifying Code LLMs with Proactive Security Alignment"☆14Updated 2 months ago
- Simultaneous evaluation on both functionality and security of LLM-generated code.☆17Updated 4 months ago
- SatLM: SATisfiability-Aided Language Models using Declarative Prompting (NeurIPS 2023)☆48Updated 10 months ago
- For our ACL25 Paper: Can Language Models Replace Programmers? RepoCod Says ‘Not Yet’ - by Shanchao Liang and Yiran Hu and Nan Jiang and L…☆19Updated last week
- DafnyBench: A Benchmark for Formal Software Verification☆34Updated 5 months ago
- ☆114Updated 10 months ago
- ☆12Updated 9 months ago
- ☆42Updated 10 months ago
- [NeurIPS'24] SemCoder: Training Code Language Models with Comprehensive Semantics Reasoning☆22Updated 6 months ago
- ☆15Updated 11 months ago
- [ICLR 2024]: Is Self-Repair a Silver Bullet for Code Generation?☆13Updated last year
- Artifact repository for the paper "Lost in Translation: A Study of Bugs Introduced by Large Language Models while Translating Code", In P…☆49Updated last month
- ☆110Updated 10 months ago
- Clover: Closed-Loop Verifiable Code Generation☆35Updated 3 weeks ago
- [FSE-2024] Towards AI-Assisted Synthesis of Verified Dafny Methods☆47Updated 11 months ago
- Releasing code for "ReCode: Robustness Evaluation of Code Generation Models"☆53Updated last year
- Code for the AAAI 2023 paper "CodeAttack: Code-based Adversarial Attacks for Pre-Trained Programming Language Models"☆30Updated 2 years ago
- An Evolving Code Generation Benchmark Aligned with Real-world Code Repositories☆60Updated 9 months ago
- Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents☆73Updated last month
- A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity.☆72Updated 2 months ago
- Röttger et al. (NAACL 2024): "XSTest: A Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models"☆98Updated 3 months ago
- This is the starter kit for the Trojan Detection Challenge 2023 (LLM Edition), a NeurIPS 2023 competition.☆89Updated last year
- Dataset for the Tensor Trust project☆40Updated last year