☆94Jan 30, 2026Updated 3 months ago
Alternatives and similar repositories for baxbench
Users that are interested in baxbench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆20Feb 3, 2025Updated last year
- [NeurIPS 2024] Evaluation harness for SWT-Bench, a benchmark for evaluating LLM repository-level test-generation☆76Mar 23, 2026Updated last month
- [ICLR 2024] Beyond Accuracy: Evaluating Self-Consistency of Code Large Language Models with IdentityChain☆10Nov 24, 2025Updated 5 months ago
- Guardrails for secure and robust agent development☆413Jan 12, 2026Updated 3 months ago
- ☆73Nov 7, 2025Updated 5 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆13Jun 24, 2025Updated 10 months ago
- The library for symbolic interval☆22Jun 23, 2020Updated 5 years ago
- A repository of Language Model Vulnerabilities and Exposures (LVEs).☆113Mar 12, 2024Updated 2 years ago
- Implementation and datasets for "Training Language Models to Generate Quality Code with Program Analysis Feedback"☆42Jul 21, 2025Updated 9 months ago
- ☆129Jul 14, 2024Updated last year
- Robust Cross-lingual Embeddings from Parallel Sentences☆22Jun 27, 2020Updated 5 years ago
- Constrained Decoding of Diffusion LLMs with Context-Free Grammars.☆45Dec 17, 2025Updated 4 months ago
- ☆22May 23, 2025Updated 11 months ago
- ☆21Aug 30, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A dashboard that show the relationships between urban spaces and their networks of design, production and consumption with Maker initiati…☆14Nov 10, 2017Updated 8 years ago
- G'n'T Eval is an evaluation suite that allows to carry out pen and paper evaluations. It ships with all necessary tools, i.e. management …☆14Nov 2, 2013Updated 12 years ago
- A Synthetic Dataset for Personal Attribute Inference (NeurIPS'24 D&B)☆54Jul 27, 2025Updated 9 months ago
- Notes and insights about OpenAI's Code Interpreter☆13Jul 26, 2023Updated 2 years ago
- 🔮Reasoning for Safer Code Generation; 🥇Winner Solution of Amazon Nova AI Challenge 2025☆37Aug 24, 2025Updated 8 months ago
- A Python obfuscator using Python's abstract Syntax trees to change all variable names to different Unicode variations of X.☆11Jun 27, 2020Updated 5 years ago
- Solutions to math olympiad problems in Isabelle/HOL☆11May 29, 2021Updated 4 years ago
- The Swiss Federal Chancellery Fedlex portal (www.fedlex.admin.ch) crawled, prettified and presented as a git repository.☆21Updated this week
- a proof-of-concept of a diverse double compilation☆13Feb 10, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A project (LLM Sentinel) that showcases NVIDIA's NeMo-Guardrails and LangChain for improving LLM safety☆12Jan 22, 2025Updated last year
- [NeurIPS 2019] H. Chen*, H. Zhang*, S. Si, Y. Li, D. Boning and C.-J. Hsieh, Robustness Verification of Tree-based Models (*equal contrib…☆27Jun 15, 2019Updated 6 years ago
- A compiler and bytecode interpreter for a subset of Python☆10Jan 23, 2021Updated 5 years ago
- ☆12Jul 8, 2023Updated 2 years ago
- Enhacing Code Pre-trained Models by Contrastive Learning☆39Mar 8, 2023Updated 3 years ago
- Use any program to perform fixups for afl via AFL_POST_LIBRARY☆11Aug 31, 2020Updated 5 years ago
- This is the replication package of V-SZZ, which has been accepted by ICSE2022☆15Jan 19, 2026Updated 3 months ago
- ☆94Mar 6, 2026Updated last month
- Challenge Problem #1 - Linux Kernel (NOTE: This code does not reflect the active state of what will be used at competition time, please r…☆59Apr 3, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Home of course "Programmable Society" at KTH Royal Institute of Technology☆21Dec 12, 2025Updated 4 months ago
- Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models"☆11Apr 15, 2024Updated 2 years ago
- P4Control: Line-Rate Cross-Host Attack Prevention via In-Network Information Flow Control Enabled by Programmable Switches and eBPF☆11May 20, 2024Updated last year
- ☆14Feb 5, 2024Updated 2 years ago
- Cobra-W -> Cobra-RE 将进一步提升漏洞发现的准确性并降低漏报率(弃坑了)☆16Aug 15, 2020Updated 5 years ago
- Building self-refined guardrails via DSPy☆14Jul 2, 2024Updated last year
- Web queries dataset for code search☆32Jun 3, 2023Updated 2 years ago