☆97Jan 30, 2026Updated 4 months ago
Alternatives and similar repositories for baxbench
Users who are interested in baxbench are comparing it to the libraries listed below.
- ☆21Feb 3, 2025Updated last year
- [NeurIPS 2024] Evaluation harness for SWT-Bench, a benchmark for evaluating LLM repository-level test-generation☆83Apr 28, 2026Updated last month
- ToolFuzz is a fuzzing framework designed to test your LLM Agent tools.☆41Jul 20, 2025Updated 10 months ago
- ☆59Feb 24, 2026Updated 3 months ago
- ☆54Jul 16, 2024Updated last year
- [ICLR 2024] Beyond Accuracy: Evaluating Self-Consistency of Code Large Language Models with IdentityChain☆11Nov 24, 2025Updated 6 months ago
- Guardrails for secure and robust agent development☆427Jan 12, 2026Updated 5 months ago
- A repository of Language Model Vulnerabilities and Exposures (LVEs).☆113Mar 12, 2024Updated 2 years ago
- The library for symbolic interval☆23Jun 23, 2020Updated 5 years ago
- Implementation and datasets for "Training Language Models to Generate Quality Code with Program Analysis Feedback"☆41Jul 21, 2025Updated 10 months ago
- Constrained Decoding of Diffusion LLMs with Context-Free Grammars.☆52Dec 17, 2025Updated 5 months ago
- Generating Adversarial Examples for Holding Robustness of Source Code Processing Models☆17Dec 2, 2021Updated 4 years ago
- ☆21Aug 30, 2022Updated 3 years ago
- A Synthetic Dataset for Personal Attribute Inference (NeurIPS'24 D&B)☆54Jul 27, 2025Updated 10 months ago
- 🔮Reasoning for Safer Code Generation; 🥇Winner Solution of Amazon Nova AI Challenge 2025☆39Aug 24, 2025Updated 9 months ago
- This JavaScript CLI "undeletes" packages that have been removed from the NPM registry☆32Apr 29, 2026Updated last month
- The Swiss Federal Chancellery Fedlex portal (www.fedlex.admin.ch) crawled, prettified and presented as a git repository.☆22Updated this week
- PyOgmios is a Python version of the Ogmios Client☆10Jun 17, 2025Updated 11 months ago
- ☆13Jul 8, 2023Updated 2 years ago
- Enhancing Code Pre-trained Models by Contrastive Learning☆40Mar 8, 2023Updated 3 years ago
- ☆73Feb 16, 2025Updated last year
- ☆20Apr 10, 2025Updated last year
- ☆97Mar 6, 2026Updated 3 months ago
- Challenge Problem #1 - Linux Kernel (NOTE: This code does not reflect the active state of what will be used at competition time, please r…☆58Apr 3, 2024Updated 2 years ago
- Infrastructure-as-code for a serverless knowledge base using Amazon Bedrock, Aurora PostgreSQL (with pgvector), Lambda, and S3. This setu…☆19Mar 23, 2025Updated last year
- Code for the paper "Firewalls to Secure Dynamic LLM Agentic Networks"☆30Jun 6, 2025Updated last year
- Clover: Closed-Loop Verifiable Code Generation☆47May 12, 2025Updated last year
- ☆40Feb 14, 2026Updated 4 months ago
- ☆14Feb 5, 2024Updated 2 years ago
- A low-cost approach to testing AI chat experiences and security concepts☆40May 30, 2026Updated 2 weeks ago
- Web queries dataset for code search☆32Jun 3, 2023Updated 3 years ago
- Evaluation of packer type estimation/detection tools☆14Mar 24, 2021Updated 5 years ago
- ☆17Mar 22, 2024Updated 2 years ago
- An Algorithm to Quantify Robustness of Recurrent Neural Networks☆49Apr 24, 2020Updated 6 years ago
- ☆13Feb 14, 2022Updated 4 years ago
- Codebase, data and models for hallucination of pruned models☆16Jan 11, 2025Updated last year
- ACL24☆11Jun 7, 2024Updated 2 years ago
- [ICSE'24] An Empirical Study of Data Disruption by Ransomware Attacks☆13Mar 1, 2024Updated 2 years ago
- ☆42Dec 8, 2022Updated 3 years ago