The Infibench variant of bigcode-evaluation-harness --- a framework for the evaluation of autoregressive code generation language models.
☆14Oct 19, 2024Updated last year
Alternatives and similar repositories for infibench-evaluation-harness
Users that are interested in infibench-evaluation-harness are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The evaluation framework for the InfiCoder-Eval benchmark.☆21Jul 22, 2024Updated last year
- [ICSE 2023] Differentiable interpretation and failure-inducing input generation for neural network numerical bugs.☆13Jan 5, 2024Updated 2 years ago
- ☆11Oct 18, 2022Updated 3 years ago
- ☆28Nov 10, 2025Updated 5 months ago
- The official repo for GCP-CROWN paper☆13Sep 26, 2022Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Certifying Geometric Robustness of Neural Networks☆16Mar 24, 2023Updated 3 years ago
- This is the implementation repository of our incoming ESEC/FSE 2021 paper: Exposing Numerical Bugs in Deep Learning via GradientBack-prop…☆15Oct 16, 2022Updated 3 years ago
- Code for ICML2020 "Sequence Generation with Mixed Representations"☆12Jun 27, 2020Updated 5 years ago
- Convert pretrained RoBerta models to various long-document transformer models☆11Apr 5, 2022Updated 4 years ago
- Keeps track of popular provable training and verification approaches towards robust neural networks, including leaderboards on popular da…☆19Jun 12, 2024Updated last year
- A collection of publications that works on code models but beyond focusing on the accuracies.☆13Jun 30, 2023Updated 2 years ago
- ☆54Jun 26, 2025Updated 9 months ago
- A Manually-Annotated Code Generation Benchmark Aligned with Real-World Code Repositories☆38Sep 4, 2024Updated last year
- The replication package of <Sentiment Analysis for Software Engineering: How Far Can Pre-trained Transformer Models Go?>. Accepted by IC…☆11Nov 29, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 基于CodeBert预训练模型,微调后/直接对目标数据集进行测试☆14Oct 19, 2021Updated 4 years ago
- ☆10Mar 24, 2026Updated 3 weeks ago
- The official repo for "TheoremQA: A Theorem-driven Question Answering dataset" (EMNLP 2023)☆38May 15, 2024Updated last year
- [PLDI 19'] An Inductive Synthesis Framework for Verifiable Reinforcement Learning☆14Jan 14, 2020Updated 6 years ago
- This is a ROS catkin workspace for a robot in frc☆14Dec 16, 2020Updated 5 years ago
- ☆20Jun 12, 2024Updated last year
- Modelling Capture-the-Flag Challenges Using Reinforcement Learning☆15Jul 30, 2022Updated 3 years ago
- ☆22Jul 1, 2024Updated last year
- Official implementation for the paper, StackEval: Benchmarking LLMs in Coding Assistance, https://arxiv.org/abs/2412.05288☆20Oct 30, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Datasets for cybersecurity☆16Aug 12, 2025Updated 8 months ago
- Help people understand the ZKP mooc course of Berkeley☆14Feb 10, 2023Updated 3 years ago
- This is a Unity project that works as a simulator for the ROS FRC robot code hosted in the robot-frc repo.☆14Oct 11, 2020Updated 5 years ago
- Code for the NLP4Prog workshop paper "Reading StackOverflow Encourages Cheating: Adding Question TextImproves Extractive Code Generation"☆21Aug 10, 2021Updated 4 years ago
- Adam with minor modifications which give significant improvement☆19Aug 20, 2021Updated 4 years ago
- The team at Bosch were working on a mapping of SPDX and CycloneDX on both property level (= syntax) and a semantic interpretation of the …☆16Jan 26, 2026Updated 2 months ago
- ☆17Jan 5, 2023Updated 3 years ago
- ☆19Dec 15, 2022Updated 3 years ago
- ☆18Apr 14, 2021Updated 5 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- CoCoMIC: Code Completion By Jointly Modeling In-file and Cross-file Context☆19Feb 20, 2026Updated last month
- ☆15Jul 29, 2022Updated 3 years ago
- [SatML 2024] Shake to Leak: Fine-tuning Diffusion Models Can Amplify the Generative Privacy Risk☆15Mar 15, 2025Updated last year
- ☆59Dec 12, 2025Updated 4 months ago
- This repo is for our submission for ICSE 2025.☆20Jun 12, 2024Updated last year
- Evaluation of source authorship attribution tool☆23Jun 5, 2021Updated 4 years ago
- Simple Semver and SemverRange classes☆16Mar 25, 2026Updated 2 weeks ago