The Infibench variant of bigcode-evaluation-harness, a framework for the evaluation of autoregressive code generation language models.
☆14Oct 19, 2024Updated last year
Alternatives and similar repositories for infibench-evaluation-harness
Users who are interested in infibench-evaluation-harness are comparing it to the libraries listed below.
- The evaluation framework for the InfiCoder-Eval benchmark.☆21Jul 22, 2024Updated last year
- ☆11Oct 18, 2022Updated 3 years ago
- ☆28Nov 10, 2025Updated 4 months ago
- Certifying Geometric Robustness of Neural Networks☆16Mar 24, 2023Updated 3 years ago
- This is the implementation repository of our incoming ESEC/FSE 2021 paper: Exposing Numerical Bugs in Deep Learning via Gradient Back-prop…☆15Oct 16, 2022Updated 3 years ago
- ☆22Apr 15, 2022Updated 3 years ago
- A simple OS X app to remind the user to take breaks during work.☆20Mar 11, 2015Updated 11 years ago
- Is Neuron Coverage a Meaningful Measure for Testing Deep Neural Networks? (FSE 2020)☆10Sep 23, 2021Updated 4 years ago
- GAN based 3D Object Reconstruction in Point Cloud // Research on 3D reconstruction based on point-cloud generative adversarial networks☆19Jun 25, 2018Updated 7 years ago
- Convert pretrained RoBerta models to various long-document transformer models☆11Apr 5, 2022Updated 3 years ago
- Keeps track of popular provable training and verification approaches towards robust neural networks, including leaderboards on popular da…☆19Jun 12, 2024Updated last year
- ☆53Jun 26, 2025Updated 8 months ago
- LLM Prompting for Text2SQL via Gradual SQL Refinement☆15Feb 19, 2025Updated last year
- ☆32Jun 12, 2023Updated 2 years ago
- A Manually-Annotated Code Generation Benchmark Aligned with Real-World Code Repositories☆36Sep 4, 2024Updated last year
- The replication package of <Sentiment Analysis for Software Engineering: How Far Can Pre-trained Transformer Models Go?>. Accepted by IC…☆11Nov 29, 2023Updated 2 years ago
- Based on the CodeBert pretrained model; tests on the target dataset either after fine-tuning or directly☆14Oct 19, 2021Updated 4 years ago
- The official repo for "TheoremQA: A Theorem-driven Question Answering dataset" (EMNLP 2023)☆38May 15, 2024Updated last year
- ☆17May 25, 2020Updated 5 years ago
- [PLDI 19'] An Inductive Synthesis Framework for Verifiable Reinforcement Learning☆14Jan 14, 2020Updated 6 years ago
- This repository contains the implementation and the evaluation of our ESEC/FSE 2020 paper: Detecting Numerical Bugs in Neural Network Ar…☆25Dec 17, 2020Updated 5 years ago
- ☆21Jun 12, 2024Updated last year
- ☆22Jul 1, 2024Updated last year
- Official implementation for the paper, StackEval: Benchmarking LLMs in Coding Assistance, https://arxiv.org/abs/2412.05288☆20Oct 30, 2024Updated last year
- AMP implementation for quadruped legged robot in IsaacGymEnvs☆14Nov 30, 2023Updated 2 years ago
- Datasets for cybersecurity☆16Aug 12, 2025Updated 7 months ago
- Help people understand the ZKP mooc course of Berkeley☆14Feb 10, 2023Updated 3 years ago
- Code for the NLP4Prog workshop paper "Reading StackOverflow Encourages Cheating: Adding Question Text Improves Extractive Code Generation"☆21Aug 10, 2021Updated 4 years ago
- Adam with minor modifications which give significant improvement☆19Aug 20, 2021Updated 4 years ago
- The team at Bosch were working on a mapping of SPDX and CycloneDX on both property level (= syntax) and a semantic interpretation of the …☆16Jan 26, 2026Updated last month
- ☆17Jan 5, 2023Updated 3 years ago
- ☆19Dec 15, 2022Updated 3 years ago
- Code for the paper "Consistency Regularization for Certified Robustness of Smoothed Classifiers" (NeurIPS 2020)☆35Jan 11, 2021Updated 5 years ago
- ☆18Apr 14, 2021Updated 4 years ago
- Benchmark data for d3rlpy☆21Nov 28, 2023Updated 2 years ago
- CoCoMIC: Code Completion By Jointly Modeling In-file and Cross-file Context☆18Feb 20, 2026Updated last month
- ☆15Jul 29, 2022Updated 3 years ago
- Code and data for EMNLP-IJCNLP 2019 paper "Are You for Real? Detecting Identity Fraud via Dialogue Interactions"☆16Aug 20, 2019Updated 6 years ago
- This repo is for our submission for ICSE 2025.☆20Jun 12, 2024Updated last year