The Infibench variant of bigcode-evaluation-harness --- a framework for the evaluation of autoregressive code generation language models.
☆14Oct 19, 2024Updated last year
Alternatives and similar repositories for infibench-evaluation-harness
Users that are interested in infibench-evaluation-harness are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The evaluation framework for the InfiCoder-Eval benchmark.☆21Jul 22, 2024Updated last year
- [ICSE 2023] Differentiable interpretation and failure-inducing input generation for neural network numerical bugs.☆13Jan 5, 2024Updated 2 years ago
- Artifact of the ICSE 2020 paper: "ReluDiff: Differential Verification of Deep Neural Networks"☆11Feb 1, 2022Updated 4 years ago
- The Charon tool for analyzing neural network robustness☆13Mar 19, 2020Updated 6 years ago
- ☆11Oct 18, 2022Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆28Jun 2, 2026Updated last week
- Simple Debugger to run Windbg Commands and also query .NET CLR Runtime data in C#☆25Jun 2, 2022Updated 4 years ago
- The official repo for GCP-CROWN paper☆13Sep 26, 2022Updated 3 years ago
- ☆19Jul 15, 2023Updated 2 years ago
- Certifying Geometric Robustness of Neural Networks☆16Mar 24, 2023Updated 3 years ago
- This is the implementation repository of our incoming ESEC/FSE 2021 paper: Exposing Numerical Bugs in Deep Learning via GradientBack-prop…☆15Oct 16, 2022Updated 3 years ago
- ☆22Apr 15, 2022Updated 4 years ago
- A simple OS X app to remind the user to take breaks during work.☆20Mar 11, 2015Updated 11 years ago
- Helper functions for React Context API inspired by @reduxjs/toolkit☆11Nov 25, 2022Updated 3 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆61Sep 17, 2025Updated 8 months ago
- Is Neuron Coverage a Meaningful Measure for Testing Deep Neural Networks? (FSE 2020)☆10Sep 23, 2021Updated 4 years ago
- Code for ICML2020 "Sequence Generation with Mixed Representations"☆12Jun 27, 2020Updated 5 years ago
- GAN based 3D Object Reconstruction in Point Cloud // 基于点云生成对抗网络的三维重建研究☆19Jun 25, 2018Updated 7 years ago
- 🧩Using backtracking algorithm to solve binary puzzles☆11Jul 17, 2021Updated 4 years ago
- Convert pretrained RoBerta models to various long-document transformer models☆11Apr 5, 2022Updated 4 years ago
- Keeps track of popular provable training and verification approaches towards robust neural networks, including leaderboards on popular da…☆19Jun 12, 2024Updated 2 years ago
- calibrate camera with openCvSharp4☆11Jun 11, 2021Updated 5 years ago
- LLM Prompting for Text2SQL via Gradual SQL Reffnement☆15Feb 19, 2025Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- A collection of publications that works on code models but beyond focusing on the accuracies.☆12Jun 30, 2023Updated 2 years ago
- Local test cases for SysY compilers, used by compiler-dev.☆27Mar 6, 2026Updated 3 months ago
- Sample clients for building applications using NVIDIA AI for Media NIMs☆24Jun 1, 2026Updated last week
- ☆33Jun 12, 2023Updated 3 years ago
- A complete testcase generator for online judges.☆12Nov 1, 2025Updated 7 months ago
- A prompt injection game to collect data for robust ML research☆70Jan 27, 2025Updated last year
- ☆57Jun 26, 2025Updated 11 months ago
- ☆10Jan 4, 2023Updated 3 years ago
- An implementation of the paper "Leveraging ParsBERT for cross-domain polarity sentiment classification of Persian social media comments" …☆10Jul 8, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 用 TypeScript 实现的基础数据结构☆10Jul 25, 2021Updated 4 years ago
- A Manually-Annotated Code Generation Benchmark Aligned with Real-World Code Repositories☆40Sep 4, 2024Updated last year
- The replication package of <Sentiment Analysis for Software Engineering: How Far Can Pre-trained Transformer Models Go?>. Accepted by IC…☆11Nov 29, 2023Updated 2 years ago
- 基于CodeBert预训练模型,微调后/直接对目标数据集进行测试☆14Oct 19, 2021Updated 4 years ago
- UCloud SDK for PHP☆10Updated this week
- ☆10May 13, 2026Updated last month
- The official repo for "TheoremQA: A Theorem-driven Question Answering dataset" (EMNLP 2023)☆40May 15, 2024Updated 2 years ago