The evaluation framework for the InfiCoder-Eval benchmark.
☆21Jul 22, 2024Updated last year
Alternatives and similar repositories for infibench-evaluator
Users that are interested in infibench-evaluator are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICSE 2023] Differentiable interpretation and failure-inducing input generation for neural network numerical bugs.☆13Jan 5, 2024Updated 2 years ago
- ☆11Oct 18, 2022Updated 3 years ago
- The Infibench variant of bigcode-evaluation-harness --- a framework for the evaluation of autoregressive code generation language models.☆14Oct 19, 2024Updated last year
- LLM benchmarks☆13Feb 22, 2024Updated 2 years ago
- Artifact of the ICSE 2020 paper: "ReluDiff: Differential Verification of Deep Neural Networks"☆11Feb 1, 2022Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- The Charon tool for analyzing neural network robustness☆13Mar 19, 2020Updated 6 years ago
- The official repo for GCP-CROWN paper☆13Sep 26, 2022Updated 3 years ago
- ☆26Aug 23, 2024Updated last year
- Certifying Geometric Robustness of Neural Networks☆16Mar 24, 2023Updated 3 years ago
- This is the implementation repository of our incoming ESEC/FSE 2021 paper: Exposing Numerical Bugs in Deep Learning via GradientBack-prop…☆15Oct 16, 2022Updated 3 years ago
- ☆22Apr 15, 2022Updated 4 years ago
- Collections of RLxLM experiments using minimal codes☆14Feb 17, 2025Updated last year
- Keeps track of popular provable training and verification approaches towards robust neural networks, including leaderboards on popular da…☆19Jun 12, 2024Updated last year
- ☆13Feb 14, 2022Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆10Apr 8, 2021Updated 5 years ago
- The Matlab Code for the AISTATS 2015 paper "Learning Deep Sigmoid Belief Network with Data Augmentation"☆13Sep 20, 2015Updated 10 years ago
- Extracts static code features from opencl kernels to be used for machine learning.☆10Apr 30, 2021Updated 5 years ago
- Code for MGDM algorithm, ICML 2025, https://arxiv.org/abs/2502.03332☆16May 19, 2025Updated last year
- ☆15Jul 9, 2025Updated 10 months ago
- Deadline countdowns for academic conferences relevant to the SSE chair.☆13Feb 10, 2026Updated 3 months ago
- The official codes for our paper at COLING 2022: Semantic-Preserving Adversarial Code Comprehension☆12Oct 23, 2022Updated 3 years ago
- Official codebase for “In-Context Learning with Many Demonstration Examples”☆16Feb 13, 2023Updated 3 years ago
- NeurIPS'23: Energy Discrepancies: A Score-Independent Loss for Energy-Based Models☆17Oct 22, 2024Updated last year
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- ☆33Jun 24, 2024Updated last year
- [NeurIPS 2024] Efficiency for Free: Ideal Data Are Transportable Representations☆19Jan 19, 2025Updated last year
- An Empirical Comparison of Unsupervised Constituency Parsing Methods☆14Aug 15, 2021Updated 4 years ago
- Accompanying code for the ProteinGLUE method☆12Apr 12, 2022Updated 4 years ago
- Code and dataset for EMNLP 2022 Findings paper "Benchmarking Language Models for Code Syntax Understanding"☆16Oct 24, 2022Updated 3 years ago
- Code and data for "Impact of Evaluation Methodologies on Code Summarization" in ACL 2022.☆10Sep 6, 2022Updated 3 years ago
- ☆12Nov 30, 2018Updated 7 years ago
- Fast and Modularized CFG-focused Models☆23Nov 8, 2023Updated 2 years ago
- Follow the Wisdom of the Crowd: Effective Text Generation via Minimum Bayes Risk Decoding☆20Nov 16, 2022Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- This repository contains the replication package of our paper "Assessing the Security of GitHub Copilot’s Generated Code - A Targeted Rep…☆10Nov 16, 2023Updated 2 years ago
- ☆12Nov 14, 2021Updated 4 years ago
- ☆15Oct 2, 2024Updated last year
- [NeurIPS 2024 D&B Track] DACO: Towards Application-Driven and Comprehensive Data Analysis via Code Generation☆13Mar 5, 2025Updated last year
- [MICCAI2022] Estimating Model Performance under Domain Shifts with Class-Specific Confidence Scores.☆12Jun 7, 2024Updated last year
- ☆21May 24, 2024Updated 2 years ago
- ☆16Jun 18, 2024Updated last year