The evaluation framework for the InfiCoder-Eval benchmark.
☆21Jul 22, 2024Updated last year
Alternatives and similar repositories for infibench-evaluator
Users that are interested in infibench-evaluator are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆17Feb 22, 2024Updated 2 years ago
- The Infibench variant of bigcode-evaluation-harness --- a framework for the evaluation of autoregressive code generation language models.☆14Oct 19, 2024Updated last year
- LLM benchmarks☆13Feb 22, 2024Updated 2 years ago
- The Charon tool for analyzing neural network robustness☆13Mar 19, 2020Updated 6 years ago
- ☆16Nov 26, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A framework for evaluating the effectiveness of chain-of-thought reasoning in language models.☆19Feb 6, 2025Updated last year
- The official repo for GCP-CROWN paper☆13Sep 26, 2022Updated 3 years ago
- ☆26Aug 23, 2024Updated last year
- Certifying Geometric Robustness of Neural Networks☆16Mar 24, 2023Updated 3 years ago
- This is the implementation repository of our incoming ESEC/FSE 2021 paper: Exposing Numerical Bugs in Deep Learning via GradientBack-prop…☆15Oct 16, 2022Updated 3 years ago
- ☆22Apr 15, 2022Updated 4 years ago
- A simple OS X app to remind the user to take breaks during work.☆20Mar 11, 2015Updated 11 years ago
- GAN based 3D Object Reconstruction in Point Cloud // 基于点云生成对抗网络的三维重建研究☆19Jun 25, 2018Updated 7 years ago
- Keeps track of popular provable training and verification approaches towards robust neural networks, including leaderboards on popular da…☆19Jun 12, 2024Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Extending context length of visual language models☆12Dec 18, 2024Updated last year
- The Matlab Code for the AISTATS 2015 paper "Learning Deep Sigmoid Belief Network with Data Augmentation"☆13Sep 20, 2015Updated 10 years ago
- Code for MGDM algorithm, ICML 2025, https://arxiv.org/abs/2502.03332☆16May 19, 2025Updated last year
- ☆15Jul 9, 2025Updated 11 months ago
- Deadline countdowns for academic conferences relevant to the SSE chair.☆13Updated this week
- ☆33Jun 24, 2024Updated last year
- [NeurIPS 2024] Efficiency for Free: Ideal Data Are Transportable Representations☆19Jan 19, 2025Updated last year
- ☆16Jul 29, 2025Updated 10 months ago
- ☆57Jun 26, 2025Updated 11 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Code and data for "Impact of Evaluation Methodologies on Code Summarization" in ACL 2022.☆10Sep 6, 2022Updated 3 years ago
- ☆12Nov 30, 2018Updated 7 years ago
- Fast and Modularized CFG-focused Models☆23Nov 8, 2023Updated 2 years ago
- Follow the Wisdom of the Crowd: Effective Text Generation via Minimum Bayes Risk Decoding☆20Nov 16, 2022Updated 3 years ago
- ☆15Oct 2, 2024Updated last year
- ☆21May 24, 2024Updated 2 years ago
- SuperDebug,debug如此简单!☆17Jul 19, 2022Updated 3 years ago
- β-CROWN: Efficient Bound Propagation with Per-neuron Split Constraints for Neural Network Verification☆31Nov 9, 2021Updated 4 years ago
- Repository for the Adversarial ML on Code things☆16Jun 25, 2020Updated 5 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Official implementation of Scaling Laws in Patchification: An Image Is Worth 50,176 Tokens And More☆25Feb 25, 2025Updated last year
- This repository contains the implementation and the evaluation of our ESEC/FSE 2020 paper: Detecting Numerical Bugs in Neural Network Ar…☆25Dec 17, 2020Updated 5 years ago
- Code for "SePPO: Semi-Policy Preference Optimization for Diffusion Alignment."☆18Oct 7, 2024Updated last year
- The implement of ACL2024: "MAPO: Advancing Multilingual Reasoning through Multilingual Alignment-as-Preference Optimization"☆43Jun 15, 2024Updated last year
- CC: Causality-Aware Coverage Criterion for Deep Neural Networks☆12Feb 15, 2023Updated 3 years ago
- Generate pydantic models from JSON Schema☆24Sep 19, 2023Updated 2 years ago
- Boilerplate templates for common Erlang OTP behaviors.☆17Mar 4, 2011Updated 15 years ago