The Infibench variant of bigcode-evaluation-harness --- a framework for the evaluation of autoregressive code generation language models.
☆14Oct 19, 2024Updated last year
Alternatives and similar repositories for infibench-evaluation-harness
Users that are interested in infibench-evaluation-harness are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The evaluation framework for the InfiCoder-Eval benchmark.☆21Jul 22, 2024Updated last year
- Artifact of the ICSE 2020 paper: "ReluDiff: Differential Verification of Deep Neural Networks"☆11Feb 1, 2022Updated 4 years ago
- ☆28Nov 10, 2025Updated 6 months ago
- The official repo for GCP-CROWN paper☆13Sep 26, 2022Updated 3 years ago
- A simple OS X app to remind the user to take breaks during work.☆20Mar 11, 2015Updated 11 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Is Neuron Coverage a Meaningful Measure for Testing Deep Neural Networks? (FSE 2020)☆10Sep 23, 2021Updated 4 years ago
- Code for ICML2020 "Sequence Generation with Mixed Representations"☆12Jun 27, 2020Updated 5 years ago
- Convert pretrained RoBerta models to various long-document transformer models☆11Apr 5, 2022Updated 4 years ago
- LLM Prompting for Text2SQL via Gradual SQL Reffnement☆15Feb 19, 2025Updated last year
- ☆33Jun 12, 2023Updated 2 years ago
- A Manually-Annotated Code Generation Benchmark Aligned with Real-World Code Repositories☆38Sep 4, 2024Updated last year
- The replication package of <Sentiment Analysis for Software Engineering: How Far Can Pre-trained Transformer Models Go?>. Accepted by IC…☆11Nov 29, 2023Updated 2 years ago
- 基于CodeBert预训练模型,微调后/直接对目标数据集进行测试☆14Oct 19, 2021Updated 4 years ago
- ☆10May 13, 2026Updated last week
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆17May 25, 2020Updated 5 years ago
- Fork of uws for Socket.IO☆12Jun 7, 2020Updated 5 years ago
- β-CROWN: Efficient Bound Propagation with Per-neuron Split Constraints for Neural Network Verification☆31Nov 9, 2021Updated 4 years ago
- [PLDI 19'] An Inductive Synthesis Framework for Verifiable Reinforcement Learning☆14Jan 14, 2020Updated 6 years ago
- BetterDiscord Installer☆10Mar 8, 2019Updated 7 years ago
- Topaz Photo AI upscaler inside sd-webui☆12Jul 5, 2024Updated last year
- Modelling Capture-the-Flag Challenges Using Reinforcement Learning☆15Jul 30, 2022Updated 3 years ago
- Official implementation for the paper, StackEval: Benchmarking LLMs in Coding Assistance, https://arxiv.org/abs/2412.05288☆20Oct 30, 2024Updated last year
- AMP implementation for quadruped legged robot in IsaacGymEnvs☆14Nov 30, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Datasets for cybersecurity☆18Aug 12, 2025Updated 9 months ago
- Twitch (OAuth) authentication strategies for Passport.☆10Apr 18, 2024Updated 2 years ago
- Help people understand the ZKP mooc course of Berkeley☆14Feb 10, 2023Updated 3 years ago
- The code for creating the iGSM datasets in papers "Physics of Language Models Part 2.1, Grade-School Math and the Hidden Reasoning Proces…☆86Jan 12, 2025Updated last year
- Code for the NLP4Prog workshop paper "Reading StackOverflow Encourages Cheating: Adding Question TextImproves Extractive Code Generation"☆20Aug 10, 2021Updated 4 years ago
- The team at Bosch were working on a mapping of SPDX and CycloneDX on both property level (= syntax) and a semantic interpretation of the …☆16Jan 26, 2026Updated 3 months ago
- ☆17Jan 5, 2023Updated 3 years ago
- ☆19Dec 15, 2022Updated 3 years ago
- An ES6 template tag which escapes parameters for interpolation into shell commands☆15Jan 6, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆18Apr 14, 2021Updated 5 years ago
- Reddit API Wrapper (NPM Package: reddit-wrapper-v2)☆13Apr 5, 2019Updated 7 years ago
- CoCoMIC: Code Completion By Jointly Modeling In-file and Cross-file Context☆19Feb 20, 2026Updated 3 months ago
- Different approaches for finetuning, evaluating, optimizations for code generation model - codestral☆11Jun 18, 2024Updated last year
- Hybrid action space reinforcement learning algorithms.☆14Mar 26, 2021Updated 5 years ago
- ☆59Dec 12, 2025Updated 5 months ago
- Code and data for EMNLP-IJCNLP 2019 paper "Are You for Real? Detecting Identity Fraud via Dialogue Interactions"☆16Aug 20, 2019Updated 6 years ago