The Infibench variant of bigcode-evaluation-harness --- a framework for the evaluation of autoregressive code generation language models.
☆14Oct 19, 2024Updated last year
Alternatives and similar repositories for infibench-evaluation-harness
Users that are interested in infibench-evaluation-harness are comparing it to the libraries listed below
Sorting:
- The evaluation framework for the InfiCoder-Eval benchmark.☆21Jul 22, 2024Updated last year
- A prompt injection game to collect data for robust ML research☆68Jan 27, 2025Updated last year
- ☆28Nov 10, 2025Updated 3 months ago
- A Manually-Annotated Code Generation Benchmark Aligned with Real-World Code Repositories☆36Sep 4, 2024Updated last year
- Text to audio with Tik-Tok Voices☆13Apr 6, 2023Updated 2 years ago
- The official repo for "TheoremQA: A Theorem-driven Question Answering dataset" (EMNLP 2023)☆38May 15, 2024Updated last year
- calibrate camera with openCvSharp4☆11Jun 11, 2021Updated 4 years ago
- ☆10Sep 29, 2024Updated last year
- Topaz Photo AI upscaler inside sd-webui☆12Jul 5, 2024Updated last year
- [CVPR2024] Learning from Synthetic Human Group Activities☆14Feb 24, 2025Updated last year
- A framework for few-shot evaluation of autoregressive language models.☆12Jul 14, 2025Updated 7 months ago
- DOMAINEVAL is an auto-constructed benchmark for multi-domain code generation that consists of 2k+ subjects (i.e., description, reference …☆14Dec 12, 2024Updated last year
- Copilot with deepseek and more...☆13Mar 7, 2025Updated 11 months ago
- ☆12Jan 11, 2026Updated last month
- Simple and powerful extension for searching web and viewing website content.☆11Apr 11, 2025Updated 10 months ago
- A Swedish Natural Language Understanding Benchmark☆11Dec 12, 2025Updated 2 months ago
- Home server set up☆13Oct 5, 2025Updated 5 months ago
- A PHP 5.3+ wrapper to the NCBI/PubMed efetch API☆12Oct 14, 2020Updated 5 years ago
- [NeurIPS 2024] A comprehensive benchmark for evaluating critique ability of LLMs☆49Nov 29, 2024Updated last year
- Survey of available speech datasets for Polish ASR development☆17Jan 1, 2025Updated last year
- ☆11Oct 15, 2022Updated 3 years ago
- A Prompt Enhancer for flux.1 in ComfyUI☆12Jan 11, 2026Updated last month
- Align, a general text alignment function☆15Dec 7, 2023Updated 2 years ago
- ☆11Jan 3, 2024Updated 2 years ago
- 中文金融大模型测评基准,六大类二十五任务、等级化评价,国内模型获得A级☆10May 6, 2024Updated last year
- Simple Cron Jobs Scheduler using environment variables + Bun + TypeScript☆14Dec 9, 2025Updated 2 months ago
- benchmarks for evaluating MT models☆11Jun 26, 2024Updated last year
- BetterDiscord Installer☆10Mar 8, 2019Updated 6 years ago
- ☆10Jan 28, 2026Updated last month
- A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp☆16Feb 10, 2026Updated 3 weeks ago
- Helper functions for React Context API inspired by @reduxjs/toolkit☆11Nov 25, 2022Updated 3 years ago
- Is Neuron Coverage a Meaningful Measure for Testing Deep Neural Networks? (FSE 2020)☆10Sep 23, 2021Updated 4 years ago
- The Open Source Voice Agent Platform. Orchestrate ultra-low latency AI pipelines for real-time conversations over WebRTC.☆39Updated this week
- Custom Engineered Agents and Tools for Vibe Coders | Agents for TRAE.AI, Smart MCPs, GLM Models integration and more...☆22Dec 24, 2025Updated 2 months ago
- ☆12Nov 5, 2024Updated last year
- 🧩Using backtracking algorithm to solve binary puzzles☆11Jul 17, 2021Updated 4 years ago
- MCP server that enables AI assistants to interact with Qwen code☆23Aug 22, 2025Updated 6 months ago
- GUI for WireSock VPN client on Windows☆14Jul 8, 2024Updated last year
- SPAM filter rules for Stalwart Mail Server☆14Dec 16, 2025Updated 2 months ago