A new benchmark for measuring LLM's capability to detect bugs in large codebase.
☆32Jun 5, 2024Updated last year
Alternatives and similar repositories for bug-in-the-code-stack
Users that are interested in bug-in-the-code-stack are comparing it to the libraries listed below
Sorting:
- Dataset for Checking Consistency between Unstructured Notes and Structured Tables in Electronic Health Records☆26Aug 21, 2024Updated last year
- This repository contains the code and data for the paper "VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception o…☆28Jul 9, 2025Updated 7 months ago
- benchmarks for LLM tokenizers☆17Jan 15, 2026Updated last month
- Cloud Benchmarker automates performance testing of cloud instances, offering insightful charts and tracking over time.☆37Updated this week
- [TACL, EMNLP 2025 Oral] Code, datasets, and checkpoints for the paper "CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Thr…☆34Dec 5, 2025Updated 2 months ago
- A Python-based voice assistant integrating speech-to-text (STT), text-to-speech (TTS), and powerful AI capabilities using either a local …☆13Dec 8, 2025Updated 2 months ago
- This repository contains the Parasol processor, which enables next-generation privacy preserving applications. Users can run arbitrary co…☆11Jan 5, 2026Updated last month
- ☆11Jan 3, 2024Updated 2 years ago
- Code action agent with local execution sandbox and first-class support for programmatic tool calling☆126Updated this week
- Brian uses GPT-4 to control your browser and perform repetitive actions on your behalf. Currently it allows you to define ad-hoc instruct…☆36May 5, 2023Updated 2 years ago
- Linear Attention Sequence Parallelism (LASP)☆88Jun 4, 2024Updated last year
- ☆15Sep 7, 2025Updated 5 months ago
- ☆22Jun 10, 2025Updated 8 months ago
- Machine Learning for Mathematical Formalization☆11Jul 20, 2024Updated last year
- EmotionCircuits-LLM: A complete, reproducible framework for discovering and controlling emotion circuits in large language models.☆25Oct 20, 2025Updated 4 months ago
- Reinforcement Learning (PPO) applied to a multiplayer simple card game (Witches)☆10Jun 7, 2020Updated 5 years ago
- Direct transcription of an optimal control problem and resolution☆12Feb 17, 2026Updated last week
- Wait for async tasks☆13Dec 22, 2022Updated 3 years ago
- ☆37Updated this week
- Jupyter notebook templates for processing and analyzing neuroscience data.☆13Dec 28, 2025Updated last month
- ☆13Feb 4, 2025Updated last year
- A disjoint-sets/union-find implementation that allows for efficient iteration over the elements of a set.☆11Aug 8, 2023Updated 2 years ago
- A real-time collaborative code editor and previewer.☆10Mar 4, 2023Updated 2 years ago
- ☆12Aug 1, 2025Updated 6 months ago
- Transform messy HTML from Google Docs into well-structured HTML!☆13Jul 10, 2025Updated 7 months ago
- 📂 Como um grande fã da Marvel e um apaixonado por tecnologia e jogos, este projeto sem dúvidas é um dos meus favoritos até agora. O proj…☆10Aug 25, 2022Updated 3 years ago
- ☆12Oct 4, 2021Updated 4 years ago
- Fully automatic skin lesion segmentation using the Berkeley wavelet transform and UNet algorithm.☆12Jun 1, 2021Updated 4 years ago
- Python platform for parallel Surrogate-Based Optimization☆12Nov 27, 2024Updated last year
- Simple examples of using Java Libraries☆11Sep 1, 2022Updated 3 years ago
- ☆42Sep 19, 2024Updated last year
- ☆52Jul 18, 2024Updated last year
- A simple lexical scanner in Rust☆12Nov 8, 2020Updated 5 years ago
- A dApp, blockchain and crypto agnostic React UI toolkit☆11Feb 17, 2026Updated last week
- Soundboard with no limits☆12Nov 18, 2025Updated 3 months ago
- A 2D roguelike game☆12Sep 5, 2017Updated 8 years ago
- A command line interface for rxing☆10Mar 1, 2023Updated 2 years ago
- A procedural macro to combine multiple configuration methods at compile time☆12Mar 29, 2023Updated 2 years ago
- Estimate costs of complex LLM workflows in advance before spending money☆11Jan 10, 2026Updated last month