A new benchmark for measuring LLM's capability to detect bugs in large codebase.
☆32Jun 5, 2024Updated last year
Alternatives and similar repositories for bug-in-the-code-stack
Users that are interested in bug-in-the-code-stack are comparing it to the libraries listed below
Sorting:
- Benchmark evaluating LLMs on their ability to create and resist disinformation. Includes comprehensive testing across major models (Claud…☆31Mar 20, 2025Updated last year
- LaunchPad is a light-weighted Slurm job launcher designed for hyper-parameter search.☆11Aug 2, 2024Updated last year
- browse wikipedia a la andy matuschak's evergreen notes☆28Aug 30, 2024Updated last year
- Deploy a llama.cpp server on fly.io☆14Jun 20, 2024Updated last year
- Minimal Claude Code alternative. Single Python file, zero dependencies, ~250 lines.☆29Jan 14, 2026Updated 2 months ago
- Facebook integration with iOS using swift language☆12Nov 26, 2014Updated 11 years ago
- Workshop on UAVs in Multimedia: Capturing the World from a New Perspective. Reza Zhu's Solution: MBEG☆11May 17, 2024Updated last year
- Automatic support for (Claude) Skills for any coding agent that supports AGENTS.md☆28Feb 7, 2026Updated last month
- [TACL, EMNLP 2025 Oral] Code, datasets, and checkpoints for the paper "CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Thr…☆34Dec 5, 2025Updated 3 months ago
- Video Diffusion Model. Autoregressive, long context, efficient training and inference. WIP☆36Feb 17, 2026Updated last month
- Code release for Adversarial Branch Architecture Search for Unsupervised Domain Adaptation☆13Mar 5, 2022Updated 4 years ago
- ExpressJS server for the GitWit React IDE.☆16May 28, 2024Updated last year
- Official repository for BMVC 2022 paper: Global Proxy-based Hard Mining for Visual Place Recognition☆18Mar 7, 2023Updated 3 years ago
- Sample repo for Shuttle Qdrant OpenAI☆15Nov 21, 2023Updated 2 years ago
- A holistic framework for advancing LLMs as data science agents☆39Feb 3, 2026Updated last month
- ☆42Apr 23, 2024Updated last year
- A package to read NumPy .npy files using Mathematica and the Wolfram Language☆13Sep 30, 2020Updated 5 years ago
- TextMate plugin (Cocoa) shell for running 'ack'☆25Jul 5, 2013Updated 12 years ago
- College Data Reconciliation Project☆16Jun 27, 2022Updated 3 years ago
- ☆12Apr 22, 2024Updated last year
- New Modeling The Background CodeBase☆15Jan 7, 2022Updated 4 years ago
- HTTP/2 Server Push & Service Workers example☆18Mar 1, 2017Updated 9 years ago
- Inspired by Cognition Labs' Devin AI, ManasAI is an open-source AI for software engineer. It aims to automate tasks, improve code quality…☆17Sep 17, 2024Updated last year
- Dubai Tour Guide App☆15Nov 8, 2024Updated last year
- Cryptography Final Project: Merkle Signature Scheme implementation☆15Dec 11, 2020Updated 5 years ago
- When real time Yoga Position classification meets GNN☆11Sep 17, 2023Updated 2 years ago
- Open source AI Agent evaluation framework for web tasks 🐒🍌☆327Jan 1, 2025Updated last year
- ☆13Feb 4, 2025Updated last year
- Official Repository for "Communication Efficient Federated Learning with Generalized Heavy-Ball Momentum", accepted at TMLR 2025☆27Jul 14, 2025Updated 8 months ago
- two jupyter notebooks showing what could go wrong with nonces☆13Jan 4, 2019Updated 7 years ago
- Official GitHub repository of the lecture "Multimodal Deep Learning for Recommendation", at the 2024 ACM RecSys Summer School☆12Oct 12, 2024Updated last year
- ☆14Sep 4, 2024Updated last year
- ☆14Dec 21, 2025Updated 3 months ago
- Agent fixing SWE bench issues☆19May 21, 2024Updated last year
- Minimal, Akka-styled actor system for TypeScript☆21Jan 22, 2026Updated 2 months ago
- Functional Benchmarks and the Reasoning Gap☆89Oct 1, 2024Updated last year
- ☆126Aug 6, 2024Updated last year
- A robot that can interpret sheet music, convert it to a midi, then play it through on the piano☆13Sep 22, 2024Updated last year
- Linear Attention Sequence Parallelism (LASP)☆88Jun 4, 2024Updated last year