HammingHQ/bug-in-the-code-stack

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/HammingHQ/bug-in-the-code-stack)

HammingHQ / bug-in-the-code-stack

A new benchmark for measuring LLM's capability to detect bugs in large codebase.

☆34

Alternatives and similar repositories for bug-in-the-code-stack

Users that are interested in bug-in-the-code-stack are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

lechmazur / deception
View on GitHub
Benchmark evaluating LLMs on their ability to create and resist disinformation. Includes comprehensive testing across major models (Claud…
☆33Mar 20, 2025Updated last year
anzax / dockashell
View on GitHub
DockaShell is an MCP server that gives AI agents isolated Docker containers to work in. MCP tools for shell access, file operations, and …
☆29Jun 6, 2025Updated last year
siddhantdubey / crapvectordb
View on GitHub
Very small demo for a TikTok tutorial
☆11Jun 17, 2024Updated 2 years ago
ziegler-ingo / CRAFT
View on GitHub
[TACL, EMNLP 2025 Oral] Code, datasets, and checkpoints for the paper "CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Thr…
☆35Dec 5, 2025Updated 7 months ago
psunlpgroup / VisOnlyQA
View on GitHub
This repository contains the code and data for the paper "VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception o…
☆29Jul 9, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
gitwitorg / gitwit-server
View on GitHub
ExpressJS server for the GitWit React IDE.
☆16May 28, 2024Updated 2 years ago
ayyucekizrak / EfficientNet-Transfer-Learning-Implementation
View on GitHub
EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks
☆18Jan 10, 2020Updated 6 years ago
mozilla-ai / speech-to-text
View on GitHub
Blueprint by Mozilla.ai on how to transcribe audio files
☆23Jun 13, 2025Updated last year
StillAbeginnerr / gmail-to-whatsapp-notifier
View on GitHub
☆11Jan 3, 2024Updated 2 years ago
nyunAI / Faster-LLM-Survey
View on GitHub
☆42Apr 23, 2024Updated 2 years ago
kyryl-opens-ml / fine-tune-llms-in-2024-with-trl
View on GitHub
☆12Apr 22, 2024Updated 2 years ago
stepansnigirev / chosen_nonce_demo
View on GitHub
two jupyter notebooks showing what could go wrong with nonces
☆13Jan 4, 2019Updated 7 years ago
0xcadams / hopfield
View on GitHub
🐇 Typescript-first LLM framework with static type inference, testability, and composability.
☆18Dec 1, 2024Updated last year
davanstrien / data-for-fine-tuning-llms
View on GitHub
☆80Jun 5, 2024Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
JayAhn0104 / Recommender-System-PyTorch
View on GitHub
Recommendation Model Implementation by using PyTorch
☆10Nov 1, 2022Updated 3 years ago
saurabhaloneai / image-cap
View on GitHub
image captioninggg🐳
☆12Aug 30, 2024Updated last year
citicashio / citicash
View on GitHub
☆10Jan 25, 2019Updated 7 years ago
just-every / magi
View on GitHub
Mostly Autonomous Generative Intelligence
☆21Jun 20, 2026Updated last month
OpenNLPLab / LASP
View on GitHub
Linear Attention Sequence Parallelism (LASP)
☆87Jun 4, 2024Updated 2 years ago
michaelfeil / candle-flash-attn-v3
View on GitHub
☆15Dec 21, 2025Updated 7 months ago
opper-ai / delvin
View on GitHub
Agent fixing SWE bench issues
☆19May 21, 2024Updated 2 years ago
danielearwicker / knockout.clear
View on GitHub
Minimal utilities to make it easy to get KnockoutJS to clear up garbage automatically
☆16Oct 5, 2018Updated 7 years ago
bearstech / iptraf-ng
View on GitHub
This is a fork. The upstream is at https://fedorahosted.org/iptraf-ng/
☆12Sep 2, 2015Updated 10 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
codeBelt / slush-project
View on GitHub
A slush generator to create different Client-side boilerplates while maintaining the same folder structure.
☆14Jul 15, 2016Updated 10 years ago
metalicjames / p2pool-Lyra2RE
View on GitHub
Modern P2Pool server for Lyra2RE coin, VTC.
☆11Dec 22, 2016Updated 9 years ago
som-shahlab / med-nota
View on GitHub
☆15Jun 11, 2025Updated last year
yashpokar / ManasAI
View on GitHub
Inspired by Cognition Labs' Devin AI, ManasAI is an open-source AI for software engineer. It aims to automate tasks, improve code quality…
☆17Sep 17, 2024Updated last year
jalbrethsen / double-agent
View on GitHub
☆12Aug 1, 2025Updated 11 months ago
Manish-GenAI / Deep-Learning-Based-Approach-to-Anomaly-Detection-Techniques-for-Large-Acoustic-Data-
View on GitHub
Deep-Learning-Based Approach to Anomaly Detection Techniques for Large Acoustic Data in Machine Operation.Developed a deep leaning algor…
☆21Jun 6, 2025Updated last year
zeno-ml / zeno-hub
View on GitHub
AI Evaluation Platform
☆49May 26, 2025Updated last year
center-for-humans-and-machines / transformer-heads
View on GitHub
Toolkit for attaching, training, saving and loading of new heads for transformer models
☆299Feb 12, 2026Updated 5 months ago
yiqingxyq / RepoST
View on GitHub
Code for "[COLM'25] RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing"
☆24Mar 18, 2025Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
lucasavila00 / LmScript
View on GitHub
Controllable Language Model Interactions in TypeScript
☆10May 17, 2024Updated 2 years ago
amitlevy / evolutionaryGPT
View on GitHub
Evolutionary Search for expert-level performance on any task with environmental feedback
☆14Oct 12, 2025Updated 9 months ago
labsroy007 / InVERGe
View on GitHub
☆11Mar 5, 2025Updated last year
Anindyadeep / ML_from_scratch
View on GitHub
Learning and rediscovering ML from total scratch
☆12Aug 30, 2021Updated 4 years ago
InfrHQ / Replay
View on GitHub
An Infr app that helps you replay & talk to everything you've ever seen.
☆15Sep 19, 2023Updated 2 years ago
renmengye / imageqa-qgen
View on GitHub
A question generator described in paper "Exploring Model and Data for Image Question Answering"
☆23Nov 21, 2015Updated 10 years ago
Cadenza-Labs / sleeper-agents
View on GitHub
☆15Jul 12, 2024Updated 2 years ago