HammingHQ / bug-in-the-code-stack

A new benchmark for measuring LLMs' ability to detect bugs in large codebases.
27 | Updated 5 months ago

Related projects

Alternatives and complementary repositories for bug-in-the-code-stack