(AAAI 2026) OSVBench, a new benchmark for evaluating Large Language Models (LLMs) in generating complete specification code pertaining to operating system kernel verification tasks.
☆13May 13, 2025Updated last year
Alternatives and similar repositories for OSVBench
Users that are interested in OSVBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Replication package for ISSTA2023 paper - Towards Efficient Fine-tuning of Pre-trained Code Models: An Experimental Study and Beyond☆23Apr 9, 2023Updated 3 years ago
- ☆37Nov 13, 2025Updated 6 months ago
- Honeypot.☆11Apr 8, 2024Updated 2 years ago
- Knowledge transfer from high-resource to low-resource programming languages for Code LLMs☆16Aug 12, 2025Updated 9 months ago
- Ai Bartender☆26Aug 7, 2025Updated 9 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Vim plugin for ATS☆16Jul 7, 2021Updated 4 years ago
- ☆26Jan 21, 2026Updated 3 months ago
- Language models for Coq based on data collected from the coq lsp.☆31Feb 23, 2026Updated 2 months ago
- ☆14Aug 18, 2025Updated 9 months ago
- ☆10Apr 15, 2023Updated 3 years ago
- [ISSTA 2025] A Large-scale Empirical Study on Fine-tuning Large Language Models for Unit Testing☆13Feb 9, 2025Updated last year
- ☆10May 14, 2024Updated 2 years ago
- CSS-in-JS performance tests☆10Jan 4, 2017Updated 9 years ago
- ☆12Aug 9, 2023Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆13Nov 8, 2022Updated 3 years ago
- This is the tool released in ICSE 2024 paper "Domain Knowledge Matters: Improving Prompts with Fix Templates for Repairing Python Type Er…☆17Jun 5, 2023Updated 2 years ago
- ☆16Nov 24, 2023Updated 2 years ago
- ☆13Feb 29, 2024Updated 2 years ago
- ☆16Aug 26, 2023Updated 2 years ago
- ☆10Jul 19, 2023Updated 2 years ago
- ☆16Jan 17, 2024Updated 2 years ago
- ☆16Aug 16, 2023Updated 2 years ago
- Code for ICSE'24 Paper☆14Apr 21, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆18Mar 18, 2024Updated 2 years ago
- Neural Networks with Tensorflow☆12Jun 9, 2018Updated 7 years ago
- A curated list of awesome multi-modal recommendation.☆10Mar 16, 2022Updated 4 years ago
- Mutation-based Fault Localization of Deep Neural Networks☆10Jan 25, 2024Updated 2 years ago
- ☆18Jun 30, 2022Updated 3 years ago
- Learning Program Semantics for Vulnerability Detection via Vulnerability-specific Inter-procedural Slicing☆14Aug 21, 2023Updated 2 years ago
- ☆14Mar 1, 2023Updated 3 years ago
- Source code of ICML'22 paper: FEDformer: Frequency Enhanced Decomposed Transformer for Long-term Series Forecasting☆10Jun 10, 2022Updated 3 years ago
- Concurrent-C to Rust Automatic Translator☆15Jan 26, 2023Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆13May 28, 2023Updated 2 years ago
- Vision Transformer-Inspired Automated Vulnerability Repair☆19May 13, 2025Updated last year
- [ICRA 2025] Official implementation for "TrackOcc: Camera-based 4D Panoptic Occupancy Tracking"☆57Apr 6, 2026Updated last month
- 实现 mini vite ,学习 vite 原理☆13Sep 19, 2021Updated 4 years ago
- CAT-probing: A Metric-based Approach to Interpret How Pre-trained Models for Programming Language Attend Code Structure, EMNLP 2022☆13Dec 10, 2022Updated 3 years ago
- ☆33Nov 25, 2025Updated 5 months ago
- Synthesis API Refactor☆12May 17, 2022Updated 4 years ago