PROSE Public Benchmark Suite
☆33Sep 15, 2025Updated 8 months ago
Alternatives and similar repositories for prose-benchmarks
Users that are interested in prose-benchmarks are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆13Oct 10, 2023Updated 2 years ago
- Vision Transformer-Inspired Automated Vulnerability Repair☆19May 13, 2025Updated last year
- program synthesis with neuro-symbolic differentiable interpreters☆17Sep 17, 2025Updated 8 months ago
- ☆21Jul 25, 2025Updated 9 months ago
- An implementation of Tare.☆12Feb 23, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- InferredBugs: a metadata-rich dataset of bugs and fixes in Java and C# programming languages extracted with the Infer static analyzer☆37Nov 30, 2023Updated 2 years ago
- Example Next.js application for App Runner with DynamoDB using Copilot CLI☆13Jan 29, 2026Updated 3 months ago
- Code of our paper "Method-Level Bug Severity Prediction using Source Code Metrics and LLMs" which is accepted to ISSRE 2023.☆10Nov 12, 2023Updated 2 years ago
- ☆22Jul 31, 2019Updated 6 years ago
- ☆41Jun 19, 2024Updated last year
- Contains the model patches and the eval logs from the passing swe-bench-lite run.☆10Jun 28, 2024Updated last year
- ☆12Feb 4, 2023Updated 3 years ago
- 实现一个自己的小语言模型☆11Jun 15, 2024Updated last year
- ☆83Nov 10, 2025Updated 6 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Archer2.0 evolves from its predecessor by introducing ASPO, which overcomes fundamental PPO-Clip limitations to prevent premature converg…☆31Oct 10, 2025Updated 7 months ago
- ☆20May 30, 2024Updated last year
- ☆17Jan 7, 2025Updated last year
- ☆11Nov 30, 2022Updated 3 years ago
- ☆20Feb 22, 2024Updated 2 years ago
- Code using in Paper "Smart Contract Vulnerability Detection Based on Semantic Graph and Residual Graph Convolutional Networks with Edge A…☆16Apr 24, 2023Updated 3 years ago
- 刹那是永恒☆13Feb 26, 2020Updated 6 years ago
- MTU-Bench: A Multi-granularity Tool-Use Benchmark for Large Language Models☆60Jul 24, 2025Updated 9 months ago
- Automatic Integration for Neural Spatio-Temporal Point Process models (AI-STPP) is a new paradigm for exact, efficient, non-parametric inf…☆25Oct 14, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code4Bench: A Mutildimensional Benchmark of Codeforces Data for Different Program Analysis Techniques☆17Apr 12, 2019Updated 7 years ago
- 🩺 A collection of ChatGPT evaluation reports on various bechmarks.☆50Mar 28, 2023Updated 3 years ago
- This repo is for anonymized review. We will keep updating and optimizing this program.☆16Oct 18, 2024Updated last year
- A public project which includes all test cases for fireline.☆16Jun 1, 2017Updated 8 years ago
- Swing image viewer component☆15Jul 30, 2012Updated 13 years ago
- Characterizing Transaction-Reverting Statements in Ethereum Smart Contracts.☆11Sep 1, 2021Updated 4 years ago
- ☆15Aug 27, 2022Updated 3 years ago
- [SANER 2023] MixCode: Enhancing Code Classification by Mixup-Based Data Augmentation☆15Jul 13, 2024Updated last year
- A project to automatically generate program repair recommendation in the field of smart contracts for given code snippets with their cont…☆16Aug 30, 2025Updated 8 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- No More Manual Tests? Evaluating and Improving ChatGPT for Unit Test Generation☆20Jun 28, 2023Updated 2 years ago
- Securade.ai Sentinel - A monitoring and surveillance application that enables visual Q&A and video captioning for existing CCTV cameras.☆29Apr 6, 2025Updated last year
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"☆15Apr 30, 2025Updated last year
- ☆15Nov 22, 2023Updated 2 years ago
- Code repository supporting the paper "Auto-Generating Weak Labels for Real & Synthetic Data to Improve Label-Scarce Medical Image Segment…☆12Apr 29, 2024Updated 2 years ago
- Code of Truman: Constructing Device Behavior Models from OS Drivers to Fuzz Virtual Devices (NDSS 2025)☆24Apr 11, 2025Updated last year
- ICCV 2021: Deep Co-Training with Task Decomposition for Semi-supervised Domain Adaptation☆17Dec 8, 2022Updated 3 years ago