A hard gym for programming
☆166Jul 7, 2024Updated last year
Alternatives and similar repositories for leetcode-hard-gym
Users that are interested in leetcode-hard-gym are comparing it to the libraries listed below.
- [NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning☆3,114Jan 14, 2025Updated last year
- Using Large Language Models for Repo-wide Type Prediction☆113Dec 10, 2023Updated 2 years ago
- Leaderboard of Frontier Models for Program Repair https://repairbench.github.io/☆11Oct 26, 2025Updated 5 months ago
- Can It Edit? Evaluating the Ability of Large Language Models to Follow Code Editing Instructions☆48Sep 13, 2025Updated 6 months ago
- [AAAI 2025] The official code of the paper "InverseCoder: Unleashing the Power of Instruction-Tuned Code LLMs with Inverse-Instruct"(http…☆14Jul 10, 2024Updated last year
- [NeurIPS 2025] PARCO: Parallel AutoRegressive Combinatorial Optimization☆49Dec 3, 2025Updated 4 months ago
- Parallel data preprocessing for NLP and ML.☆34Nov 1, 2024Updated last year
- 🤖ConvRe🤯: An Investigation of LLMs’ Inefficacy in Understanding Converse Relations (EMNLP 2023)☆24Oct 10, 2023Updated 2 years ago
- [ICML'24] Tackling Prevalent Conditions in Unsupervised Combinatorial Optimization: Cardinality, Minimum, Covering, and More☆14Jul 12, 2024Updated last year
- Reproducing R1 for Code with Reliable Rewards☆12Apr 9, 2025Updated last year
- The repository for the paper "DebugBench: Evaluating Debugging Capability of Large Language Models".☆86Jul 13, 2024Updated last year
- [AAMAS 2025 Oral] CAMP: Collaborative Attention Model with Profiles for Vehicle Routing Problems☆32Dec 3, 2025Updated 4 months ago
- StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback☆74Aug 31, 2024Updated last year
- The repository for the paper "Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs"☆14Dec 16, 2024Updated last year
- jQuery, React and Streamlit applications written by LLMs☆15Dec 24, 2023Updated 2 years ago
- The official repository of the Omni-MATH benchmark.☆93Dec 22, 2024Updated last year
- Reflexion: an autonomous agent with dynamic memory and self-reflection☆388Nov 26, 2023Updated 2 years ago
- The collection of my research papers' illustrations.☆20Oct 15, 2023Updated 2 years ago
- A multi-programming language benchmark for LLMs☆301Jan 28, 2026Updated 2 months ago
- ☆14May 9, 2024Updated last year
- A Toolkit for Fine-Tuning Large Language Models with LoRA and DeepSpeed☆11Apr 14, 2023Updated 2 years ago
- Source code for Grounded Adaptation for Zero-shot Executable Semantic Parsing☆21Feb 1, 2021Updated 5 years ago
- Rigorous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024☆1,709Oct 2, 2025Updated 6 months ago
- ☆34Mar 21, 2026Updated 3 weeks ago
- Repository for the EMNLP 2023 Demo Paper "Reaction Miner: An Integrated System for Chemical Reaction Extraction from Textual Data"☆19Jan 27, 2025Updated last year
- ☆46Oct 11, 2023Updated 2 years ago
- An implementation of "Subspace Representations for Soft Set Operations and Sentence Similarities" (NAACL 2024)☆10May 31, 2024Updated last year
- RepoLaunch is an agentic SWE tool aimed at automating the build, execution and test of GitHub repositories across programming languages a…☆70Updated this week
- Self-Taught Optimizer (STOP): Recursively Self-Improving Code Generation☆50Jan 1, 2024Updated 2 years ago
- NAACL 2022: Can Rationalization Improve Robustness? https://arxiv.org/abs/2204.11790☆27Nov 21, 2022Updated 3 years ago
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆28Mar 30, 2026Updated last week
- This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (Neur…☆565Jan 21, 2025Updated last year
- [ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning☆30Mar 5, 2024Updated 2 years ago
- CRUXEval: Code Reasoning, Understanding, and Execution Evaluation☆169Oct 11, 2024Updated last year
- Evaluating Reward Models in Multilingual Settings (ACL Main '25)☆41May 16, 2025Updated 10 months ago
- Aider's refactoring benchmark exercises based on popular Python repos☆82Oct 10, 2024Updated last year
- ☆11Aug 26, 2024Updated last year
- [NeurIPS 2024] ReEvo: Large Language Models as Hyper-Heuristics with Reflective Evolution☆265Jan 24, 2026Updated 2 months ago
- [EMNLP 2022] Language Model Pre-Training with Sparse Latent Typing☆14Feb 10, 2023Updated 3 years ago