GammaTauAI / leetcode-hard-gym
A hard gym for programming
☆153 Updated 10 months ago
Alternatives and similar repositories for leetcode-hard-gym
Users who are interested in leetcode-hard-gym are comparing it to the libraries listed below.
- Accepted by Transactions on Machine Learning Research (TMLR) ☆127 Updated 7 months ago
- Code for the TMLR 2023 paper "PPOCoder: Execution-based Code Generation using Deep Reinforcement Learning" ☆112 Updated last year
- [NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents ☆339 Updated 8 months ago
- [NeurIPS 2023 D&B] Code repository for InterCode benchmark https://arxiv.org/abs/2306.14898 ☆217 Updated last year
- Simple next-token-prediction for RLHF ☆225 Updated last year
- ☆84 Updated last year
- A set of utilities for running few-shot prompting experiments on large-language models ☆120 Updated last year
- A codebase for "Language Models can Solve Computer Tasks" ☆234 Updated last year
- A repository for transformer critique learning and generation ☆90 Updated last year
- ☆115 Updated 10 months ago
- ☆270 Updated 2 years ago
- Code accompanying the paper Pretraining Language Models with Human Preferences ☆180 Updated last year
- ☆172 Updated last year
- ☆92 Updated 10 months ago
- Open Source WizardCoder Dataset ☆158 Updated last year
- StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback ☆64 Updated 8 months ago
- SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks ☆308 Updated 6 months ago
- Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents ☆123 Updated 11 months ago
- Small and Efficient Mathematical Reasoning LLMs ☆71 Updated last year
- Code for paper "LEVER: Learning to Verify Language-to-Code Generation with Execution" (ICML'23) ☆87 Updated last year
- Fine-tune SantaCoder for Code/Text Generation. ☆192 Updated 2 years ago
- CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion (NeurIPS 2023) ☆140 Updated 9 months ago
- Data and code for "DocPrompting: Generating Code by Retrieving the Docs" @ICLR 2023 ☆246 Updated last year
- RuLES: a benchmark for evaluating rule-following in language models ☆223 Updated 2 months ago
- This is the repo for the paper Shepherd -- A Critic for Language Model Generation ☆219 Updated last year
- r2e: turn any github repository into a programming agent environment ☆119 Updated 3 weeks ago
- ☆238 Updated 2 years ago
- Learning to Compress Prompts with Gist Tokens - https://arxiv.org/abs/2304.08467 ☆285 Updated 3 months ago
- Code and data accompanying our paper on arXiv "Faithful Chain-of-Thought Reasoning". ☆159 Updated last year
- ✨ RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems - ICLR 2024 ☆163 Updated 9 months ago