GammaTauAI / leetcode-hard-gym
A hard gym for programming
☆162 Updated last year
Alternatives and similar repositories for leetcode-hard-gym
Users who are interested in leetcode-hard-gym are comparing it to the repositories listed below.
- [NeurIPS 2023 D&B] Code repository for InterCode benchmark https://arxiv.org/abs/2306.14898 ☆232 Updated last year
- Accepted by Transactions on Machine Learning Research (TMLR) ☆136 Updated last year
- Code for the paper "Efficient Training of Language Models to Fill in the Middle" ☆194 Updated 2 years ago
- ☆277 Updated 2 years ago
- ☆173 Updated 2 years ago
- Code for paper "LEVER: Learning to Verify Language-to-Code Generation with Execution" (ICML'23) ☆90 Updated 2 years ago
- Learning to Compress Prompts with Gist Tokens - https://arxiv.org/abs/2304.08467 ☆301 Updated 10 months ago
- Run evaluation on LLMs using human-eval benchmark ☆426 Updated 2 years ago
- A codebase for "Language Models can Solve Computer Tasks" ☆238 Updated last year
- A set of utilities for running few-shot prompting experiments on large-language models ☆126 Updated 2 years ago
- ☆120 Updated last year
- Simple next-token-prediction for RLHF ☆227 Updated 2 years ago
- ☆301 Updated 2 years ago
- ToolBench, an evaluation suite for LLM tool manipulation capabilities. ☆167 Updated last year
- Open Source WizardCoder Dataset ☆162 Updated 2 years ago
- ☆84 Updated 2 years ago
- CRUXEval: Code Reasoning, Understanding, and Execution Evaluation ☆163 Updated last year
- Can Language Models Solve Olympiad Programming? ☆123 Updated 11 months ago
- Code for arXiv 2023: Improving Language Model Negotiation with Self-Play and In-Context Learning from AI Feedback ☆208 Updated 2 years ago
- ☆249 Updated 3 years ago
- Data and code for "DocPrompting: Generating Code by Retrieving the Docs" @ICLR 2023 ☆251 Updated 2 years ago
- This is the repo for the paper Shepherd -- A Critic for Language Model Generation ☆220 Updated 2 years ago
- Code for the TMLR 2023 paper "PPOCoder: Execution-based Code Generation using Deep Reinforcement Learning" ☆118 Updated last year
- The dataset and code for paper: TheoremQA: A Theorem-driven Question Answering dataset ☆160 Updated last year
- 🐙 OctoPack: Instruction Tuning Code Large Language Models ☆479 Updated 10 months ago
- ☆102 Updated last year
- A distributed, extensible, secure solution for evaluating machine generated code with unit tests in multiple programming languages. ☆61 Updated last year
- InstructCoder: Instruction Tuning Large Language Models for Code Editing | Oral ACL-2024 srw ☆64 Updated last year
- ☆137 Updated 2 years ago
- PaL: Program-Aided Language Models (ICML 2023) ☆518 Updated 2 years ago