nuprl / MultiPL-TLinks

Knowledge transfer from high-resource to low-resource programming languages for Code LLMs

☆14

Alternatives and similar repositories for MultiPL-T

Users that are interested in MultiPL-T are comparing it to the libraries listed below

Sorting:

amazon-science / llm-code-preference
Training and Benchmarking LLMs for Code Preference.
☆33Updated 8 months ago
evalplus / repoqa
RepoQA: Evaluating Long-Context Code Understanding
☆111Updated 8 months ago
Ablustrund / APPS_Plus
StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback
☆67Updated 10 months ago
crux-eval / eval-arena
☆28Updated this week
logic-star-ai / swt-bench
[NeurIPS 2024] Evaluation harness for SWT-Bench, a benchmark for evaluating LLM repository-level test-generation
☆51Updated last month
ntunlp / ExecEval
A distributed, extensible, secure solution for evaluating machine generated code with unit tests in multiple programming languages.
☆55Updated 8 months ago
ise-uiuc / xft
XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts
☆33Updated last year
facebookresearch / cruxeval
CRUXEval: Code Reasoning, Understanding, and Execution Evaluation
☆149Updated 9 months ago
xyliu-cs / RISE
Official Implementation of RISE (Reinforcing Reasoning with Self-Verification)
☆28Updated last week
bigcode-project / astraios
Astraios: Parameter-Efficient Instruction Tuning Code Language Models
☆58Updated last year
qishenghu / InstructCoder
InstructCoder: Instruction Tuning Large Language Models for Code Editing | Oral ACL-2024 srw
☆61Updated 9 months ago
gangiswag / cornstack
☆35Updated 3 weeks ago
CodeEditorBench / CodeEditorBench
☆49Updated last year
nuprl / CanItEdit
Can It Edit? Evaluating the Ability of Large Language Models to Follow Code Editing Instructions
☆44Updated 11 months ago
THUDM / SWE-Dev
[ACL25' Findings] SWE-Dev is an SWE agent with a scalable test case construction pipeline.
☆47Updated last week
zorazrw / odex
[EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation
☆48Updated last year
SparksofAGI / MHPP
☆31Updated 3 weeks ago
gonglinyuan / safim
☆36Updated 2 months ago
huggingface / ioi
☆35Updated 3 months ago
SalesforceAIResearch / CodeChain
Official code for the paper "CodeChain: Towards Modular Code Generation Through Chain of Self-revisions with Representative Sub-modules"
☆45Updated 6 months ago
mathllm / MathCoder2
☆63Updated 9 months ago
ntunlp / LLMSanitize
An open-source library for contamination detection in NLP datasets and Large Language Models (LLMs).
☆57Updated 11 months ago
WHGTyen / BIG-Bench-Mistake
A dataset of LLM-generated chain-of-thought steps annotated with mistake location.
☆81Updated 11 months ago
SalesforceAIResearch / swecomm
☆27Updated 6 months ago
EthanLeo-LYX / LLMQA
[WWW2024 Oral] Harnessing Multi-Role Capabilities of Large Language Models for Open-Domain Question Answering
☆15Updated 2 months ago
allenai / super-benchmark
☆45Updated 3 months ago
shunzh / mcts-for-llm
This is a pip package implementing Reinforcement Learning algorithms in non-stationary environments supported by the OpenAI Gym toolkit.
☆16Updated last year
gonglinyuan / ast_t5
☆65Updated last year
Zayne-sprague / MuSR
☆45Updated 11 months ago
NL2Code / NL2Code.github.io
Large Language Models Meet NL2Code: A Survey
☆35Updated 7 months ago