nuprl / MultiPL-TLinks
Knowledge transfer from high-resource to low-resource programming languages for Code LLMs
☆16Updated 3 months ago
Alternatives and similar repositories for MultiPL-T
Users that are interested in MultiPL-T are comparing it to the libraries listed below
Sorting:
- Training and Benchmarking LLMs for Code Preference.☆37Updated 11 months ago
- ☆28Updated this week
- CRUXEval: Code Reasoning, Understanding, and Execution Evaluation☆156Updated last year
- Astraios: Parameter-Efficient Instruction Tuning Code Language Models☆62Updated last year
- ☆55Updated last year
- SWE-Swiss: A Multi-Task Fine-Tuning and RL Recipe for High-Performance Issue Resolution☆96Updated last month
- ☆40Updated 7 months ago
- A distributed, extensible, secure solution for evaluating machine generated code with unit tests in multiple programming languages.☆56Updated last year
- InstructCoder: Instruction Tuning Large Language Models for Code Editing | Oral ACL-2024 srw☆64Updated last year
- ☆54Updated last year
- ☆33Updated last month
- An open-source library for contamination detection in NLP datasets and Large Language Models (LLMs).☆57Updated last year
- XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts☆35Updated last year
- ☆15Updated 11 months ago
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learning☆115Updated 6 months ago
- RepoQA: Evaluating Long-Context Code Understanding☆122Updated last year
- Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers".☆80Updated last year
- Implementation and datasets for "Training Language Models to Generate Quality Code with Program Analysis Feedback"☆33Updated 3 months ago
- ☆41Updated last year
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation☆49Updated last year
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs☆60Updated last year
- A dataset of LLM-generated chain-of-thought steps annotated with mistake location.☆82Updated last year
- StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback☆72Updated last year
- Official code for the paper "CodeChain: Towards Modular Code Generation Through Chain of Self-revisions with Representative Sub-modules"☆48Updated this week
- ☆30Updated 10 months ago
- [ICLR'25] Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?"☆78Updated 11 months ago
- [ACL 2025] An inference-time decoding strategy with adaptive foresight sampling☆106Updated 5 months ago
- [ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Training☆22Updated last year
- Baselines for all tasks from Long Code Arena benchmarks 🏟️☆36Updated 7 months ago
- ☆69Updated last year