WeixiangYAN / CodeTransOcean
[EMNLP 2023] CodeTransOcean: A Comprehensive Multilingual Benchmark for Code Translation
☆47Updated last year
Related projects ⓘ
Alternatives and complementary repositories for CodeTransOcean
- This is the repo for our paper "Mr-Ben: A Comprehensive Meta-Reasoning Benchmark for Large Language Models"☆43Updated 3 weeks ago
- Codev-Bench (Code Development Benchmark), a fine-grained, real-world, repository-level, and developer-centric evaluation framework. Codev…☆25Updated 2 weeks ago
- ☆12Updated 2 months ago
- A Manually-Annotated Code Generation Benchmark Aligned with Real-World Code Repositories☆13Updated 2 months ago
- Training and Benchmarking LLMs for Code Preference.☆24Updated this week
- ☆25Updated last week
- Source code for MMEvalPro, a more trustworthy and efficient benchmark for evaluating LMMs☆22Updated last month
- This is the official repo of "QuickLLaMA: Query-aware Inference Acceleration for Large Language Models"☆39Updated 4 months ago
- ☆16Updated 4 months ago
- [ICML 2024] Self-Infilling Code Generation☆18Updated 6 months ago
- The repository of the project "Fine-tuning Large Language Models with Sequential Instructions", code base comes from open-instruct and LA…☆28Updated 4 months ago
- [ACL 2024] The project of Symbol-LLM☆42Updated 4 months ago
- The repository for paper "DebugBench: "Evaluating Debugging Capability of Large Language Models".☆57Updated 4 months ago
- MTU-Bench: A Multi-granularity Tool-Use Benchmark for Large Language Models☆18Updated last month
- Source codes for paper ”ReACC: A Retrieval-Augmented Code Completion Framework“☆59Updated 2 years ago
- Astraios: Parameter-Efficient Instruction Tuning Code Language Models☆57Updated 7 months ago
- [NeurIPS 2024] Fast Best-of-N Decoding via Speculative Rejection☆28Updated 3 weeks ago
- The implementation of paper "LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Fee…☆33Updated 3 months ago
- Artifact repository for the paper "Lost in Translation: A Study of Bugs Introduced by Large Language Models while Translating Code", In P…☆40Updated 5 months ago
- ☆85Updated 6 months ago
- [EMNLP'22] Code for 'Exploring Representation-level Augmentation for Code Search'☆25Updated last year
- A Comprehensive Benchmark for Software Development.☆84Updated 5 months ago
- We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLMs.☆51Updated 3 weeks ago
- ☆17Updated 4 months ago
- Reinforcement Learning for Repository-Level Code Completion☆15Updated 3 months ago
- Generate the WizardCoder Instruct from the CodeAlpaca☆20Updated last year
- Large Language Models Meet NL2Code: A Survey☆34Updated this week
- Code for Findings of EMNLP2023 paper "Coarse-to-Fine Dual Encoders are Better Frame Identification Learners"☆12Updated last year
- LongHeads: Multi-Head Attention is Secretly a Long Context Processor☆28Updated 7 months ago
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".☆100Updated 2 weeks ago
- The official repository of the Omni-MATH benchmark.☆49Updated 2 weeks ago