AhmedSSoliman / MarianCG-NL-to-Code
This repository implements MarianCG, a Transformer model developed for the code generation problem.
☆21Updated 2 years ago
Alternatives and similar repositories for MarianCG-NL-to-Code
Users that are interested in MarianCG-NL-to-Code are comparing it to the libraries listed below
- A large dataset of 4.2m Java source code and parallel data of their description from code search, and code summarization studies.☆53Updated 3 years ago
- Script for downloading GitHub.☆95Updated last year
- [EACL 2024] ICE-Score: Instructing Large Language Models to Evaluate Code☆76Updated last year
- [EMNLP 2023] The Vault: A Comprehensive Multilingual Dataset for Advancing Code Understanding and Generation☆96Updated 10 months ago
- An extension of the Transformers library to include a T5ForSequenceClassification class.☆38Updated 2 years ago
- The official code for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization☆156Updated 2 years ago
- ☆151Updated 4 years ago
- Code for the paper "Efficient Training of Language Models to Fill in the Middle"☆183Updated 2 years ago
- Code for "StructCoder: Structure-Aware Transformer for Code Generation"☆76Updated last year
- ☆165Updated 6 years ago
- Data and code for "DocPrompting: Generating Code by Retrieving the Docs" @ICLR 2023☆248Updated last year
- CodeBERTScore: an automatic metric for code generation, based on BERTScore☆195Updated last year
- ☆48Updated last year
- Language Models of Code are Few-Shot Commonsense Learners (EMNLP 2022)☆86Updated 2 years ago
- A framework for few-shot evaluation of autoregressive language models.☆105Updated 2 years ago
- Code for generating the JuICe dataset.☆37Updated 3 years ago
- Code for ProtAugment: Unsupervised diverse short-texts paraphrasing for intent detection meta-learning☆21Updated 2 years ago
- This is the code for our KILT leaderboard submissions (KGI + Re2G models).☆156Updated 2 months ago
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"☆86Updated 11 months ago
- A unified benchmark for math reasoning☆88Updated 2 years ago
- ☆45Updated 3 weeks ago
- Code for the CRAC 2021 paper "On Generalization in Coreference Resolution" (Best short paper award)☆35Updated last year
- Python tools for processing the stackexchange data dumps into a text dataset for Language Models☆83Updated last year
- An instruction-based benchmark for text improvements.☆141Updated 2 years ago
- The LM Contamination Index is a manually created database of contamination evidence for LMs.☆78Updated last year
- Adversarial Training on Transformer Networks to discover check-worthy factual claims☆78Updated last year
- Training language models to make programs faster☆91Updated last year
- Transformer-based approaches for efficient docstring generation for Python code.☆17Updated 4 years ago
- ☆124Updated last year
- LogiTorch is a PyTorch-based library for logical reasoning on natural language☆73Updated 10 months ago