SIC98 / GPT2-python-code-generator
GPT2 finetuning with transformers π€
β28Updated 4 years ago
Alternatives and similar repositories for GPT2-python-code-generator:
Users that are interested in GPT2-python-code-generator are comparing it to the libraries listed below
- A basic and simple tool for code auto completionβ60Updated 9 months ago
- Code Generatorβ23Updated 2 years ago
- A Streamlit app running GPT-2 language model for text classification, built with Pytorch, Transformers and AWS SageMaker.β39Updated 3 years ago
- β69Updated 2 years ago
- Large Scale Distributed Model Training strategy with Colossal AI and Lightning AIβ57Updated last year
- Fine-tuning GPT-2 Small for Question Answeringβ130Updated 2 years ago
- TorchServe+Streamlit for easily serving your HuggingFace NER modelsβ33Updated 2 years ago
- β24Updated 2 years ago
- Implementation of TableFormer, Robust Transformer Modeling for Table-Text Encoding, in Pytorchβ37Updated 3 years ago
- Source code for the GPT-2 story generation models in the EMNLP 2020 paper "STORIUM: A Dataset and Evaluation Platform for Human-in-the-Loβ¦β39Updated last year
- Codebase for the Medium Article on Fine-tuning GPT2 for Text Generationβ70Updated 4 years ago
- Use-cases of Hugging Face's BERT (e.g. paraphrase generation, unsupervised extractive summarization).β20Updated 5 years ago
- Implementation of autoregressive language model using improved Transformer and DeepSpeed pipeline parallelism.β32Updated 3 years ago
- Observe the slow deterioration of my mental sanity in the github commit historyβ12Updated last year
- BERT, RoBERTa fine-tuning over SQuAD Dataset using pytorch-lightningβ‘οΈ, π€-transformers & π€-nlp.β36Updated last year
- β28Updated 2 years ago
- A minimal TF2 re-implementation of the OpenAI GPT trainingβ57Updated 3 years ago
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixingβ49Updated 3 years ago
- Tensorflow, Pytorch, Huggingface Transformer, Fastai, etc. tutorial Colab Notebooks.β75Updated 2 years ago
- Implementation of Memory-Compressed Attention, from the paper "Generating Wikipedia By Summarizing Long Sequences"β70Updated 2 years ago
- Apply Iprompt on GLM with innovative new methods. Currently support Chinese QA, English QA and Chinese poem generation.β20Updated 2 years ago
- The source code of "Language Models are Few-shot Multilingual Learners" (MRL @ EMNLP 2021)β52Updated 2 years ago
- A package for fine-tuning Transformers with TPUs, written in Tensorflow2.0+β37Updated 4 years ago
- β44Updated 3 years ago
- This project shows how to derive the total number of training tokens from a large text dataset from π€ datasets with Apache Beam and Dataβ¦β26Updated 2 years ago
- A Benchmark for Robust, Multi-evidence, Multi-answer Question Answeringβ16Updated 2 years ago
- Code for the NLP4Prog workshop paper "Reading StackOverflow Encourages Cheating: Adding Question TextImproves Extractive Code Generation"β21Updated 3 years ago
- Companion Repo for the Vision Language Modelling YouTube series - https://bit.ly/3PsbsC2 - by Prithivi Da. Open to PRs and collaborationsβ14Updated 2 years ago
- NLP Examples using the π€ librariesβ41Updated 4 years ago
- A repo for code based language modelsβ18Updated 4 years ago