A Chainer implementation of OpenAI's finetuned transformer language model with a script to import the weights pre-trained by OpenAI
☆28Jun 20, 2018Updated 7 years ago
Alternatives and similar repositories for chainer-openai-transformer-lm
Users who are interested in chainer-openai-transformer-lm are comparing it to the libraries listed below.
- ☆14Jul 8, 2018Updated 7 years ago
- Keras like network builder for Chainer☆11Oct 22, 2017Updated 8 years ago
- ☆16Jul 10, 2023Updated 2 years ago
- Implementation of "Effective Adversarial Regularization for Neural Machine Translation", ACL 2019☆21Jan 11, 2020Updated 6 years ago
- A novel baseline model for Story Cloze Test and ROCStories☆11May 14, 2017Updated 8 years ago
- The robust text processing pipeline framework enabling customizable, efficient, and metric-logged text preprocessing.☆124Nov 13, 2025Updated 4 months ago
- INSET: Sentence Infilling with Inter-sentential Transformer☆30Nov 21, 2020Updated 5 years ago
- Dynamic Entity Representation (Kobayashi et al., 2016)☆20Aug 11, 2016Updated 9 years ago
- Chainer implementation of InfoGAN☆19Jul 13, 2017Updated 8 years ago
- Chainer example codes list☆21Aug 19, 2016Updated 9 years ago
- Sample code for natural language processing using Wikipedia☆19Oct 23, 2018Updated 7 years ago
- Now it is exported as an official example☆13Jan 24, 2018Updated 8 years ago
- Reproduction work of "Neural Relational Inference for Interacting Systems" in Chainer☆34Feb 5, 2019Updated 7 years ago
- A tiny unfussy corpus-driven chatbot based on semantic similarity☆20Oct 27, 2021Updated 4 years ago
- Transformer of "Attention Is All You Need" (Vaswani et al. 2017) by Chainer.☆324Oct 3, 2017Updated 8 years ago
- BlackOut and Adaptive Softmax for language models by Chainer☆11Oct 20, 2017Updated 8 years ago
- ParlAI Agent examples with PyTorch, Chainer and TensorFlow☆46Jan 19, 2018Updated 8 years ago
- Domain Adaptation for anime face detection☆14Nov 25, 2019Updated 6 years ago
- Monitor parameter and gradient statistics during neural network training with Chainer☆13Jan 24, 2017Updated 9 years ago
- Python binding of primitiv.☆17Sep 12, 2022Updated 3 years ago
- A fast implementation of Neural Image Caption by Chainer☆16Aug 9, 2018Updated 7 years ago
- Auxiliary GAN for WE post-specialisation☆24Feb 22, 2019Updated 7 years ago
- This is a sample code of "LSTM encoder-decoder with attention mechanism" mainly for understanding a recently developed machine translatio…☆44Mar 14, 2019Updated 7 years ago
- Chainer implementation of the paper Robust Conditional Generative Adversarial Networks☆15Jul 28, 2020Updated 5 years ago
- Data visualization app for H&M competition in kaggle☆12Apr 10, 2022Updated 3 years ago
- Generating Annotation Spreadsheet for QA-SRL Scheme☆12Feb 14, 2017Updated 9 years ago
- Official implementation of the models proposed in paper "Improving Neural Response Diversity with Frequency-Aware Cross-Entropy Loss"☆19Jun 5, 2019Updated 6 years ago
- Deliver the ready-to-train data to your NLP model.☆122Jul 15, 2022Updated 3 years ago
- Interactive application to verify multiple LLMs☆14Feb 20, 2024Updated 2 years ago
- Deep Networks with Stochastic Depth implementation by Chainer☆40Apr 11, 2016Updated 9 years ago
- ☆11Dec 8, 2022Updated 3 years ago
- implementation of SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient☆33Feb 4, 2017Updated 9 years ago
- ☆11Dec 2, 2018Updated 7 years ago
- Japanese data from the Google UDT 2.0.☆38Nov 12, 2025Updated 4 months ago
- Code for "Semantically Equivalent Adversarial Rules for Debugging NLP Models"☆87Oct 16, 2018Updated 7 years ago
- Python Binding to NVRTC☆79Oct 9, 2024Updated last year
- The official code for "Advancing Multimodal Large Language Models with Quantization-Aware Scale Learning for Efficient Adaptation" | [MM2…☆14Dec 7, 2024Updated last year
- Draws an OSM map on a Matlab axes.☆14Apr 4, 2019Updated 6 years ago
- Rust implementation of SIF and uSIF: Simple and fast sentence embedding☆19Jan 22, 2025Updated last year