Norod / hebrew-gpt_neo
Hebrew text generation models based on EleutherAI's gpt-neo. Each was trained on a TPUv3-8 made available via the TPU Research Cloud Program.
☆22Updated 3 years ago
Alternatives and similar repositories for hebrew-gpt_neo
Users who are interested in hebrew-gpt_neo are comparing it to the repositories listed below
- HeBERT: Pre-training BERT for modern Hebrew☆80Updated 2 years ago
- ☆18Updated last year
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP☆58Updated 3 years ago
- A gym environment to train chatbots.☆21Updated 3 years ago
- Auxiliary tasks for task-oriented dialogue systems. Published in ICNLSP'22 and indexed in the ACL Anthology.☆17Updated 2 years ago
- A library to synthesize text datasets using Large Language Models (LLM)☆151Updated 2 years ago
- Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".☆99Updated 2 years ago
- Using short models to classify long texts☆21Updated 2 years ago
- All my experiments with the various transformers and various transformer frameworks available☆14Updated 4 years ago
- ☆54Updated 3 years ago
- Load What You Need: Smaller Multilingual Transformers for PyTorch and TensorFlow 2.0.☆105Updated 3 years ago
- ☆13Updated 2 years ago
- Automated paraphrase generation☆36Updated 2 years ago
- A search engine for ParlAI's BlenderBot project (and probably other ones as well)☆130Updated 3 years ago
- Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.☆84Updated last year
- 🛠️ Tools for Transformers compression using PyTorch Lightning ⚡☆85Updated last week
- Few-shot learning using EleutherAI's GPT-Neo, an open-source version of GPT-3☆18Updated 4 years ago
- Experiments with generating opensource language model assistants☆97Updated 2 years ago
- Accelerated NLP pipelines for fast inference on CPU. Built with Transformers and ONNX runtime.☆127Updated 4 years ago
- Consists of the largest (10K) human annotated code-switched semantic parsing dataset & 170K generated utterance using the CST5 augmentati…☆41Updated 2 years ago
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale☆156Updated last year
- A python package to augment text data using NLP.☆39Updated 8 months ago
- Materials for "IT5: Large-scale Text-to-text Pretraining for Italian Language Understanding and Generation" 🇮🇹☆30Updated last year
- Shared code for training sentence embeddings with Flax / JAX☆28Updated 4 years ago
- Experiments for XLM-V Transformers Integration☆13Updated 2 years ago
- Implementation of Z-BERT-A: a zero-shot pipeline for unknown intent detection.☆43Updated 2 years ago
- Comprehensive NLP Evaluation System☆188Updated last year
- Google's Meena transformer chatbot implementation☆105Updated 3 years ago
- German small and large versions of GPT2.☆20Updated 3 years ago
- TimeLMs: Diachronic Language Models from Twitter☆111Updated last year