Norod / hebrew-gpt_neoLinks
Hebrew text generation models based on EleutherAI's gpt-neo. Each was trained on a TPUv3-8 made avilable via TPU Research Cloud Program.
β22Updated 3 years ago
Alternatives and similar repositories for hebrew-gpt_neo
Users that are interested in hebrew-gpt_neo are comparing it to the libraries listed below
Sorting:
- β53Updated 3 years ago
- Tutorial to pretrain & fine-tune a π€ Flax T5 model on a TPUv3-8 with GCPβ58Updated 3 years ago
- β18Updated last year
- β13Updated 2 years ago
- A library to synthesize text datasets using Large Language Models (LLM)β152Updated 2 years ago
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP modelsβ¦β36Updated 3 years ago
- As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021)β48Updated 4 years ago
- German small and large versions of GPT2.β20Updated 3 years ago
- Using short models to classify long textsβ21Updated 2 years ago
- Automated paraphrases Generationβ36Updated 2 years ago
- HeBERT: Pre-training BERT for modern Hebrewβ79Updated 2 years ago
- Code for the paper: Saying No is An Art: Contextualized Fallback Responses for Unanswerable Dialogue Queriesβ19Updated 3 years ago
- A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR modelsβ31Updated 4 years ago
- Training a model without a dataset for natural language inference (NLI)β25Updated 5 years ago
- Code for equipping pretrained language models (BART, GPT-2, XLNet) with commonsense knowledge for generating implicit knowledge statementβ¦β16Updated 4 years ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' puβ¦β41Updated 3 years ago
- Crawling engine that crawls a set of top-level domains looking for documents in a list of languagesβ11Updated last year
- Few Shot Learning using EleutherAI's GPT-Neo an Open-source version of GPT-3β18Updated 4 years ago
- Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.β83Updated 11 months ago
- Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".β99Updated 2 years ago
- The NewSHead dataset is a multi-doc headline dataset used in NHNet for training a headline summarization model.β37Updated 3 years ago
- Code for ProtAugment: Unsupervised diverse short-texts paraphrasing for intent detection meta-learningβ21Updated 3 years ago
- TimeLMs: Diachronic Language Models from Twitterβ110Updated last year
- [EMNLP-Findings 2020] Adapting BERT for Word Sense Disambiguation with Gloss Selection Objective and Example Sentencesβ63Updated last year
- On Generating Extended Summaries of Long Documentsβ78Updated 4 years ago
- Ranking of fine-tuned HF models as base models.β36Updated 3 months ago
- β12Updated 4 years ago
- Many Natural Language Processing tasks rely on sentence boundary detection (SBD). Although amazing libraries like spacy provide state of β¦β61Updated 4 years ago
- A package for fine-tuning Transformers with TPUs, written in Tensorflow2.0+β37Updated 4 years ago
- RaKUn 2.0 - A fast keyword detection algorithmβ68Updated 3 weeks ago