Minimal code to train ELMo models in recent versions of TensorFlow
☆14Apr 30, 2023Updated 2 years ago
Alternatives and similar repositories for simple_elmo_training
Users that are interested in simple_elmo_training are comparing it to the libraries listed below
Sorting:
- ☆11Nov 14, 2021Updated 4 years ago
- ☆99Jul 7, 2020Updated 5 years ago
- ☆13Mar 27, 2020Updated 5 years ago
- ARCADE198 Dataset from the ACL 2018 MRQA Workshop☆15Oct 29, 2018Updated 7 years ago
- Getting interpretable dimensions in word embedding spaces.☆15Jul 6, 2023Updated 2 years ago
- Data and scripts for the proper evaluation of cross-lingual embeddings in multiple languages☆15Apr 11, 2020Updated 5 years ago
- pair2vec: Compositional Word-Pair Embeddings for Cross-Sentence Inference☆61Dec 8, 2022Updated 3 years ago
- Highly specialized crate to parse and use `google/sentencepiece` 's precompiled_charsmap in `tokenizers`☆20Jan 8, 2026Updated last month
- Low-code pre-built pipelines for experiments with huggingface/transformers for Data Scientists in a rush.☆16Oct 14, 2020Updated 5 years ago
- ☆32Apr 4, 2020Updated 5 years ago
- ☆16May 6, 2021Updated 4 years ago
- ☆30Sep 27, 2021Updated 4 years ago
- Code for the paper "Getting the most out of your tokenizer for pre-training and domain adaptation"☆22Feb 14, 2024Updated 2 years ago
- Official implementation of the models proposed in paper "Improving Neural Response Diversity with Frequency-Aware Cross-Entropy Loss"☆19Jun 5, 2019Updated 6 years ago
- Temporary remove unused tokens during training to save ram and speed.☆23Jun 15, 2025Updated 8 months ago
- A software for transferring pre-trained English models to foreign languages☆19Mar 20, 2023Updated 2 years ago
- Code for bidirectional sequence generation (BiSon) for generating from BERT pre-trained models.☆51Mar 17, 2020Updated 5 years ago
- Skyformer: Remodel Self-Attention with Gaussian Kernel and Nystr\"om Method (NeurIPS 2021)☆63Apr 19, 2022Updated 3 years ago
- A collection of notebooks for Natural Language Processing☆25Jan 13, 2025Updated last year
- On disentangling the menagerie of disentanglement papers☆27Dec 25, 2019Updated 6 years ago
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"☆28Oct 3, 2021Updated 4 years ago
- ☆104Jan 14, 2021Updated 5 years ago
- 🚀🤗 A collection of templates for Hugging Face Spaces☆35Oct 9, 2023Updated 2 years ago
- Code and data for Teddy https://arxiv.org/abs/2001.05171.☆15Jun 21, 2022Updated 3 years ago
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆359Feb 22, 2022Updated 4 years ago
- [NeurIPS 2022] Your Transformer May Not be as Powerful as You Expect (official implementation)☆34Aug 6, 2023Updated 2 years ago
- An example of how to use spaCy for extremely large files without running into memory issues☆36Sep 17, 2022Updated 3 years ago
- ☆27Updated this week
- материалы курса по питону для студентов дпо-программы "компьютерная лингвистика" в НИУ ВШЭ (2020-2021)☆11Feb 21, 2022Updated 4 years ago
- [NeurIPS 2025] Let LRMs Break Free from Overthinking via Self-Braking Tuning. https://arxiv.org/abs/2505.14604☆55Nov 4, 2025Updated 4 months ago
- ☆75Jul 2, 2021Updated 4 years ago
- An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline.☆86Apr 21, 2021Updated 4 years ago
- Simple-to-use scoring function for arbitrarily tokenized texts.☆47Feb 19, 2025Updated last year
- NLP Examples using the 🤗 libraries☆40Feb 21, 2021Updated 5 years ago
- Free programming language books☆10Jun 4, 2020Updated 5 years ago
- Collection of iPython notebooks with some quick demos☆11May 25, 2017Updated 8 years ago
- Factorization of the neural parameter space for zero-shot multi-lingual and multi-task transfer☆39Sep 22, 2020Updated 5 years ago
- Viewer for the 🤗 datasets library.☆86Jul 30, 2021Updated 4 years ago
- A extension of Transformers library to include T5ForSequenceClassification class.☆40Apr 17, 2023Updated 2 years ago