Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0.
☆105May 20, 2022Updated 3 years ago
Alternatives and similar repositories for smaller-transformers
Users that are interested in smaller-transformers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Applying "Load What You Need: Smaller Versions of Multilingual BERT" to LaBSE☆19Sep 22, 2021Updated 4 years ago
- Pretraining scripts for BART transformer model☆12May 15, 2023Updated 2 years ago
- CSS-LM: Contrastive Semi-supervised Fine-tuning of Pre-trained Language Models☆12Jul 1, 2023Updated 2 years ago
- Meta Representation Transformation for Low-resource Cross-lingual Learning☆42May 5, 2021Updated 4 years ago
- Arabic News Stance Corpus☆11Feb 5, 2021Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP☆58Jul 28, 2022Updated 3 years ago
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models…☆37Apr 5, 2022Updated 4 years ago
- Personal information identification standard☆21Jan 24, 2024Updated 2 years ago
- Analyzing mBERT's multilinguality in a small laboratory setting☆13Jun 12, 2023Updated 2 years ago
- exBERT on Transformers🤗☆10Jun 14, 2021Updated 4 years ago
- Convert LaBSE model from TF Hub to PyTorch.☆16Jan 15, 2026Updated 2 months ago
- codes and pre-trained models of paper "Segatron: Segment-aware Transformer for Language Modeling and Understanding"☆18Oct 25, 2022Updated 3 years ago
- The offcial repository for 'CharacterBERT and Self-Teaching for Improving the Robustness of Dense Retrievers on Queries with Typos', SIGI…☆16May 4, 2022Updated 3 years ago
- Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning☆30Jan 25, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Simple Telegram bot to annotate and varify automatic speech recognition datasets☆12Mar 30, 2021Updated 5 years ago
- PyTorch reimplementation of the paper "SimCLS: A Simple Framework for Contrastive Learning of Abstractive Summarization"☆16Oct 17, 2021Updated 4 years ago
- The collection of bulding blocks building fine-tunable metric learning models☆35Apr 6, 2026Updated last week
- Query-focused summarization data☆44Feb 17, 2023Updated 3 years ago
- Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.☆90Sep 12, 2024Updated last year
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale☆157Dec 20, 2023Updated 2 years ago
- ☆15Apr 29, 2021Updated 4 years ago
- A software for transferring pre-trained English models to foreign languages☆19Mar 20, 2023Updated 3 years ago
- Code for "BERTifying the Hidden Markov Model for Multi-Source Weakly Supervised Named Entity Recognition"☆32Jun 20, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Getting interpretable dimensions in word embedding spaces.☆15Jul 6, 2023Updated 2 years ago
- ☆12Jun 6, 2020Updated 5 years ago
- Bayesian Deep Active Learning for Named entity recognition (NER)☆19Jan 17, 2020Updated 6 years ago
- The dataset and code for ACL 2022 paper "SciNLI: A Corpus for Natural Language Inference on Scientific Text" are released here.☆28Oct 17, 2023Updated 2 years ago
- ⚡ boost inference speed of T5 models by 5x & reduce the model size by 3x.☆590Apr 24, 2023Updated 2 years ago
- The NLPStatTest project☆12Mar 12, 2022Updated 4 years ago
- ☆118Oct 15, 2025Updated 5 months ago
- A common protocol for AI agent tools☆10Oct 21, 2024Updated last year
- The code for the paper "Adversarial Decomposition of Text Representation", NAACL 2019☆29Dec 8, 2022Updated 3 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ACL22 paper: Imputing Out-of-Vocabulary Embeddings with LOVE Makes Language Models Robust with Little Cost☆42Nov 15, 2023Updated 2 years ago
- Maximum entropy named-entity recognition (NER)☆13Dec 8, 2022Updated 3 years ago
- ☆31Apr 2, 2022Updated 4 years ago
- A repository for our AAAI-2020 Cross-lingual-NER paper. Code will be updated shortly.☆47Dec 2, 2022Updated 3 years ago
- Resources accompanying the "Zero-Shot Recommendation as Language Modeling" paper (ECIR2022)☆14May 25, 2023Updated 2 years ago
- Selection-based Question Answering☆14Feb 7, 2018Updated 8 years ago
- ☆13Oct 12, 2020Updated 5 years ago