Load What You Need: Smaller Multilingual Transformers for PyTorch and TensorFlow 2.0.
☆107May 20, 2022Updated 4 years ago
Alternatives and similar repositories for smaller-transformers
Users that are interested in smaller-transformers are comparing it to the libraries listed below.
- Applying "Load What You Need: Smaller Versions of Multilingual BERT" to LaBSE☆20Sep 22, 2021Updated 4 years ago
- Pretraining scripts for BART transformer model☆12May 15, 2023Updated 3 years ago
- This repository contains a demonstrative implementation for pooling-based models, e.g., DeepPyramidion complementing our paper "Sparsifyi…☆14May 15, 2022Updated 4 years ago
- CSS-LM: Contrastive Semi-supervised Fine-tuning of Pre-trained Language Models☆12Jul 1, 2023Updated 2 years ago
- Meta Representation Transformation for Low-resource Cross-lingual Learning☆42May 5, 2021Updated 5 years ago
- Arabic News Stance Corpus☆11Feb 5, 2021Updated 5 years ago
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP☆58Jul 28, 2022Updated 3 years ago
- Analyzing mBERT's multilinguality in a small laboratory setting☆13Jun 12, 2023Updated 2 years ago
- exBERT on Transformers🤗☆10Jun 14, 2021Updated 4 years ago
- Convert LaBSE model from TF Hub to PyTorch.☆15Jan 15, 2026Updated 4 months ago
- codes and pre-trained models of paper "Segatron: Segment-aware Transformer for Language Modeling and Understanding"☆18Oct 25, 2022Updated 3 years ago
- The official repository for 'CharacterBERT and Self-Teaching for Improving the Robustness of Dense Retrievers on Queries with Typos', SIGI…☆16May 4, 2022Updated 4 years ago
- Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning☆30Jan 25, 2023Updated 3 years ago
- ☆19Apr 26, 2026Updated 3 weeks ago
- Simple Telegram bot to annotate and verify automatic speech recognition datasets☆12Mar 30, 2021Updated 5 years ago
- PyTorch reimplementation of the paper "SimCLS: A Simple Framework for Contrastive Learning of Abstractive Summarization"☆16Oct 17, 2021Updated 4 years ago
- SeqScore: Scoring for named entity recognition and other sequence labeling tasks☆23Mar 30, 2026Updated last month
- A collection of building blocks for building fine-tunable metric learning models☆35Apr 6, 2026Updated last month
- Query-focused summarization data☆44Feb 17, 2023Updated 3 years ago
- Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.☆90Sep 12, 2024Updated last year
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale☆157Dec 20, 2023Updated 2 years ago
- ☆15Apr 29, 2021Updated 5 years ago
- A software for transferring pre-trained English models to foreign languages☆19Mar 20, 2023Updated 3 years ago
- Code for "BERTifying the Hidden Markov Model for Multi-Source Weakly Supervised Named Entity Recognition"☆32Jun 20, 2023Updated 2 years ago
- Getting interpretable dimensions in word embedding spaces.☆15Jul 6, 2023Updated 2 years ago
- ☆12Jun 6, 2020Updated 5 years ago
- Bayesian Deep Active Learning for Named entity recognition (NER)☆19Jan 17, 2020Updated 6 years ago
- The dataset and code for ACL 2022 paper "SciNLI: A Corpus for Natural Language Inference on Scientific Text" are released here.☆28Oct 17, 2023Updated 2 years ago
- ⚡ boost inference speed of T5 models by 5x & reduce the model size by 3x.☆588Apr 24, 2023Updated 3 years ago
- The NLPStatTest project☆12Mar 12, 2022Updated 4 years ago
- ☆121Oct 15, 2025Updated 7 months ago
- The code for the paper "Adversarial Decomposition of Text Representation", NAACL 2019☆29Dec 8, 2022Updated 3 years ago
- A common protocol for AI agent tools☆10Oct 21, 2024Updated last year
- ACL22 paper: Imputing Out-of-Vocabulary Embeddings with LOVE Makes Language Models Robust with Little Cost☆42Nov 15, 2023Updated 2 years ago
- Maximum entropy named-entity recognition (NER)☆13Dec 8, 2022Updated 3 years ago
- A repository for our AAAI-2020 Cross-lingual-NER paper. Code will be updated shortly.☆47Dec 2, 2022Updated 3 years ago
- Resources accompanying the "Zero-Shot Recommendation as Language Modeling" paper (ECIR2022)☆14May 25, 2023Updated 3 years ago
- 📜 Dehyphenation of broken text (mainly German), i.e., extracted from a PDF☆39Mar 8, 2022Updated 4 years ago
- Selection-based Question Answering☆14Feb 7, 2018Updated 8 years ago