A tiny BERT for low-resource monolingual models
☆31Dec 24, 2025Updated 2 months ago
Alternatives and similar repositories for microbert
Users that are interested in microbert are comparing it to the libraries listed below
Sorting:
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.☆12Dec 24, 2022Updated 3 years ago
- Analyzing mBERT's multilinguality in a small laboratory setting☆13Jun 12, 2023Updated 2 years ago
- ☆11Nov 27, 2022Updated 3 years ago
- decontamination☆25Dec 3, 2025Updated 2 months ago
- “Style Transfer as Data Augmentation: A Case Study on Named Entity Recognition” (EMNLP 2022)☆16Feb 2, 2023Updated 3 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆96Feb 9, 2023Updated 3 years ago
- Resources related to EMNLP 2021 paper "FAME: Feature-Based Adversarial Meta-Embeddings for Robust Input Representations"☆13Dec 14, 2021Updated 4 years ago
- The code and data for our paper (EMNLP 2023 findings) "Type-Aware Decomposed Framework for Few-Shot Named Entity Recognition".☆35Jul 17, 2025Updated 7 months ago
- This repository includes the masking vocabulary used in the ICLR 2021 spotlight PMI-Masking paper☆14Aug 9, 2021Updated 4 years ago
- Code for Navigating Connected Memories with a Task-oriented Dialog System☆17Dec 12, 2022Updated 3 years ago
- [NeurIPS 2022]MorphTE: Injecting Morphology in Tensorized Embeddings☆17Oct 29, 2022Updated 3 years ago
- LV-BERT: Exploiting Layer Variety for BERT (Findings of ACL 2021)☆18May 10, 2023Updated 2 years ago
- Code for ACL 2023 Paper: ACLM: A Selective-Denoising based Generative Data Augmentation Approach for Low-Resource Complex NER☆21Jul 19, 2023Updated 2 years ago
- A Framework aims to wisely initialize unseen subword embeddings in PLMs for efficient large-scale continued pretraining☆18Nov 26, 2023Updated 2 years ago
- [EMNLP 2022] Adapting a Language Model While Preserving its General Knowledge☆21Feb 12, 2023Updated 3 years ago
- Datamodels for hugging face tokenizers☆99Feb 20, 2026Updated last week
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆55Apr 2, 2023Updated 2 years ago
- CD20200004 from 01/01/2021 to 31/12/2023 - LIG UGA - Python Notebook and Models for the MT Lab @ ALPS 2022☆13Apr 1, 2024Updated last year
- MasakhaNEWS: News Topic Classification for African Languages☆25May 12, 2024Updated last year
- Code for the Ask4Help project☆22Nov 24, 2022Updated 3 years ago
- Codebase for probing and visualizing multilingual models.☆49May 13, 2020Updated 5 years ago
- Multilingual Open Text☆25May 8, 2025Updated 9 months ago
- PyTorch implementation of NAACL 2021 paper "Multi-view Subword Regularization"☆26Jun 2, 2021Updated 4 years ago
- [ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning☆98Apr 26, 2023Updated 2 years ago
- Efficient encoder-decoder architecture for small language models (≤1B parameters) with cross-architecture knowledge distillation and visi…☆33Feb 7, 2025Updated last year
- A Streamlit app to add structured tags to a dataset card☆22Jun 30, 2022Updated 3 years ago
- Code for SaGe subword tokenizer (EACL 2023)☆27Nov 30, 2024Updated last year
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l…☆25Jul 27, 2024Updated last year
- This is the official PyTorch repo for "UNIREX: A Unified Learning Framework for Language Model Rationale Extraction" (ICML 2022).☆27Feb 14, 2023Updated 3 years ago
- Execute arbitrary SQL queries on 🤗 Datasets☆32Jan 24, 2024Updated 2 years ago
- https://pypi.org/project/intent-suggestions/☆10Sep 6, 2022Updated 3 years ago
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"☆28Oct 3, 2021Updated 4 years ago
- LoRA fine-tuned Stable Diffusion Deployment☆31Feb 15, 2023Updated 3 years ago
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆78Feb 10, 2026Updated 2 weeks ago
- LTG-Bert☆34Jan 8, 2024Updated 2 years ago
- 🚀🤗 A collection of templates for Hugging Face Spaces☆35Oct 9, 2023Updated 2 years ago
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆32Aug 5, 2023Updated 2 years ago
- Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"☆30Apr 2, 2022Updated 3 years ago
- ☆27Jul 25, 2023Updated 2 years ago