lgessler/microbert

A tiny BERT for low-resource monolingual models

☆32

Alternatives and similar repositories for microbert

Users that are interested in microbert are comparing it to the libraries listed below.

RiTUAL-MBZUAI / DA_NER
“Style Transfer as Data Augmentation: A Case Study on Named Entity Recognition” (EMNLP 2022)
☆16 · Feb 2, 2023 · Updated 3 years ago
krylm / whisper-event-tuning
Final training script from the HuggingFace Whisper fine-tuning event, used to get the best results on a fine-tuned model.
☆12 · Dec 24, 2022 · Updated 3 years ago
stefan-it / xlm-v-experiments
Experiments for XLM-V Transformers Integration
☆13 · Feb 8, 2023 · Updated 3 years ago
allenai / decon
decontamination
☆35 · Mar 4, 2026 · Updated 4 months ago
huggingface / olm-training
Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any Hugging Face text dataset.
☆98 · Feb 9, 2023 · Updated 3 years ago
yuhangear / wenet-android
☆13 · Oct 27, 2021 · Updated 4 years ago
UniversalNER / UniversalNER
☆28 · Apr 19, 2026 · Updated 3 months ago
allenai / better-promptability
☆11 · Nov 27, 2022 · Updated 3 years ago
boschresearch / adversarial_meta_embeddings
Resources related to EMNLP 2021 paper "FAME: Feature-Based Adversarial Meta-Embeddings for Robust Input Representations"
☆13 · Dec 14, 2021 · Updated 4 years ago
timoschick / one-token-approximation
This repository contains the code for applying One-Token Approximation to a pretrained language model using subword-level tokenization.
☆12 · May 7, 2020 · Updated 6 years ago
sagorbrur / codeswitch
CodeSwitch is an NLP tool that can be used for language identification, POS tagging, named entity recognition, sentiment analysis of code-mixed dat…
☆37 · Nov 2, 2020 · Updated 5 years ago
bigganbing / Fairseq_MorphTE
[NeurIPS 2022] MorphTE: Injecting Morphology in Tensorized Embeddings
☆17 · Oct 29, 2022 · Updated 3 years ago
tylerachang / word-acquisition-language-models
Word acquisition in neural language models (TACL 2022).
☆21 · Jan 30, 2025 · Updated last year
allenai / ask4help
Code for the Ask4Help project
☆22 · Nov 24, 2022 · Updated 3 years ago
m-clark / text-analysis-with-R
Workshop that demonstrates using and analyzing text in R.
☆26 · Sep 9, 2018 · Updated 7 years ago
huggingface / datasets-tagging
A Streamlit app to add structured tags to a dataset card
☆23 · Jun 30, 2022 · Updated 4 years ago
UIC-Liu-Lab / DGA
[EMNLP 2022] Adapting a Language Model While Preserving its General Knowledge
☆21 · Feb 12, 2023 · Updated 3 years ago
AI21Labs / pmi-masking
This repository includes the masking vocabulary used in the ICLR 2021 spotlight PMI-Masking paper
☆14 · Aug 9, 2021 · Updated 4 years ago
facebookresearch / comet_memory_dialog
Code for Navigating Connected Memories with a Task-oriented Dialog System
☆18 · Dec 12, 2022 · Updated 3 years ago
allenai / EmbeddingRecycling
Embedding Recycling for Language models
☆38 · Jul 11, 2023 · Updated 3 years ago
echinaceous / multilingual-probing-visualization
Codebase for probing and visualizing multilingual models.
☆48 · May 13, 2020 · Updated 6 years ago
cindyxinyiwang / expand-via-lexicon-based-adaptation
Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"
☆29 · Apr 2, 2022 · Updated 4 years ago
gonglinyuan / metro_t0
Code repo for "Model-Generated Pretraining Signals Improves Zero-Shot Generalization of Text-to-Text Transformers" (ACL 2023)
☆22 · Nov 1, 2023 · Updated 2 years ago
kayoyin / Prodigy
CSE201 Object-Oriented Programming in C++: Teach an AI to produce pieces of music
☆12 · Jan 23, 2019 · Updated 7 years ago
fritzwill / decision-tree
Python 3 implementation of decision trees using the ID3 and C4.5 algorithms. ID3 uses Information Gain as the splitting criteria and C4.5…
☆10 · Feb 17, 2025 · Updated last year
adapter-hub / hgiyt
Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"
☆28 · Oct 3, 2021 · Updated 4 years ago
microsoft / encoder-decoder-slm
Efficient encoder-decoder architecture for small language models (≤1B parameters) with cross-architecture knowledge distillation and visi…
☆32 · Feb 7, 2025 · Updated last year
liyongqi2002 / TadNER
The code and data for our paper (EMNLP 2023 findings) "Type-Aware Decomposed Framework for Few-Shot Named Entity Recognition".
☆35 · Jul 17, 2025 · Updated last year
joeljang / ELM
[ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning
☆99 · Apr 26, 2023 · Updated 3 years ago
cisnlp / ofa
[NAACL 2024] A framework that aims to wisely initialize unseen subword embeddings in PLMs for efficient large-scale continued pretraining
☆18 · Nov 26, 2023 · Updated 2 years ago
cimeister / tokenizer-intrinsic-evals
TokEval: intrinsic quality metrics for tokenizers across natural language, code, and math
☆46 · Jul 4, 2026 · Updated 2 weeks ago
rasbt / workflow-understanding-LLM-architectures
Materials for the "My Workflow for Understanding LLM Architectures" tutorial
☆25 · Apr 10, 2026 · Updated 3 months ago
masakhane-io / afriqa
Crosslingual Question Answering for African Languages
☆31 · Sep 27, 2024 · Updated last year
MlWoo / sentence2pinyin
TTS front-end
☆11 · Dec 19, 2018 · Updated 7 years ago
ltgoslo / gpt-bert
Official implementation of "GPT or BERT: why not both?"
☆64 · Jul 28, 2025 · Updated 11 months ago
pytorch-tpu / fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
☆22 · Jan 25, 2023 · Updated 3 years ago
Sreyan88 / ACLM
Code for ACL 2023 Paper: ACLM: A Selective-Denoising based Generative Data Augmentation Approach for Low-Resource Complex NER
☆22 · Jul 19, 2023 · Updated 3 years ago
Phylliida / MambaLens
Mamba support for transformer lens
☆20 · Sep 17, 2024 · Updated last year
bltlab / mot
Multilingual Open Text
☆26 · May 8, 2025 · Updated last year