ltgoslo/ltg-bert

LTG-Bert

☆34

Alternatives and similar repositories for ltg-bert

Users who are interested in ltg-bert are comparing it to the repositories listed below.

ikergarcia1996 / T-Projection
T-Projection is a method to perform high-quality Annotation Projection of Sequence Labeling datasets.
☆13 · Nov 21, 2023 · Updated 2 years ago
ltgoslo / gpt-bert
Official implementation of "GPT or BERT: why not both?"
☆62 · Jul 28, 2025 · Updated 7 months ago
huggingface / olm-training
Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.
☆96 · Feb 9, 2023 · Updated 3 years ago
ottowg / gsap-ner
☆10 · Oct 2, 2024 · Updated last year
chandar-lab / NeoBERT
☆107 · Jun 2, 2025 · Updated 9 months ago
TurkuNLP / bert-eval
☆10 · Oct 15, 2019 · Updated 6 years ago
Tomiinek / Aargh
☆12 · Jan 2, 2024 · Updated 2 years ago
allenai / staged-training
Staged Training for Transformer Language Models
☆33 · Mar 31, 2022 · Updated 3 years ago
ufal / multilexnorm2021
MultiLexNorm 2021 competition system from ÚFAL
☆15 · Dec 30, 2021 · Updated 4 years ago
ejmichaud / precision-ml
☆13 · Feb 12, 2023 · Updated 3 years ago
boschresearch / adversarial_meta_embeddings
Resources related to EMNLP 2021 paper "FAME: Feature-Based Adversarial Meta-Embeddings for Robust Input Representations"
☆13 · Dec 14, 2021 · Updated 4 years ago
UKPLab / acl2024-triple-encoders
triple-encoders is a library for contextualizing distributed Sentence Transformers representations.
☆15 · Sep 3, 2024 · Updated last year
babylm / evaluation-pipeline-2025
☆23 · Aug 19, 2025 · Updated 6 months ago
stefan-it / gc4lm
GC4LM: A Colossal (Biased) language model for German
☆13 · May 2, 2021 · Updated 4 years ago
Ankush7890 / ssfinetuning
A package for fine tuning of pretrained NLP transformers using Semi Supervised Learning
☆14 · Oct 27, 2021 · Updated 4 years ago
mittagessen / curt
☆14 · Jul 11, 2022 · Updated 3 years ago
tatianapassali / artificial-disfluency-generation
Generating artificial disfluencies from fluent text easily and promptly
☆15 · Sep 28, 2022 · Updated 3 years ago
aiintelligentsystems / next-level-bert
☆15 · Jun 14, 2024 · Updated last year
ielab / CharacterBERT-DR
The official repository for 'CharacterBERT and Self-Teaching for Improving the Robustness of Dense Retrievers on Queries with Typos', SIGI…
☆16 · May 4, 2022 · Updated 3 years ago
ltgoslo / norbench
Natural language understanding benchmarks for Norwegian
☆14 · Aug 29, 2025 · Updated 6 months ago
mrpeerat / SCT
SCT: An Efficient Self-Supervised Cross-View Training For Sentence Embedding (TACL)
☆16 · Jul 27, 2024 · Updated last year
Knowledgator / FlashDeBERTa
Truly flash implementation of DeBERTa disentangled attention mechanism.
☆78 · Feb 10, 2026 · Updated 3 weeks ago
tigerchen52 / LOVE
ACL22 paper: Imputing Out-of-Vocabulary Embeddings with LOVE Makes Language Models Robust with Little Cost
☆42 · Nov 15, 2023 · Updated 2 years ago
IDSIA / lmtool-fwp
PyTorch Language Modeling Toolkit for Fast Weight Programmers
☆19 · Jun 11, 2025 · Updated 8 months ago
sophiaalthammer / parm
This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…
☆41 · Jan 5, 2022 · Updated 4 years ago
apehex / tokun
Tokun to can tokens
☆18 · Jun 19, 2025 · Updated 8 months ago
osainz59 / t5-encoder
An extension of the Transformers library to include a T5ForSequenceClassification class.
☆40 · Apr 17, 2023 · Updated 2 years ago
proger / uk4b
GPT-2 Metadata Pretraining Towards Instruction Finetuning for Ukrainian
☆20 · Aug 6, 2023 · Updated 2 years ago
marionbartl / gender-bias-BERT
This repository holds the code for my master thesis entitled "The Association of Gender Bias with BERT - Measuring, Mitigating and Cross-…
☆18 · Sep 19, 2022 · Updated 3 years ago
mainlp / germanic-lrl-corpora
Overview of corpora/datasets for Germanic low-resource languages and dialects. Accompanies "A Survey of Corpora for Germanic Low-Resource…
☆26 · Feb 16, 2026 · Updated 2 weeks ago
masakhane-io / masakhane-news
MasakhaNEWS: News Topic Classification for African Languages
☆25 · May 12, 2024 · Updated last year
NathanGodey / headless-lm
Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…
☆28 · Apr 17, 2024 · Updated last year
stefan-it / italian-bertelectra
🇮🇹 Italian BERT and ELECTRA models (incl. evaluation)
☆18 · Oct 20, 2022 · Updated 3 years ago
vanzytay / NIPS2018_RCRN
Tensorflow Source code for "Recurrently Controlled Recurrent Networks" (NIPS 2018)
☆23 · Oct 25, 2018 · Updated 7 years ago
unlp-workshop / unlp-2025-shared-task
UNLP 2025 Shared Task on Detecting Social Media Manipulation
☆23 · Aug 4, 2025 · Updated 7 months ago
bminixhofer / tokenkit
A toolkit implementing advanced methods to transfer models and model knowledge across tokenizers.
☆64 · Jul 6, 2025 · Updated 8 months ago
ChrisHayduk / QLoRA-for-MLM
QLoRA for Masked Language Modeling
☆23 · Sep 11, 2023 · Updated 2 years ago
cimeister / tokenizer-analysis-suite
☆44 · Feb 11, 2026 · Updated 3 weeks ago
google / meta_tagger
☆48 · Dec 23, 2018 · Updated 7 years ago