bltlab/mot

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/bltlab/mot)

bltlab / mot

Multilingual Open Text

☆26

Alternatives and similar repositories for mot

Users that are interested in mot are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

bltlab / seqscore
View on GitHub
SeqScore: Scoring for named entity recognition and other sequence labeling tasks
☆23Jul 16, 2026Updated last week
KurdishBLARK / KTC
View on GitHub
Kurdish Textbooks Corpus
☆10Feb 9, 2024Updated 2 years ago
uds-lsv / TOKEN-is-a-MASK
View on GitHub
Code for our TSD paper "TOKEN is a MASK: Few-shot Named Entity Recognition with Pre-trained Language Models"
☆14Aug 19, 2022Updated 3 years ago
quadrismegistus / lltk
View on GitHub
Literary Language Toolkit: code, models, corpora, and web tools
☆11Jul 5, 2026Updated 2 weeks ago
uds-lsv / afro-maft
View on GitHub
☆17Jan 12, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
JunjieHu / amber
View on GitHub
Explicit Alignment Objectives for Multilingual Bidirectional Encoders
☆14Apr 14, 2021Updated 5 years ago
UKPLab / acl2024-triple-encoders
View on GitHub
triple-encoders is a library for contextualizing distributed Sentence Transformers representations.
☆15Sep 3, 2024Updated last year
gentaiscool / miners
View on GitHub
MINERS ⛏️: The semantic retrieval benchmark for evaluating multilingual language models. (EMNLP 2024 Findings)
☆14Oct 3, 2024Updated last year
wietsedv / xpos
View on GitHub
Make the Best of Cross-lingual Transfer: Evidence from POS Tagging with over 100 Languages (ACL 2022)
☆19May 17, 2022Updated 4 years ago
ahmetustun / hyperx
View on GitHub
☆21Dec 5, 2022Updated 3 years ago
murali1996 / CodemixedNLP
View on GitHub
CodemixedNLP: An Extensible and Open NLP Toolkit for Code-Switching
☆18Mar 29, 2021Updated 5 years ago
taesiri / PersianWordVectors
View on GitHub
A set of pre-trained word vectors for Persian language
☆15Jul 19, 2023Updated 3 years ago
bozheng-hit / VoCapXLM
View on GitHub
Code for EMNLP2021 paper "Allocating Large Vocabulary Capacity for Cross-lingual Language Model Pre-training"
☆20Nov 12, 2021Updated 4 years ago
AIPHES / Language-Agnostic-Contextualized-Encoders
View on GitHub
☆14Feb 3, 2021Updated 5 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
rtmdrr / DeepComparison
View on GitHub
☆21Oct 19, 2020Updated 5 years ago
jxjessieli / contextual-distortion-parser
View on GitHub
[ACL 2023] Contextual Distortion Reveals Constituency: Mask Language Models are Implicit Parsers.
☆14Jun 3, 2023Updated 3 years ago
KBNLresearch / KB-python-API
View on GitHub
Python API for KB data-services
☆20Jan 30, 2020Updated 6 years ago
malteos / clp-transfer
View on GitHub
Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning
☆30Jan 25, 2023Updated 3 years ago
RUCAIBox / VDA
View on GitHub
Virtual Data Augmentation: A Robust and General Framework for Fine-tuning Pre-trained Models
☆16Sep 13, 2021Updated 4 years ago
qurator-spk / sbb_ner
View on GitHub
Named Entity Recognition
☆19Feb 13, 2026Updated 5 months ago
jzbjyb / X-FACTR
View on GitHub
☆24Jun 12, 2023Updated 3 years ago
norakassner / mlama
View on GitHub
☆25Jan 22, 2024Updated 2 years ago
SAP-archive / acl2022-self-contrastive-decorrelation
View on GitHub
Source code for ACL 2022 paper "Self-contrastive Decorrelation for Sentence Embeddings".
☆26Mar 10, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ChristophAlt / fewrel
View on GitHub
Few-Shot Relation Extraction with AllenNLP
☆12Jan 27, 2019Updated 7 years ago
MANGA-UOFA / fdistill
View on GitHub
☆22Feb 4, 2026Updated 5 months ago
sustcsonglin / TN-PCFG
View on GitHub
source code of NAACL2021 "PCFGs Can Do Better: Inducing Probabilistic Context-Free Grammars with Many Symbols“ and ACL2021 main conferenc…
☆52Mar 28, 2025Updated last year
IndoNLP / nusa-catalogue
View on GitHub
Dataset Catalogue Homepage for Indonesian Languages
☆12Feb 19, 2024Updated 2 years ago
ja-mcm / OCRfixr
View on GitHub
A context-based spellchecker for correcting OCR output.
☆21Feb 3, 2023Updated 3 years ago
cookielee77 / RankGan-NIPS2017
View on GitHub
Tensorflow implementation of RankGan (Adversarial Ranking for Language Generation)
☆22Jun 15, 2018Updated 8 years ago
jeffkinnison / shadho
View on GitHub
Scalable, structured, dynamically-scheduled hyperparameter optimization.
☆19Oct 13, 2022Updated 3 years ago
global-asp / asp-source
View on GitHub
Source stories from the African Storybook Project in Markdown format
☆22Jan 25, 2026Updated 5 months ago
ivanmontero / autobot
View on GitHub
Implementation of the paper 'Sentence Bottleneck Autoencoders from Transformer Language Models'
☆17Mar 14, 2022Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
hltcoe / concrete
View on GitHub
Thrift definitions, making HLT data specifications concrete
☆17Jul 10, 2023Updated 3 years ago
fbkarsdorp / twitter-workshop
View on GitHub
Workshop materials for scraping Twitter with Python
☆13May 25, 2016Updated 10 years ago
davidbp / learn_julia
View on GitHub
Tutorials for the julia language
☆12Feb 4, 2023Updated 3 years ago
LouChao98 / neural_based_dmv
View on GitHub
☆22Apr 14, 2020Updated 6 years ago
Pleias / OCRoscope
View on GitHub
Small python package to measure OCR quality and other related metrics.
☆26Feb 19, 2024Updated 2 years ago
ieg-dhr / NLP-Course4Humanities_2024
View on GitHub
This repository is part of an NLP course for humanities and cultural studies. This course uses historical newspapers as a source and appl…
☆20Jun 5, 2025Updated last year
aparnadutta / code-mixed-lid
View on GitHub
Word-level language identification for Bangla-English code-mixed social media data, using a BiLSTM with subword embeddings.
☆10Aug 13, 2023Updated 2 years ago