Multilingual Open Text
☆25May 8, 2025Updated 11 months ago
Alternatives and similar repositories for mot
Users that are interested in mot are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆11Jun 23, 2022Updated 3 years ago
- SeqScore: Scoring for named entity recognition and other sequence labeling tasks☆23Mar 30, 2026Updated last week
- Code for our TSD paper "TOKEN is a MASK: Few-shot Named Entity Recognition with Pre-trained Language Models"☆14Aug 19, 2022Updated 3 years ago
- ☆17Jan 12, 2023Updated 3 years ago
- triple-encoders is a library for contextualizing distributed Sentence Transformers representations.☆15Sep 3, 2024Updated last year
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Normalized and modified version of Bijankhan corpus☆13Feb 21, 2023Updated 3 years ago
- MINERS ⛏️: The semantic retrieval benchmark for evaluating multilingual language models. (EMNLP 2024 Findings)☆14Oct 3, 2024Updated last year
- CodemixedNLP: An Extensible and Open NLP Toolkit for Code-Switching☆18Mar 29, 2021Updated 5 years ago
- Code for EMNLP2021 paper "Allocating Large Vocabulary Capacity for Cross-lingual Language Model Pre-training"☆20Nov 12, 2021Updated 4 years ago
- ☆14Feb 3, 2021Updated 5 years ago
- Literary Language Toolkit: code, models, corpora, and web tools☆11Updated this week
- ☆21Oct 19, 2020Updated 5 years ago
- Python API for KB data-services☆19Jan 30, 2020Updated 6 years ago
- Official implementation of the ACL 2022 paper "Learning Non-Autoregressive Models from Search for Unsupervised Sentence Summarization"☆14Dec 26, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A set of pre-trained word vectors for Persian language☆15Jul 19, 2023Updated 2 years ago
- Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning☆30Jan 25, 2023Updated 3 years ago
- Virtual Data Augmentation: A Robust and General Framework for Fine-tuning Pre-trained Models☆16Sep 13, 2021Updated 4 years ago
- Repository collecting resources and best practices to improve experimental rigour in deep learning research.☆27Mar 30, 2023Updated 3 years ago
- Named Entity Recognition☆19Feb 13, 2026Updated last month
- ☆25Jan 22, 2024Updated 2 years ago
- An English lexical database from the Big 🍎, let's go Mets baby love da Mets☆18Dec 12, 2025Updated 4 months ago
- Source code for ACL 2022 paper "Self-contrastive Decorrelation for Sentence Embeddings".☆26Mar 10, 2025Updated last year
- ☆19Sep 29, 2019Updated 6 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- An observatory of anglicism usage in the Spanish press☆11May 23, 2025Updated 10 months ago
- ☆22Feb 4, 2026Updated 2 months ago
- source code of NAACL2021 "PCFGs Can Do Better: Inducing Probabilistic Context-Free Grammars with Many Symbols“ and ACL2021 main conferenc…☆52Mar 28, 2025Updated last year
- Tensorflow implementation of RankGan (Adversarial Ranking for Language Generation)☆22Jun 15, 2018Updated 7 years ago
- Implementation of the paper 'Sentence Bottleneck Autoencoders from Transformer Language Models'☆17Mar 14, 2022Updated 4 years ago
- This repository is part of an NLP course for humanities and cultural studies. This course uses historical newspapers as a source and appl…☆19Jun 5, 2025Updated 10 months ago
- Dataset Catalogue Homepage for Indonesian Languages☆10Feb 19, 2024Updated 2 years ago
- Workshop materials for scraping Twitter with Python☆13May 25, 2016Updated 9 years ago
- Source stories from the African Storybook Project in Markdown format☆22Jan 25, 2026Updated 2 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Thrift definitions, making HLT data specifications concrete☆16Jul 10, 2023Updated 2 years ago
- Small python package to measure OCR quality and other related metrics.☆27Feb 19, 2024Updated 2 years ago
- Tutorials for the julia language☆12Feb 4, 2023Updated 3 years ago
- Masakhane Web is a translation web application for solely African Languages.☆38Aug 11, 2023Updated 2 years ago
- Layout Analysis Dataset with Segmonto (LADaS)☆24Jul 12, 2025Updated 9 months ago
- The implementation for "Open Relation Modeling: Learning to Define Relations between Entities" (Findings of ACL '22)☆12Feb 28, 2022Updated 4 years ago
- A context-based spellchecker for correcting OCR output.☆21Feb 3, 2023Updated 3 years ago