cisnlp/GlotScript

[LREC 2024] 🖋 Resource and Tool for Writing System Identification

☆22

Alternatives and similar repositories for GlotScript

Users that are interested in GlotScript are comparing it to the libraries listed below.

cisnlp / GlotLID
[EMNLP 2023] 💬 Language Identification with Support for More Than 2000 Labels
☆210 · Apr 15, 2026 · Updated 3 months ago
cisnlp / Glot500
[ACL 2023] Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages
☆107 · Apr 14, 2026 · Updated 3 months ago
xinjli / phonepiece
phone inventory library
☆17 · May 15, 2023 · Updated 3 years ago
cisnlp / MEXA
[ACL 2025] 🔍 Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment
☆11 · Apr 6, 2025 · Updated last year
StonyBrookNLP / PerSenT
[COLING2020] A challenge dataset for Person SenTiment analysis in news domain.
☆11 · May 2, 2022 · Updated 4 years ago
cisnlp / ofa
[NAACL 2024] A framework that aims to wisely initialize unseen subword embeddings in PLMs for efficient large-scale continued pretraining
☆18 · Nov 26, 2023 · Updated 2 years ago
de-mh / g2p_fa
A Grapheme to Phoneme model using LSTM implemented in pytorch
☆14 · Jul 6, 2022 · Updated 4 years ago
mbanon / fastspell
Targeted language identifier, based on FastText and Hunspell.
☆38 · Sep 4, 2025 · Updated 10 months ago
kargaranamir / parstdex
A package that extracts Persian time and date markers by applying regexes -- AACL 2022
☆28 · Nov 29, 2022 · Updated 3 years ago
tylerachang / multilingual-geometry
The geometry of multilingual language model representations (EMNLP 2022).
☆22 · Oct 21, 2022 · Updated 3 years ago
swiss-ai / parity-aware-bpe
Parity-Aware Byte-Pair Encoding: Improving Cross-lingual Fairness in Tokenization [ACL 2026]
☆19 · Apr 18, 2026 · Updated 3 months ago
dmort27 / HsSPE
Haskell phonology library.
☆10 · Jan 23, 2012 · Updated 14 years ago
langtech-bsc / mt-evaluation
A framework for evaluating Machine Translation models.
☆13 · Apr 21, 2026 · Updated 3 months ago
LCR-ADS-Lab / TAALED
Tool for the automatic assessment of lexical diversity
☆14 · Sep 6, 2025 · Updated 10 months ago
ashi-ta / speechGLUE
SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.
☆13 · Jun 2, 2023 · Updated 3 years ago
afrisenti-semeval / afrisent-semeval-2023
AfriSenti-SemEval Shared Task 12: Sentiment Analysis for African languages: https://afrisenti-semeval.github.io/
☆53 · Jan 10, 2024 · Updated 2 years ago
applicaai / pyramidions
This repository contains a demonstrative implementation for pooling-based models, e.g., DeepPyramidion complementing our paper "Sparsifyi…
☆14 · May 15, 2022 · Updated 4 years ago
dinesh-git17 / passfx
A zero-knowledge, local-first TUI for managing secrets, built with standard cryptography and designed to never touch the network.
☆15 · Jun 22, 2026 · Updated 3 weeks ago
google-research / nisaba
Finite-state script normalization and processing utilities
☆52 · Jun 24, 2026 · Updated 3 weeks ago
epfl-dlab / llm-latent-language
Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers".
☆87 · Mar 11, 2024 · Updated 2 years ago
nuhaalbadi / Arabic_hatespeech
Religious Hate Speech Detection for Arabic Tweets
☆26 · Feb 1, 2019 · Updated 7 years ago
mannefedov / hse_ml_m1
Machine learning course for first-year master's students in computational linguistics at the Higher School of Economics
☆16 · May 13, 2020 · Updated 6 years ago
asafamr / SymPatternWSI
Word Sense Induction with neural Bi-language Models and symmetric patterns
☆12 · Aug 31, 2018 · Updated 7 years ago
gowitheflow-1998 / Pixel-Linguist
☆15 · Mar 8, 2024 · Updated 2 years ago
babylm / evaluation-pipeline-2024
The evaluation pipeline for the 2024 BabyLM Challenge.
☆34 · Nov 13, 2024 · Updated last year
laurieburchell / open-lid-dataset
Repository accompanying "An Open Dataset and Model for Language Identification" (Burchell et al., 2023)
☆77 · Apr 1, 2025 · Updated last year
Linguistic-Data-Consortium / ldc-bpcsad
A speech activity detector using HMMs
☆11 · Feb 11, 2026 · Updated 5 months ago
rhasspy / phonetisaurus-pypi
Python wrapper for phonetisaurus grapheme to phoneme tool
☆12 · Mar 11, 2021 · Updated 5 years ago
fonttools / unicodedata2
unicodedata backport/updates
☆39 · Mar 5, 2026 · Updated 4 months ago
jmccrae / yuzu
Micro-framework for publishing linked data
☆11 · Aug 1, 2017 · Updated 8 years ago
sai-prasanna / lmproof
Language model powered proof reader for correcting contextual errors in natural language.
☆24 · Jul 6, 2023 · Updated 3 years ago
CPJKU / wechsel
Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.
☆91 · Sep 12, 2024 · Updated last year
AndreasMadsen / course-02456-sparsemax
TensorFlow and NumPy implementation of sparsemax
☆15 · Dec 22, 2019 · Updated 6 years ago
Yaoming95 / UniPunc
The case study and multilingual performance of ICASSP submission
☆24 · Sep 24, 2022 · Updated 3 years ago
kojima-takeshi188 / lang_neuron
☆21 · Jun 24, 2024 · Updated 2 years ago
alexandra-chron / lexical_xlm_relm
PyTorch source code of NAACL 2021 paper "Improving the Lexical Ability of Pretrained Language Models for Unsupervised Neural Machine Tran…
☆18 · Oct 18, 2022 · Updated 3 years ago
WebSpellChecker / wproofreader
WProofreader software development kit (SDK) offers multilingual spelling & grammar check API and JavaScript libraries for rich text edito…
☆13 · Jun 25, 2026 · Updated 3 weeks ago
TalasZh / opencbs
Open-source core banking system (forked off of Octopus Microfinance Suite v4.7)
☆14 · May 27, 2013 · Updated 13 years ago
produle / prevjs
Static website generator that is simple to use
☆12 · Dec 17, 2022 · Updated 3 years ago