A guide to building language technology in new languages.
☆60Feb 1, 2022Updated 4 years ago
Alternatives and similar repositories for newlang-tech
Users that are interested in newlang-tech are comparing it to the libraries listed below
Sorting:
- Scripts to create speech corpora from open.bible☆13Jan 3, 2022Updated 4 years ago
- Hosts text-to-speech corpus and speech synthesizers for African languages.☆18May 31, 2023Updated 2 years ago
- Creating super-parallel corpora of more than 1500+ unique languages for NLP research☆34Dec 8, 2022Updated 3 years ago
- Audiobook alignment for Indigenous languages☆45Feb 4, 2026Updated last month
- GUI applikation for the Klatt formant synthesizer package☆11Feb 16, 2026Updated 2 weeks ago
- Phonetically balanced text to speech sentences☆10Aug 16, 2021Updated 4 years ago
- Toy example on how to build a unit selection TTS in Spanish☆11May 10, 2019Updated 6 years ago
- Mirror of GlottHMM☆10Jun 7, 2016Updated 9 years ago
- ☆10Apr 3, 2024Updated last year
- Code for the paper "Closing the Curious Case of Neural Text Degeneration"☆11Apr 9, 2025Updated 10 months ago
- Grapheme to phoneme converter for Estonian☆14May 27, 2021Updated 4 years ago
- ACL Rolling Review website☆11Feb 24, 2026Updated last week
- ☆22Apr 8, 2022Updated 3 years ago
- Simple Kaldi recipe for forced alignment☆11Jul 16, 2023Updated 2 years ago
- A massively multilingual corpus and pretrained model for IGT☆14Feb 21, 2026Updated last week
- CMU Linguistic Annotation Backend☆15Sep 22, 2025Updated 5 months ago
- ☆14Jan 17, 2023Updated 3 years ago
- A Language-Independent Unsupervised Morphological Segmentation Framework based on Adaptor Grammars☆17Jun 14, 2024Updated last year
- This repository contains the files used for our Interspeech 2017 paper.☆16May 30, 2017Updated 8 years ago
- Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Plains Cree language☆16Feb 4, 2026Updated last month
- ☆56Dec 19, 2022Updated 3 years ago
- ASR library☆14Dec 3, 2018Updated 7 years ago
- Split words with Unicode's default word boundary specification☆13Sep 12, 2024Updated last year
- ☆12Dec 9, 2015Updated 10 years ago
- Data and scripts for the proper evaluation of cross-lingual embeddings in multiple languages☆15Apr 11, 2020Updated 5 years ago
- ☆23Oct 15, 2022Updated 3 years ago
- 🫠 check your data, before you wreck your model☆16Aug 11, 2022Updated 3 years ago
- phone inventory library☆17May 15, 2023Updated 2 years ago
- Interlinear glossing with JS & CSS☆20Aug 23, 2015Updated 10 years ago
- This repo contains the official PyTorch implementation of "Analyzing Discrete Self Supervised Speech Representation For Spoken Language M…☆20Jan 3, 2023Updated 3 years ago
- Convert Abstract Meaning Representation (AMR) into first-order logic☆16Aug 7, 2024Updated last year
- SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio…☆36Apr 25, 2025Updated 10 months ago
- Code for the paper "UnNatural Language Inference" to appear at ACL 2021 (Long Paper)☆36Aug 31, 2021Updated 4 years ago
- A dash app that transcribes 한글 into [hɑŋɡɯl].☆39Nov 6, 2025Updated 4 months ago
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests☆14Jan 24, 2017Updated 9 years ago
- Speech in Flax/JAX☆15Jul 11, 2022Updated 3 years ago
- A toolkit for Spoken Language Understanding Evaluation (SLUE) benchmark. Refer paper https://arxiv.org/abs/2111.10367 for more details. O…☆66Feb 26, 2024Updated 2 years ago
- Text-to-Speech tutorial at SLTU 2016☆35May 10, 2016Updated 9 years ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…☆18Nov 30, 2022Updated 3 years ago