TalkBank / batchalign2Links
Tools for language sample analysis.
☆29Updated this week
Alternatives and similar repositories for batchalign2
Users that are interested in batchalign2 are comparing it to the libraries listed below
Sorting:
- Universal Romanizer that can convert any unicode script to roman (latin) script☆237Updated last year
- Suite for phonetic word embeddings, especially their evaluation and baseline models.☆36Updated 11 months ago
- ☆357Updated last year
- Python package and data files for manipulating phonological segments (phones, phonemes) in terms of universal phonological features.☆292Updated 3 months ago
- TurnGPT: a Transformer-based Language Model for Predicting Turn-taking in Spoken Dialog☆63Updated last year
- Massively multilingual pronunciation mining☆361Updated 3 weeks ago
- Audiobook alignment for Indigenous languages☆45Updated 3 weeks ago
- 🙊 software for creating speech recognition models.☆160Updated last year
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!☆189Updated last week
- 🐍🍑 Python 3 library for managing, annotating, and converting natural language corpuses using popular formats (CoNLL, ELAN, Praat, CSV, …☆20Updated last year
- ☆132Updated 2 weeks ago
- Utility for behavioral and representational analyses of Language Models☆173Updated this week
- Universal multilingual automatic speech transcription into IPA☆74Updated 11 months ago
- 💬 Language Identification with Support for More Than 2000 Labels -- EMNLP 2023☆186Updated 2 months ago
- Various speech datasets made available to the public☆130Updated last year
- Lightweight self-hosted span annotation tool☆39Updated 2 weeks ago
- SpeCT - Speech Corpus Toolkit for Praat. Documentation: https://lennes.github.io/spect/☆57Updated 5 months ago
- Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation☆150Updated 2 years ago
- Curriculum training☆22Updated 7 months ago
- Python library for extracting quantitative, reproducible metrics of multi-level alignment between speakers in naturalistic language corpo…☆54Updated 2 months ago
- A guide to building language technology in new languages.☆59Updated 4 years ago
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.☆113Updated last year
- ipapy is a Python module to work with International Phonetic Alphabet (IPA) strings☆90Updated last year
- Pipeline to generate the Standardized Project Gutenberg Corpus☆208Updated 2 years ago
- A massively multilingual modern encoder language model☆125Updated 2 weeks ago
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆54Updated 2 years ago
- AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages☆80Updated 3 years ago
- PolyglotDB is a package for phonetic corpus storage and analysis☆50Updated last week
- Synthetic Dialog Generation and Analysis with LLMs☆124Updated this week
- Repository accompanying "An Open Dataset and Model for Language Identification" (Burchell et al., 2023)☆74Updated 10 months ago