TalkBank / batchalign2Links
Tools for language sample analysis.
☆29Updated this week
Alternatives and similar repositories for batchalign2
Users that are interested in batchalign2 are comparing it to the libraries listed below
Sorting:
- Suite for phonetic word embeddings, especially their evaluation and baseline models.☆36Updated 11 months ago
- Universal Romanizer that can convert any unicode script to roman (latin) script☆237Updated last year
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!☆189Updated last week
- A massively multilingual modern encoder language model☆125Updated 2 weeks ago
- Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation☆150Updated 2 years ago
- ☆388Updated last year
- Universal multilingual automatic speech transcription into IPA☆74Updated 11 months ago
- 💬 Language Identification with Support for More Than 2000 Labels -- EMNLP 2023☆186Updated 2 months ago
- EMNLP 23 - Integrating Whisper Encoder to LLaMA Decoder for Generative ASR Error Correction☆267Updated last year
- Python package and data files for manipulating phonological segments (phones, phonemes) in terms of universal phonological features.☆292Updated 3 months ago
- TurnGPT: a Transformer-based Language Model for Predicting Turn-taking in Spoken Dialog☆63Updated last year
- Batch Support for OpenAI Whisper☆97Updated 2 years ago
- Finetune VITS and MMS using HuggingFace's tools☆189Updated last year
- ☆357Updated last year
- A subset of the popular LibriTTS dataset with subsets for English, Scottish, Welsh, and Irish accents.☆16Updated 2 years ago
- Speaker Diarization with Transformers☆70Updated 7 months ago
- Various speech datasets made available to the public☆130Updated last year
- ☆323Updated last year
- Multilingual G2P in 100 languages☆374Updated 2 years ago
- Massively multilingual pronunciation mining☆361Updated 3 weeks ago
- SlamKit is an open source tool kit for efficient training of SpeechLMs. It was used for "Slamming: Training a Speech Language Model on On…☆228Updated 8 months ago
- A TextTiling-based algorithm for text segmentation (aka topic segmentation) that uses neural sentence encoders, as well as extractive sum…☆51Updated 2 years ago
- ☆57Updated 3 years ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated 2 years ago
- Library for Textless Spoken Language Processing☆555Updated 2 years ago
- babyLM WhisBERT code☆19Updated last year
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.☆27Updated 10 months ago
- 🙊 software for creating speech recognition models.☆160Updated last year
- Open-source reproducible benchmarks from Argmax☆77Updated 2 weeks ago
- Timething is a library for aligning text transcripts with their audio recordings.☆128Updated last year