philipperemy / name-datasetView external linksLinks
The Python library for names.
☆977Apr 9, 2025Updated 10 months ago
Alternatives and similar repositories for name-dataset
Users that are interested in name-dataset are comparing it to the libraries listed below
Sorting:
- A database of number names for 186 languages, locales, and scripts☆67Mar 3, 2023Updated 2 years ago
- Java Bindings for the C++ library DeepSpeech☆10Jun 4, 2020Updated 5 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- How can we improve name matching in screening tools?☆15Aug 13, 2025Updated 6 months ago
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …☆43Aug 3, 2022Updated 3 years ago
- A bunch of scripts exploiting several tools to perform inverse text normalization (ITN)☆21Sep 27, 2017Updated 8 years ago
- ☆21Sep 24, 2018Updated 7 years ago
- Convert words to numbers☆21Apr 13, 2022Updated 3 years ago
- SNAIL Attention Block for Keras.☆17Mar 30, 2020Updated 5 years ago
- Artie Bias Corpus: an audio corpus + code for detecting demographic bias☆20Jul 21, 2020Updated 5 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆23Mar 21, 2021Updated 4 years ago
- ☆22Jun 30, 2021Updated 4 years ago
- wake word spotting with kaldi☆19Dec 3, 2020Updated 5 years ago
- Util code, issues, discussions☆29Aug 31, 2018Updated 7 years ago
- The RadioTalk dataset of talk radio transcripts☆61Feb 11, 2021Updated 5 years ago
- A baseline Automatic Speech Recognition system for Polish based on Kaldi.☆18Dec 21, 2021Updated 4 years ago
- BurrMill core☆22Nov 2, 2021Updated 4 years ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆15Mar 26, 2022Updated 3 years ago
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Jun 5, 2020Updated 5 years ago
- PyTorch speech2text inference script for the NVidia openseq2seq wav2letter model variant☆10Aug 12, 2019Updated 6 years ago
- A demonstration transnational register of beneficial ownership data from the UK, Denmark, Slovakia and Armenia☆17Oct 30, 2024Updated last year
- Evaluation of STT models for german language☆15Jan 22, 2022Updated 4 years ago
- Deepparse is a state-of-the-art library for parsing multinational street addresses using deep learning☆330Oct 31, 2025Updated 3 months ago
- A simple neural truecaser written in pytorch and allennlp.☆33Jun 17, 2024Updated last year
- Grammar rules and dictionaries for the phonetic transcription of Russian sentences☆33Sep 23, 2021Updated 4 years ago
- Corpus of oral arguments (recorded speech + official transcripts) of the United States Supreme Court☆22Dec 8, 2022Updated 3 years ago
- Sound augmentation using Large-scale audio dataset (Audioset)☆45Jun 29, 2021Updated 4 years ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆32Apr 8, 2022Updated 3 years ago
- 📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.☆22Jul 12, 2019Updated 6 years ago
- Properly handle position-dependent phones in a subword lexicon FST☆31Oct 26, 2020Updated 5 years ago
- A PyTorch implementation of a punctuation prediction system using (B)LSTM, which automatically adds suitable punctuation into text withou…☆62May 13, 2020Updated 5 years ago
- ☆13Oct 27, 2021Updated 4 years ago
- Text normalization scripts from IRISA lab☆14Jun 1, 2018Updated 7 years ago
- Pure C# port of the Pocketsphinx keyword spotter☆13Jan 19, 2020Updated 6 years ago
- Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm…☆856Jan 23, 2026Updated 3 weeks ago
- Vespa application making an index of the CORD-19 dataset.☆40Jul 8, 2025Updated 7 months ago
- name2nat: a Python package for nationality prediction from a name☆115Oct 14, 2020Updated 5 years ago
- Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model☆33Jan 26, 2020Updated 6 years ago