The Python library for names.
☆994Apr 9, 2025Updated last year
Alternatives and similar repositories for name-dataset
Users that are interested in name-dataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A database of number names for 186 languages, locales, and scripts☆67Mar 3, 2023Updated 3 years ago
- How can we improve name matching in screening tools?☆16Aug 13, 2025Updated 9 months ago
- Java Bindings for the C++ library DeepSpeech☆10Jun 4, 2020Updated 5 years ago
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …☆43Aug 3, 2022Updated 3 years ago
- PyTorch speech2text inference script for the NVidia openseq2seq wav2letter model variant☆10Aug 12, 2019Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- SNAIL Attention Block for Keras.☆17Mar 30, 2020Updated 6 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- A demonstration transnational register of beneficial ownership data from the UK, Denmark, Slovakia and Armenia☆19Oct 30, 2024Updated last year
- ☆21Sep 24, 2018Updated 7 years ago
- A bunch of scripts exploiting several tools to perform inverse text normalization (ITN)☆21Sep 27, 2017Updated 8 years ago
- Grapheme to phoneme toolkit using joint-modelling + CRFs in java☆14Jul 14, 2018Updated 7 years ago
- A CSV file with US given names (first name) and their associated nicknames or diminutive names.☆316Apr 6, 2026Updated last month
- a python library for parsing unstructured western names into name components.☆618May 15, 2025Updated last year
- Convert words to numbers☆21Apr 13, 2022Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆15May 19, 2019Updated 7 years ago
- Deepparse is a state-of-the-art library for parsing multinational street addresses using deep learning☆342May 1, 2026Updated 2 weeks ago
- A Python library for defining rule-based overrides on messy data☆18Nov 24, 2025Updated 5 months ago
- Grammar rules and dictionaries for the phonetic transcription of Russian sentences☆33Sep 23, 2021Updated 4 years ago
- Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm…☆870Apr 20, 2026Updated last month
- Sound augmentation using Large-scale audio dataset (Audioset)☆45Jun 29, 2021Updated 4 years ago
- Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies.☆869Feb 19, 2023Updated 3 years ago
- Provide partial dates and retain the date precision through processing☆14Aug 4, 2025Updated 9 months ago
- A very simple framework for state-of-the-art Natural Language Processing (NLP)☆14,376Oct 27, 2025Updated 6 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- NSS Capstone project to use natural language modeling, classification, and information extraction to get the exact employee count values …☆15Aug 20, 2018Updated 7 years ago
- Evaluation of STT models for german language☆15Jan 22, 2022Updated 4 years ago
- Losses and decoders for end-to-end ASR and OCR☆34Oct 30, 2020Updated 5 years ago
- 📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.☆22Jul 12, 2019Updated 6 years ago
- Fast, flexible name matching for large datasets☆71Aug 29, 2025Updated 8 months ago
- Code and data used in named entity transliteration experiments☆56Jun 4, 2018Updated 7 years ago
- BurrMill core☆22Nov 2, 2021Updated 4 years ago
- NeuSpell: A Neural Spelling Correction Toolkit☆711Jul 31, 2023Updated 2 years ago
- Language independent truecaser in Python.☆160Oct 17, 2021Updated 4 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- An open-access corpus of conversational bilingual speech in Cantonese and English☆40Apr 28, 2022Updated 4 years ago
- A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.☆4,463Jul 29, 2025Updated 9 months ago
- A simple neural truecaser written in pytorch and allennlp.☆33Jun 17, 2024Updated last year
- ☆21Jul 28, 2020Updated 5 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆23Mar 21, 2021Updated 5 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆15Mar 26, 2022Updated 4 years ago
- Artie Bias Corpus: an audio corpus + code for detecting demographic bias☆20Jul 21, 2020Updated 5 years ago