An open-access corpus of conversational bilingual speech in Cantonese and English
☆40Apr 28, 2022Updated 4 years ago
Alternatives and similar repositories for SpiCE-Corpus
Users who are interested in SpiCE-Corpus are comparing it to the libraries listed below.
- A database of number names for 186 languages, locales, and scripts☆67Mar 3, 2023Updated 3 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆23Mar 21, 2021Updated 5 years ago
- Namuwiki dataset segmented into sentences. Download it from Releases or via tfds-korean.☆19Jun 16, 2021Updated 4 years ago
- PyTorch speech2text inference script for the NVidia openseq2seq wav2letter model variant☆10Aug 12, 2019Updated 6 years ago
- Minimal TensorFlow Docker image with SyntaxNet/DRAGNN based on Alpine Linux☆32Oct 7, 2020Updated 5 years ago
- The Shmoop Corpus☆17Oct 27, 2020Updated 5 years ago
- ☆88Mar 11, 2020Updated 6 years ago
- TEMP☆34Apr 2, 2020Updated 6 years ago
- Resources for "Simple Speech Representation Learning from Perceptual Data".☆11Sep 18, 2023Updated 2 years ago
- KoParadigm: Korean Inflectional Paradigm Generator☆58Nov 23, 2022Updated 3 years ago
- Korean Abstract Meaning Representation (AMR) Corpus☆10Feb 27, 2022Updated 4 years ago
- Official implementation of SIGIR 2022 Paper "Task-Oriented Dialogue System as Natural Language Generation".☆14Apr 6, 2022Updated 4 years ago
- Basic python tornado app for handling websocket audio☆10Oct 5, 2023Updated 2 years ago
- NeuralWOZ: Learning to Collect Task-Oriented Dialogue via Model-based Simulation (ACL-IJCNLP 2021)☆36Jul 22, 2021Updated 4 years ago
- Reference implementation of the paper "Word Embeddings for Entity-annotated Texts"☆18Apr 12, 2019Updated 7 years ago
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…☆45May 25, 2021Updated 5 years ago
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …☆43Aug 3, 2022Updated 3 years ago
- Google's TPGST reimplementation.☆34Dec 11, 2019Updated 6 years ago
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and …☆50Dec 6, 2024Updated last year
- some tutorials for blog: simonjisu.github.io☆23Mar 25, 2021Updated 5 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- A tool for converting the Sejong parsed corpus into dependency structures☆10Sep 7, 2018Updated 7 years ago
- Prosody-semantics Interface in Seoul Korean☆12Oct 9, 2020Updated 5 years ago
- Official implementation of EMNLP 2021 Paper "Rethinking Zero-shot Neural Machine Translation: From a Perspective of Latent Variables"☆12May 15, 2023Updated 3 years ago
- SOTA punctuation restoration (e.g. for automatic speech recognition) deep learning model based on a pre-trained BERT model☆183May 17, 2019Updated 7 years ago
- ☆11Aug 12, 2020Updated 5 years ago
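- A feature extraction tool for Chinese characters that extracts pronunciation (initial, final, tone), glyph (component, radical), four-corner code, and other features, which can also be fed into a model as tensors☆137May 25, 2020Updated 6 years ago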
- Pumilio: A Web-Based Management System for Ecological Recordings☆13Oct 29, 2018Updated 7 years ago
- Hanzi Converter for Traditional and Simplified Chinese☆190Mar 28, 2020Updated 6 years ago
- g2pC: A Context-aware Grapheme-to-Phoneme Conversion module for Chinese☆245Jul 10, 2019Updated 6 years ago
- ☆13Nov 30, 2022Updated 3 years ago
- ☆13Jul 26, 2021Updated 4 years ago
- ☆17Jun 30, 2020Updated 5 years ago
- Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.☆263Oct 11, 2019Updated 6 years ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆32Apr 8, 2022Updated 4 years ago
- A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html☆28Mar 17, 2026Updated 2 months ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆61Feb 2, 2023Updated 3 years ago
- Build a dialog dataset from online books in many languages☆75Oct 25, 2022Updated 3 years ago
- This is a corpus of Chinese abbreviations, including negative full forms.☆199Jul 17, 2021Updated 4 years ago