Speech annotation web app for regular folk
☆23Aug 5, 2016Updated 9 years ago
Alternatives and similar repositories for aikuma-ng
Users that are interested in aikuma-ng are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The Metadata Editor for Transparent Archiving of language document materials☆25Jun 1, 2026Updated 2 weeks ago
- Code for NAACL 2018 paper "Parsing Speech: A Neural Approach to Integrating Lexical and Acoustic-Prosodic Information"☆13May 6, 2017Updated 9 years ago
- Augmentation scripts for the bAbI Dialog Tasks dataset☆13Oct 16, 2018Updated 7 years ago
- Longyin Zhang, Fang Kong, and Guodong Zhou. Adversarial Learning for Discourse Rhetorical Structure Parsing. Accepted by ACL-IJCNLP2021.☆19Jan 12, 2023Updated 3 years ago
- Companion code for Awe the Audience: How the Narrative Trajectories Affect Audience Perception in Public Speaking☆14Jan 6, 2018Updated 8 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A database of clean and noisy speech for audio research☆10Jan 26, 2018Updated 8 years ago
- pronunciation LEXicons for Any Low-resource Language☆21Jul 14, 2020Updated 5 years ago
- Audio Embeddings as Teachers for Music Classification☆13Sep 7, 2023Updated 2 years ago
- Android software for recording and translation☆30Feb 4, 2016Updated 10 years ago
- Python server for communicating with Kaldi from the browser using WebRTC☆69Sep 26, 2023Updated 2 years ago
- Fast corpus search engine originally made for the Corpus of Written Tatar language☆17Nov 9, 2019Updated 6 years ago
- Humphrey, E. J. "An Exploration of Deep Learning in Music Informatics." (2015) New York University.☆14Feb 23, 2016Updated 10 years ago
- Research_speech_speaker_verification_nist_sre2010☆12Mar 1, 2016Updated 10 years ago
- Oscillator-based speech syllabification algorithm☆11Sep 27, 2019Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Dynamic Spear Model☆12Jul 24, 2019Updated 6 years ago
- Compute the most likely permutation of a lattice given an LM☆10Jan 3, 2013Updated 13 years ago
- Example repository for a technical talk given at SenchaCon 2013☆42Dec 19, 2018Updated 7 years ago
- SEARCH by Sound Platform☆25Dec 12, 2015Updated 10 years ago
- This is now the official location of the Kaldi project.☆24Nov 13, 2019Updated 6 years ago
- Cython implementation of Moattar and Homayounpour's Voice Activity Detection (VAD) algorithm fast enough for real-time on an RPi 3.☆12Aug 18, 2018Updated 7 years ago
- A script for audio/transcript alignment. Fork of p2fa.☆69Mar 15, 2018Updated 8 years ago
- Repository for GitDOX, a GitHub Data-storage Online XML editor☆16Feb 1, 2026Updated 4 months ago
- Speech recognition in JavaScript☆18Oct 27, 2018Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Unsupervised speech activity detection system.☆11Jul 2, 2018Updated 7 years ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- Multiobjective Optimization Training of PLDA for Speaker Verification☆10Jun 14, 2018Updated 8 years ago
- This is the home directory to speaker diarization module being developed for Hetergeneous News data in RedHen Labs as a GSOC Project☆10Sep 11, 2015Updated 10 years ago
- Perform the forced decoding with target transcription☆11Sep 12, 2018Updated 7 years ago
- Scripts for recreating the Replication Dataset for Fundamental Frequency Estimation. Part of the dissertation "Pitch of Voiced Speech in …☆11Mar 29, 2021Updated 5 years ago
- Properly handle position-dependent phones in a subword lexicon FST☆31Oct 26, 2020Updated 5 years ago
- This is the repository for the Interspeech 2018 paper "Coherence models for dialogue".☆19Jan 9, 2020Updated 6 years ago
- An even smaller speech recognizer / force aligner☆36May 5, 2026Updated last month
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆61Feb 2, 2023Updated 3 years ago
- Word embeddings from PPMI-weighted and dirichlet-smoothed co-occurrence matrices☆10Aug 3, 2020Updated 5 years ago
- 搜狗细胞词库到普通文本的转换提取工具。提取词汇表,用于深度学习做数据生成和字典特征☆26Dec 3, 2018Updated 7 years ago
- EPUB Media Overlays javascript implementation☆14Aug 19, 2016Updated 9 years ago
- Per-collection OCR leaderboards using VLM-as-judge☆59Jun 2, 2026Updated 2 weeks ago
- Tools and scripts for working with ELAN☆10Aug 4, 2022Updated 3 years ago
- X-SAMPA to IPA converter☆29Nov 6, 2020Updated 5 years ago