midas-research / speechmixView external linksLinks
☆12Oct 2, 2020Updated 5 years ago
Alternatives and similar repositories for speechmix
Users that are interested in speechmix are comparing it to the libraries listed below
Sorting:
- 1st place solution to the DCASE 2020 - Task 5 - Urban Sound Tagging with Spatiotemporal Context☆16Dec 8, 2022Updated 3 years ago
- Fast and differentiable hidden Markov model in C++☆19Jan 20, 2023Updated 3 years ago
- Lightweight PDF Q&A tool powered by RAG (Retrieval-Augmented Generation) with MCP (Model Context Protocol) Support.☆21Oct 27, 2025Updated 3 months ago
- ☆42Jun 2, 2020Updated 5 years ago
- Chatbot using reinforcement learning☆19May 2, 2019Updated 6 years ago
- Download and create a tfreader for the audioset dataset☆16Apr 16, 2020Updated 5 years ago
- ☆49Dec 8, 2022Updated 3 years ago
- Jasper 기반 양자화된 모델인 Quartznet 한국어 음성인식☆22Jul 21, 2021Updated 4 years ago
- The python implementation for paper "Towards Discriminative Representation Learning for Speech Emotion Recognition" in IJCAI-2019☆23Aug 12, 2019Updated 6 years ago
- Project website for "Telling left from right: Learning spatial correspondence between sight and sound"☆25Jun 6, 2022Updated 3 years ago
- Code, source data, examples, and audio excerpts for Flow: Expressive Rhythm in the Rapping Voice☆10Feb 13, 2020Updated 6 years ago
- This is now the official location of the Kaldi project.☆24Nov 13, 2019Updated 6 years ago
- Pumilio: A Web-Based Management System for Ecological Recordings☆13Oct 29, 2018Updated 7 years ago
- NeuralWOZ: Learning to Collect Task-Oriented Dialogue via Model-based Simulation (ACL-IJCNLP 2021)☆36Jul 22, 2021Updated 4 years ago
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆34Jun 25, 2021Updated 4 years ago
- FVN is now obsolete. Please use CAPRICEP instead. I will stop updating this tool. Frequency domain variants of Velvet Noise, a flexible b…☆38Aug 12, 2020Updated 5 years ago
- One-shot TTS with Improved Unseen Speaker and Style Transfer☆37Mar 2, 2022Updated 3 years ago
- 🐝 Distributed real-time YouTube live chat collector☆42Jul 15, 2022Updated 3 years ago
- Semantic Search using FAISS & ElasticSearch☆31Jun 4, 2020Updated 5 years ago
- The stm32mp-sign-tool is an utility for signing and verifying firmware images compatible with STM32MP MPUs☆30Dec 9, 2025Updated 2 months ago
- ☆12Feb 5, 2026Updated last week
- Implementation of "Look, Listen and Recognise:character-aware audio-visual subtitling"☆19Nov 3, 2025Updated 3 months ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- ☆10Jul 24, 2019Updated 6 years ago
- Github mirror of MediaWiki extension Wikispeech - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Develo…☆12Feb 6, 2026Updated last week
- Official repository for Automated Learning Rate Scheduler for Large-Batch Training (8th ICML Workshop on AutoML)☆40Dec 3, 2021Updated 4 years ago
- PyTorch implementation of "Jasper: An End-to-End Convolutional Neural Acoustic Model" (INTERSPEECH 2019)☆32Mar 4, 2021Updated 4 years ago
- ATC-Anno is an annotation tool for Air Traffic Control data that offers automatic semantic and concept annotation.☆12Nov 17, 2023Updated 2 years ago
- Learning Complex Basis Functions for Invariant Signal Representations with the Complex Autoencoder☆38Dec 16, 2024Updated last year
- Code for learning SLAMBOOK☆10Aug 4, 2018Updated 7 years ago
- Resources for "Simple Speech Representation Learning from Perceptual Data".☆11Sep 18, 2023Updated 2 years ago
- Tool for Evaluating Multilingual WS-353 and SimLex-999☆10Dec 15, 2016Updated 9 years ago
- A Tree-LSTM-based dependency tree sentiment labeler☆15May 9, 2019Updated 6 years ago
- Research_speech_speaker_verification_nist_sre2010☆12Mar 1, 2016Updated 9 years ago
- Hashcrypt es una herramienta criptográfica que permite cifrar un texto en 5 tipos de hash.☆10Feb 8, 2021Updated 5 years ago
- Asteroid's filterbanks☆88Jan 12, 2025Updated last year
- Listen to the weather using Sonic Pi and data from Mathematica☆11Dec 6, 2018Updated 7 years ago
- PyGun: Procedural Generation of Anechoic Gunshot Sounds☆13Oct 8, 2016Updated 9 years ago
- Package containing the tools necessary for decomposing a speech signal into its modulated components (also known as AM-FM decomposition).…☆92May 23, 2025Updated 8 months ago