C++ Implementation of the Information Bottleneck System
☆22Jan 9, 2019Updated 7 years ago
Alternatives and similar repositories for IBDiarization
Users that are interested in IBDiarization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A fork of Idiap Research Institute's DiarTk diarization toolkit☆16Feb 20, 2016Updated 10 years ago
- Unsupervised speech activity detection system.☆11Jul 2, 2018Updated 7 years ago
- Speech recognition using webrtc for FirefoxOS☆59Feb 10, 2014Updated 12 years ago
- Experimenting with musically motivated convolutional neural networks☆16Jun 8, 2016Updated 10 years ago
- An Android app that listens to conversations and determines who was speaking at any point in the conversation - a task known as speech di…☆14Apr 12, 2021Updated 5 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A vector DB so easy, even your grandparents can build a RAG system 😁☆23Apr 1, 2026Updated 2 months ago
- Speech Signal Processing - a small collection of routines in Python to do signal processing☆46Aug 7, 2018Updated 7 years ago
- Phonetic and phonological vocoding platform☆17Nov 23, 2016Updated 9 years ago
- 大众点评店铺信息爬虫程序,python、beautifulSoup,通过一个有规律的url,可以一页一页的获取到店铺的ID,从而完成所有的抓取工作。☆16Mar 6, 2016Updated 10 years ago
- Web server to connect Kaldi speech recognizers to real-time web clients☆17Jul 9, 2014Updated 11 years ago
- Code to accompany the paper "Learning Grimaces By Watching TV" and FaceValue dataset☆12Aug 4, 2018Updated 7 years ago
- A Python package for audio annotation and classifier training. Developed in collaboration with the WGBH Foundation and the American Archi…☆18Jun 2, 2018Updated 8 years ago
- Java API for the online speech recognition services provided by phon.ioc.ee☆18Jun 4, 2021Updated 5 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆52Oct 8, 2021Updated 4 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Python package of MP-SENet from Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement.☆21Nov 1, 2024Updated last year
- INACTIVE - http://mzl.la/ghe-archive - Tools to create ARPA models from cmu pocketsphinx dictionaries for proper g2p generation☆21Mar 29, 2019Updated 7 years ago
- Repository to hold Music Information Retrieval related resources.☆14Dec 4, 2014Updated 11 years ago
- Top level code to transcribe English audio/video files into text/subtitles☆21Jun 12, 2018Updated 7 years ago
- One-shot TTS with Improved Unseen Speaker and Style Transfer☆37Mar 2, 2022Updated 4 years ago
- Grapheme to phoneme converter for Estonian☆14May 27, 2021Updated 5 years ago
- Extension to Kaldi implementing the standard i-vector hyperparameter estimation and i-vector extraction procedure☆88Feb 23, 2018Updated 8 years ago
- Software for Decoding of High Order Ambisonics to Irregular Layouts☆13Mar 20, 2014Updated 12 years ago
- FFT for PyCuda and PyOpenCL. The package is deprecated and its functionality is merged into Reikna.☆37Feb 17, 2014Updated 12 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A simple toolkit for speaker segmentation and identification☆31Jun 15, 2013Updated 12 years ago
- Audio captioning RNN model in Keras☆14Aug 27, 2016Updated 9 years ago
- Code for synchronising all CHiME-5 audio signals for use in CHiME-6☆18Dec 2, 2019Updated 6 years ago
- The spider automatically crawls the original weibo and images of the specified user and categorizes the weibo.☆27Aug 10, 2017Updated 8 years ago
- A gym environment to train chatbots.☆20May 19, 2022Updated 4 years ago
- Deep Multi-Speech model☆11Jul 25, 2018Updated 7 years ago
- Robust Principal Component Analysis☆10Apr 1, 2014Updated 12 years ago
- A GPU language model, based on btree backed tries.☆30Mar 6, 2018Updated 8 years ago
- ☆18May 15, 2015Updated 11 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Scripts for LIUM SpkDiarization tools☆31Aug 17, 2017Updated 8 years ago
- ☆14Jun 12, 2015Updated 10 years ago
- Evaluation toolbox for Sound Event Detection☆161Jun 12, 2024Updated last year
- Перевод документации по dplyr☆11Aug 14, 2016Updated 9 years ago
- CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval☆13Jun 27, 2025Updated 11 months ago
- Demo WebApp using Kaldi DNN engine to convert speech to text☆11Jun 12, 2016Updated 9 years ago
- Script to perform statistical significance test between ASR hypotheses.☆23Aug 13, 2017Updated 8 years ago