Tools for parsing the audio track in television news programs
☆19Apr 24, 2021Updated 5 years ago
Alternatives and similar repositories for Audio
Users that are interested in Audio are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Google Summer of Code 2018 Project: Multilingual Neural Machine Translation System for TV News☆26Jan 21, 2024Updated 2 years ago
- ☆18Aug 29, 2020Updated 5 years ago
- A recipe for creating a Speaker Identification system built on Kaldi.☆15Jan 2, 2020Updated 6 years ago
- Dialect identification using Siamese network☆15Dec 12, 2017Updated 8 years ago
- The collection of bulding blocks building fine-tunable metric learning models☆35Jun 10, 2026Updated 2 weeks ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- brainless concatenative text to speech☆16May 11, 2021Updated 5 years ago
- This repository contains the code to reproduce the core results from the paper "Scalable Factorized Hierarchical Variational Autoencoders…☆53Apr 11, 2018Updated 8 years ago
- Source code for paper "Breaking Security-Critical Voice Authentication".☆13Jul 10, 2023Updated 2 years ago
- This is the home directory to speaker diarization module being developed for Hetergeneous News data in RedHen Labs as a GSOC Project☆10Sep 11, 2015Updated 10 years ago
- This repository allows to use kaldi to train an i-vector extractor and extract i-vectors through a python interface.☆11Nov 27, 2017Updated 8 years ago
- An integration of Qdrant ANN vector database backend with txtai☆25Jun 5, 2026Updated 3 weeks ago
- System for Emotion Detection in given speech data using joint modelling of hand crafted prosody rich features , MFCC features and LSTM ba…☆10Nov 15, 2017Updated 8 years ago
- PolyglotDB is a package for phonetic corpus storage and analysis☆51Jun 2, 2026Updated 3 weeks ago
- Tools for the automatic detection of speech-related inhalation events and characterisation of the speech respiratory cycle.☆11Feb 17, 2024Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Codes for ACL2018 Multimodal Language Workshop paper☆10May 24, 2018Updated 8 years ago
- Google Summer of Code 2017 Project: Development of Speech Recognition Module for Red Hen Lab☆44Aug 29, 2017Updated 8 years ago
- Respiratory Disorder Classification Based on Lung Auscultation sounds☆13Oct 22, 2024Updated last year
- Example on how to use pre trained networks on new classification problems.☆15Oct 3, 2018Updated 7 years ago
- A versatile, easily configurable vocoder software in MATLAB, for research purposes☆14Apr 9, 2021Updated 5 years ago
- Text-based media editing interface☆16Aug 9, 2017Updated 8 years ago
- Python interface to Optotune focus-tunable lenses☆15Feb 4, 2020Updated 6 years ago
- Vocal Tract Modelling by Murphy, Shelley and Ternström☆17Nov 13, 2022Updated 3 years ago
- A fully and partially fake speech dataset for evaluation☆15Nov 11, 2025Updated 7 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Python classes for the Buckeye Corpus☆26Mar 30, 2018Updated 8 years ago
- Automatic Measurement of Vowel Duration for Consonant Vowel Consonant (CVC) sound files (JASA 2016)☆14Feb 25, 2017Updated 9 years ago
- Facebook Post Reactions dataset☆12Dec 7, 2017Updated 8 years ago
- Winners solutions for [WNS Analytics Wizard 2018](https://datahack.analyticsvidhya.com/contest/wns-analytics-hackathon-2018/)☆25Dec 13, 2018Updated 7 years ago
- ☆30Nov 9, 2018Updated 7 years ago
- Interoperability for Grasshopper and Revit☆22Aug 18, 2017Updated 8 years ago
- proof of concept conversation orchestrator with a speech-language model☆20Oct 19, 2024Updated last year
- A url shorten web site in node.js☆27Nov 10, 2017Updated 8 years ago
- A three-dimensional vocal tract acoustic model using the finite-difference time-domain (FDTD) numerical scheme.☆18Sep 25, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Docker image for Dataiku Science Studio☆10Apr 20, 2017Updated 9 years ago
- Toolkit for developing OData web services. Can be used from Web API, Nancy, or the platform of your choice.☆14Jun 30, 2022Updated 3 years ago
- A rough and ready Python utility which splits audio files based on silence and desired min/max chunk duration.☆16Jun 22, 2022Updated 4 years ago
- Automatic Dialect Detection Repository☆39Nov 13, 2022Updated 3 years ago
- 24-hour Automatic Speech Recognition☆27Jun 4, 2021Updated 5 years ago
- pronunciation LEXicons for Any Low-resource Language☆21Jul 14, 2020Updated 5 years ago
- A simple toolkit for speaker segmentation and identification☆31Jun 15, 2013Updated 13 years ago