Basic Python Tornado app for handling WebSocket audio
☆10 · Oct 5, 2023 · Updated 2 years ago
Alternatives and similar repositories for audiosocket_framework
Users who are interested in audiosocket_framework are comparing it to the libraries listed below
- Capture microphone input from the browser using the Web Audio API and live-stream it to a remote server using WebSocket and the Opus codec. ☆21 · Apr 26, 2019 · Updated 6 years ago
- Demo application of Google Cloud Speech recognition ☆21 · Nov 26, 2025 · Updated 3 months ago
- Pumilio: A Web-Based Management System for Ecological Recordings ☆13 · Oct 29, 2018 · Updated 7 years ago
- SpeechJudge: Towards Human-Level Judgment for Speech Naturalness (https://arxiv.org/abs/2511.07931) ☆63 · Dec 23, 2025 · Updated 2 months ago
- Demo app of the Nexmo Voice API WebSockets sending call audio to a web browser for playback using the Web Audio API ☆31 · Dec 12, 2025 · Updated 2 months ago
- COMBO is a jointly trained tagger, lemmatizer and dependency parser. ☆36 · Mar 24, 2023 · Updated 2 years ago
- Text Normalization utilities for normalizing text for TTS ☆21 · Updated this week
- Code, source data, examples, and audio excerpts for Flow: Expressive Rhythm in the Rapping Voice ☆10 · Feb 13, 2020 · Updated 6 years ago
- Wave-U-Net for automatic (drum) mixing ☆38 · Mar 24, 2023 · Updated 2 years ago
- ☆13 · Updated this week
- Instant.bot package manager and command line tools ☆15 · Feb 7, 2026 · Updated 3 weeks ago
- 1st place solution to the DCASE 2020 - Task 5 - Urban Sound Tagging with Spatiotemporal Context ☆16 · Dec 8, 2022 · Updated 3 years ago
- Research_speech_speaker_verification_nist_sre2010 ☆12 · Mar 1, 2016 · Updated 10 years ago
- Tool for Evaluating Multilingual WS-353 and SimLex-999 ☆10 · Dec 15, 2016 · Updated 9 years ago
- ☆10 · Jul 24, 2019 · Updated 6 years ago
- Flask skeleton using Bootstrap, SCSS, Docker, console and rotating file logging, HTTP basic auth and web and api views with Blueprint ☆10 · Nov 18, 2018 · Updated 7 years ago
- Listen to the weather using Sonic Pi and data from Mathematica ☆11 · Dec 6, 2018 · Updated 7 years ago
- vpype vector tracing plugin ☆13 · Apr 14, 2022 · Updated 3 years ago
- Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories ☆19 · Apr 10, 2025 · Updated 10 months ago
- Resources for "Simple Speech Representation Learning from Perceptual Data". ☆11 · Sep 18, 2023 · Updated 2 years ago
- ☆13 · Sep 8, 2019 · Updated 6 years ago
- Implementation of "Look, Listen and Recognise: character-aware audio-visual subtitling" ☆19 · Nov 3, 2025 · Updated 3 months ago
- GitHub mirror of MediaWiki extension Wikispeech - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Develo… ☆12 · Updated this week
- ATC-Anno is an annotation tool for Air Traffic Control data that offers automatic semantic and concept annotation. ☆12 · Nov 17, 2023 · Updated 2 years ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR ☆11 · Oct 21, 2022 · Updated 3 years ago
- My first ever training of a Piper TTS voice ☆16 · May 23, 2025 · Updated 9 months ago
- PyGun: Procedural Generation of Anechoic Gunshot Sounds ☆14 · Oct 8, 2016 · Updated 9 years ago
- A Tree-LSTM-based dependency tree sentiment labeler ☆15 · May 9, 2019 · Updated 6 years ago
- An open-access corpus of conversational bilingual speech in Cantonese and English ☆40 · Apr 28, 2022 · Updated 3 years ago
- Local audio recorder (no streaming server required). Currently requires Flash Player 10.1 or above. ☆14 · May 26, 2014 · Updated 11 years ago
- ☆25 · Jun 7, 2013 · Updated 12 years ago
- Haskell phonology library. ☆10 · Jan 23, 2012 · Updated 14 years ago
- Course materials for a 3-day seminar "Machine Learning and NLP: Advances and Applications" at New College of Florida ☆12 · Feb 10, 2022 · Updated 4 years ago
- Mirror of GlottHMM ☆10 · Jun 7, 2016 · Updated 9 years ago
- A JUCE module wrapper for Apple's zero-configuration protocol Bonjour ☆12 · May 14, 2021 · Updated 4 years ago
- Database of annotated field recording samples that can be used for training audio labelling algorithms ☆10 · Feb 1, 2019 · Updated 7 years ago
- Compute the most likely permutation of a lattice given an LM ☆10 · Jan 3, 2013 · Updated 13 years ago
- This is the home directory of the speaker diarization module being developed for Heterogeneous News data in RedHen Labs as a GSoC project ☆10 · Sep 11, 2015 · Updated 10 years ago
- Scripts for recreating the Replication Dataset for Fundamental Frequency Estimation. Part of the dissertation "Pitch of Voiced Speech in … ☆11 · Mar 29, 2021 · Updated 4 years ago