open-voice-interoperability / openfloor-docsLinks
specifications and documentation for the Open Voice Interoperability Initiative Project
☆21Updated last week
Alternatives and similar repositories for openfloor-docs
Users that are interested in openfloor-docs are comparing it to the libraries listed below
Sorting:
- A Model (maybe an app) that translates the audio of a video from one language to another language, cloning the voice of original video wi…☆15Updated 8 months ago
- All-in-one Speech Transcription☆10Updated this week
- Indic-Conformer models for ASR☆20Updated last year
- Whisper finetuning☆15Updated 9 months ago
- CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval☆13Updated 7 months ago
- Anaouder mouezh e Brezhoneg gant Vosk☆16Updated 2 months ago
- eCMU: An Efficient Phase-aware Framework for Music Source Separation with Conformer (IEEE RIVF23)☆10Updated last year
- Voice activity detection and speaker gender segmentation audiovisual corpus☆16Updated last year
- ☆18Updated 10 months ago
- A composition of offline tools to achieve high quality multilingual speech to text transcription☆23Updated last week
- llmon-py is a multimodal webui for Llama 3-8B.☆16Updated last year
- Пакет словарей русского языка с поддержкой букв Е и Ё☆13Updated 7 years ago
- Arabic Grapheme-to-Phoneme (G2P) Conversion☆13Updated 10 months ago
- Repo & Project for the Imminent Research Grant code & tasks☆12Updated last year
- Accelerate Whisper tasks such as transcription, by multiprocesing through parallelization☆25Updated 3 years ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Updated 4 years ago
- An end-to-end library for training audio wake-word models and deploying them in the browser.☆38Updated 6 months ago
- English ASR Challenge organized by Speech Lab, IIT Madras☆11Updated 4 years ago
- chatterbox TTS + Voice Clone using onnx☆27Updated 3 weeks ago
- ☆11Updated 4 years ago
- Evaluation of STT models for german language☆15Updated 4 years ago
- Launch your speech synthesis within one minute.☆12Updated last year
- ☆16Updated this week
- ☆13Updated last year
- Vaksanca introduces free Sanskrit speech corpus with vowel segmentation.☆16Updated 4 years ago
- A simple, accessible and offline real-time transcription app for Android.☆14Updated last year
- Audio De-Noiser using a Convolutional Neural Network Architecture built with Tensorflow.js☆21Updated 2 years ago
- Russian accentuator and IPA transcriber☆17Updated last year
- A corpus of speech from the Joe Rogan Experience podcast, consisting of 8.43 million words. It includes aligned TextGrids with phonetic a…☆21Updated 6 years ago
- This is a repository for a paper accepted at the 2022 IEEE Spoken Language Technology Workshop (SLT 2022)☆16Updated 3 years ago