open-voice-interoperability / openfloor-docs
Specifications and documentation for the Open Voice Interoperability Initiative Project
☆21 Updated this week
Alternatives and similar repositories for openfloor-docs
Users who are interested in openfloor-docs are comparing it to the repositories listed below.
- A Model (maybe an app) that translates the audio of a video from one language to another language, cloning the voice of original video wi… ☆15 Updated 7 months ago
- A composition of offline tools to achieve high quality multilingual speech to text transcription ☆23 Updated last week
- eCMU: An Efficient Phase-aware Framework for Music Source Separation with Conformer (IEEE RIVF23) ☆10 Updated last year
- Whisper finetuning ☆15 Updated 8 months ago
- Indic-Conformer models for ASR ☆20 Updated last year
- Code and Resources for "LLM-Powered Grapheme-to-Phoneme Conversion: Benchmark and Case Study", introducing methods to leverage LLMs for G… ☆16 Updated 7 months ago
- chatterbox TTS + Voice Clone using onnx ☆26 Updated this week
- Repo & Project for the Imminent Research Grant code & tasks ☆12 Updated last year
- Audio De-Noiser using a Convolutional Neural Network Architecture built with Tensorflow.js ☆21 Updated 2 years ago
- WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription. ☆13 Updated last year
- Breton speech recognition with Vosk ☆15 Updated last month
- Transfer learning approach to pronunciation scoring ☆11 Updated last year
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀 ☆17 Updated 7 months ago
- This repository includes training, inference, evaluation, and utility scripts developed for fine-tuning the Whisper medium.en model on Ai… ☆21 Updated last year
- A simple, accessible and offline real-time transcription app for Android. ☆13 Updated last year
- Vaksanca introduces free Sanskrit speech corpus with vowel segmentation. ☆16 Updated 4 years ago
- Launch your speech synthesis within one minute. ☆12 Updated last year
- CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval ☆13 Updated 5 months ago
- ☆17 Updated 9 months ago
- Speech recognition module for Python, supporting several engines and APIs, online and offline. ☆13 Updated 3 years ago
- ☆21 Updated last week
- Using YouTube to prepare a speech recognition dataset for any language ☆10 Updated 4 years ago
- Simple audio AE ☆13 Updated last year
- A tool to collect/validate audio recordings from workers on Amazon Mechanical Turk. Written in Python/Flask. (originally hosted on github… ☆13 Updated 3 years ago
- Evaluation of STT models for the German language ☆15 Updated 3 years ago
- KABooks is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. Using a… ☆12 Updated 2 years ago
- Python script to download & process data to train a speech-synthesis model of Vietnamese M.C. Nguyễn Ngọc Ngạn ☆14 Updated last year
- Implementation of MathReader, Text-to-Speech for Mathematical Documents ☆24 Updated 3 months ago
- Neural text to speech system that uses eSpeak as a text/phoneme front-end ☆16 Updated 4 years ago
- Getting confidences from any end-to-end systems ☆11 Updated 2 years ago