open-voice-interoperability / openfloor-docsLinks
specifications and documentation  for the Open Voice Interoperability Initiative Project
☆19Updated last week
Alternatives and similar repositories for openfloor-docs
Users that are interested in openfloor-docs are comparing it to the libraries listed below
Sorting:
- A Model (maybe an app) that translates the audio of a video from one language to another language, cloning the voice of original video wi…☆15Updated 5 months ago
 - A simple, accessible and offline real-time transcription app for Android.☆12Updated last year
 - A composition of offline tools to achieve high quality multilingual speech to text transcription☆22Updated 2 months ago
 - Anaouder mouezh e Brezhoneg gant Vosk☆14Updated 3 months ago
 - eCMU: An Efficient Phase-aware Framework for Music Source Separation with Conformer (IEEE RIVF23)☆10Updated last year
 - llmon-py is a multimodal webui for Llama 3-8B.☆16Updated last year
 - Indic-Conformer models for ASR☆18Updated last year
 - A corpus of speech from the Joe Rogan Experience podcast, consisting of 8.43 million words. It includes aligned TextGrids with phonetic a…☆20Updated 5 years ago
 - Launch your speech synthesis within one minute.☆12Updated last year
 - Code and Resources for "LLM-Powered Grapheme-to-Phoneme Conversion: Benchmark and Case Study", introducing methods to leverage LLMs for G…☆13Updated 5 months ago
 - CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval☆13Updated 4 months ago
 - Text-to-Speech conversor for Basque and Spanish. It includes linguistic processing and built voices for the languages aforementioned. Its…☆17Updated last year
 - Whisper finetuning☆15Updated 6 months ago
 - WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription.☆13Updated last year
 - Using YouTube to prepare a speech recognition dataset for any language☆10Updated 4 years ago
 - 🎵 muse: Music Separation☆10Updated last year
 - A free & open tool for transcribing audio interviews with offline ASR support☆25Updated last year
 - Neural model for prediction of stress position in Russian words☆11Updated 4 months ago
 - This repository includes training, inference, evaluation, and utility scripts developed for fine-tuning the Whisper medium.en model on Ai…☆18Updated last year
 - Simple audio AE☆12Updated 11 months ago
 - ☆13Updated 10 years ago
 - Soniox Compare. Compare real-time voice AI side by side. No glossy charts, just results.☆14Updated 3 months ago
 - ☆23Updated last week
 - Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Updated 6 months ago
 - A cross platform (Android/iOS/MacOS) Bahasa Indonesia speech recognizer library, written in Flutter.☆10Updated 2 years ago
 - Text-to-Speech Latency Benchmark☆18Updated 4 months ago
 - ☆11Updated 3 years ago
 - Getting confidences from any end-to-end systems☆11Updated 2 years ago
 - ☆13Updated last week
 - Using OpenVINO to speed up MeloTTS inference☆13Updated last year