open-voice-interoperability / openfloor-docsLinks
specifications and documentation for the Open Voice Interoperability Initiative Project
☆21Updated 2 weeks ago
Alternatives and similar repositories for openfloor-docs
Users that are interested in openfloor-docs are comparing it to the libraries listed below
Sorting:
- A Model (maybe an app) that translates the audio of a video from one language to another language, cloning the voice of original video wi…☆15Updated 8 months ago
- A composition of offline tools to achieve high quality multilingual speech to text transcription☆23Updated last week
- Anaouder mouezh e Brezhoneg gant Vosk☆16Updated 2 months ago
- All-in-one Speech Transcription☆10Updated 2 weeks ago
- Repo & Project for the Imminent Research Grant code & tasks☆12Updated last year
- Indic-Conformer models for ASR☆20Updated last year
- Simple audio AE☆13Updated last year
- eCMU: An Efficient Phase-aware Framework for Music Source Separation with Conformer (IEEE RIVF23)☆10Updated last year
- A simple, accessible and offline real-time transcription app for Android.☆14Updated last year
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀☆20Updated 8 months ago
- ☆18Updated 10 months ago
- Whisper finetuning☆15Updated 10 months ago
- WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription.☆13Updated last year
- Accelerate Whisper tasks such as transcription, by multiprocesing through parallelization☆25Updated 3 years ago
- A corpus of diacritized Hebrew texts (טקסט מנוקד)☆11Updated 3 years ago
- A free & open tool for transcribing audio interviews with offline ASR support☆25Updated 2 years ago
- KABooks is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. Using a…☆12Updated 2 years ago
- Train a fiwGAN or ciwGAN model using your own training data☆14Updated 3 years ago
- Soniox Compare. Compare real-time voice AI side by side. No glossy charts, just results.☆18Updated 6 months ago
- ☆10Updated last year
- Neural text to speech system that uses eSpeak as a text/phoneme front-end☆16Updated 4 years ago
- Launch your speech synthesis within one minute.☆12Updated last year
- A corpus of speech from the Joe Rogan Experience podcast, consisting of 8.43 million words. It includes aligned TextGrids with phonetic a…☆21Updated 6 years ago
- CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval☆13Updated 7 months ago
- ☆23Updated 2 months ago
- Sophia AI Assistant is a Python-based desktop AI that performs a variety of tasks, including answering questions, opening applications, b…☆27Updated last year
- Getting confidences from any end-to-end systems☆11Updated 2 years ago
- IPA Phonetic dataset lexicon☆18Updated 3 weeks ago
- Code and Resources for "LLM-Powered Grapheme-to-Phoneme Conversion: Benchmark and Case Study", introducing methods to leverage LLMs for G…☆20Updated 8 months ago
- ☆13Updated last year