open-voice-interoperability / openfloor-docsLinks
specifications and documentation for the Open Voice Interoperability Initiative Project
☆19Updated 2 weeks ago
Alternatives and similar repositories for openfloor-docs
Users that are interested in openfloor-docs are comparing it to the libraries listed below
Sorting:
- A Model (maybe an app) that translates the audio of a video from one language to another language, cloning the voice of original video wi…☆13Updated 4 months ago
- WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription.☆12Updated 11 months ago
- A simple, accessible and offline real-time transcription app for Android.☆11Updated 11 months ago
- Sophia AI Assistant is a Python-based desktop AI that performs a variety of tasks, including answering questions, opening applications, b…☆21Updated 11 months ago
- A composition of offline tools to achieve high quality multilingual speech to text transcription☆19Updated 3 weeks ago
- Anaouder mouezh e Brezhoneg gant Vosk☆14Updated 2 months ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Updated 4 years ago
- llmon-py is a multimodal webui for Llama 3-8B.☆16Updated last year
- Indic-Conformer models for ASR☆18Updated last year
- Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories☆18Updated 5 months ago
- eCMU: An Efficient Phase-aware Framework for Music Source Separation with Conformer (IEEE RIVF23)☆10Updated 10 months ago
- This repository includes training, inference, evaluation, and utility scripts developed for fine-tuning the Whisper medium.en model on Ai…☆18Updated 11 months ago
- Repo & Project for the Imminent Research Grant code & tasks☆12Updated last year
- ☆17Updated 6 months ago
- Simple audio AE☆12Updated 10 months ago
- TTS-Wrapper makes it easier to use text-to-speech APIs by providing a unified and easy-to-use interface.☆21Updated last year
- Mission to create a Hebrew TTS model as powerful and user-friendly as WaveNet☆35Updated 8 months ago
- Neural text to speech system that uses eSpeak as a text/phoneme front-end☆16Updated 3 years ago
- Code for the paper: MACE: Leveraging Audio for Evaluating Audio Captioning Systems☆11Updated 8 months ago
- Launch your speech synthesis within one minute.☆12Updated last year
- Russian accentuator and IPA transcriber☆14Updated last year
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆15Updated 4 months ago
- A lightweight Python library for running TTS models with a unified API.☆20Updated 7 months ago
- A free & open tool for transcribing audio interviews with offline ASR support☆25Updated last year
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated 2 years ago
- Whisper finetuning☆14Updated 5 months ago
- Evaluation of STT models for german language☆15Updated 3 years ago
- Voice activity detection and speaker gender segmentation audiovisual corpus☆16Updated 8 months ago
- ☆13Updated 10 years ago
- A corpus of speech from the Joe Rogan Experience podcast, consisting of 8.43 million words. It includes aligned TextGrids with phonetic a…☆19Updated 5 years ago