open-voice-interoperability / openfloor-docs
specifications and documentation for the Open Voice Interoperability Initiative Project
☆20 Updated last week
Alternatives and similar repositories for openfloor-docs
Users who are interested in openfloor-docs are comparing it to the libraries listed below.
- A Model (maybe an app) that translates the audio of a video from one language to another language, cloning the voice of original video wi… ☆15 Updated 6 months ago
- A simple, accessible and offline real-time transcription app for Android. ☆13 Updated last year
- This repository includes training, inference, evaluation, and utility scripts developed for fine-tuning the Whisper medium.en model on Ai… ☆19 Updated last year
- Breton speech recognizer built with Vosk ☆15 Updated last week
- A composition of offline tools to achieve high quality multilingual speech to text transcription ☆23 Updated this week
- Repo & Project for the Imminent Research Grant code & tasks ☆12 Updated last year
- Audio De-Noiser using a Convolutional Neural Network Architecture built with Tensorflow.js ☆21 Updated 2 years ago
- Simple audio AE ☆13 Updated last year
- eCMU: An Efficient Phase-aware Framework for Music Source Separation with Conformer (IEEE RIVF23) ☆10 Updated last year
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀 ☆15 Updated 6 months ago
- Launch your speech synthesis within one minute. ☆12 Updated last year
- Code and Resources for "LLM-Powered Grapheme-to-Phoneme Conversion: Benchmark and Case Study", introducing methods to leverage LLMs for G… ☆14 Updated 6 months ago
- Indic-Conformer models for ASR ☆19 Updated last year
- CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval ☆13 Updated 5 months ago
- Soniox Compare. Compare real-time voice AI side by side. No glossy charts, just results. ☆16 Updated 4 months ago
- Voice activity detection and speaker gender segmentation audiovisual corpus ☆16 Updated 10 months ago
- llmon-py is a multimodal webui for Llama 3-8B. ☆16 Updated last year
- ☆21 Updated this week
- Demo combining Whisper for speech recognition and Google TTS for speech synthesis to interact with Alpaca-LoRA. ☆20 Updated last year
- Arabic Grapheme-to-Phoneme (G2P) Conversion ☆13 Updated 8 months ago
- Tracking beer/wine using Audio Event Detection with Machine Learning ☆15 Updated last year
- WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription. ☆13 Updated last year
- Using YouTube to prepare a speech recognition dataset for any language ☆10 Updated 4 years ago
- A corpus of speech from the Joe Rogan Experience podcast, consisting of 8.43 million words. It includes aligned TextGrids with phonetic a… ☆21 Updated 5 years ago
- ☆17 Updated 8 months ago
- IPA Phonetic dataset lexicon ☆18 Updated this week
- ☆23 Updated this week
- A free & open tool for transcribing audio interviews with offline ASR support ☆25 Updated last year
- Free Dutch voice dataset ☆13 Updated 4 years ago
- Emotion Recognition from Brazilian Portuguese Informal Spontaneous Speech ☆21 Updated 3 years ago