aklos / gpt3-personal-assistant
Interact with GPT-3 through speech
☆13Updated 2 years ago
Alternatives and similar repositories for gpt3-personal-assistant:
Users that are interested in gpt3-personal-assistant are comparing it to the libraries listed below
- TTS Client for Coqui TTS server☆13Updated 2 years ago
- Interface for using TTS and vocoder models in the form of a text editor☆19Updated 2 years ago
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with the…☆47Updated 2 years ago
- A free & open tool for transcribing audio interviews with offline ASR support☆24Updated last year
- Streamlit app to visualize and edit TTS datasets☆14Updated 3 years ago
- Creates video from TTS output and viseme images.☆11Updated 2 years ago
- Heteronym to Phoneme Parser☆18Updated last year
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions☆51Updated 3 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated last year
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆28Updated last year
- A fast MP3 decoder for python, using minimp3☆28Updated 2 years ago
- OCTRA is a web-application for the orthographic transcription of audio files.☆37Updated this week
- Simple PyTorch Denoisers for Waveform Audio☆34Updated 2 months ago
- My guide to create an italian TTS with Coqui☆14Updated 3 years ago
- ☆18Updated 2 years ago
- flask+tornado based NVIDIA tacotron2+waveglow tts web app☆28Updated last year
- An even smaller speech recognizer / force aligner☆32Updated 2 months ago
- The application allows users to record speech, transcribe it using the Whisper ASR (Automatic Speech Recognition) model, translate the tr…☆13Updated last year
- A simple voice conversion tool☆17Updated 2 years ago
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text☆36Updated 4 years ago
- A deep-learning powered accessibility application which turns pdfs into audio files. Featuring ocr improvement and tts with inflection!☆23Updated this week
- 🦁 Nala is an agile open-source voice assistant framework (20+ actions).☆35Updated last year
- A very basic demonstration connecting speech recognition and text-to-speech☆19Updated 4 years ago
- Tunable pipelines☆31Updated last week
- A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" models☆66Updated 2 years ago
- StyleTTS 2 Optimized Training Fork☆22Updated 2 weeks ago
- 💬 "Realtime" voice transcription and cloning using ElevenLabs's API.☆53Updated last year
- an improved version of Real-time-voice-cloning☆48Updated 11 months ago
- Code for OpenAI Whisper Web App Demo☆94Updated 2 years ago