runpod-workers / worker-insanely-fast-whisperLinks
☆12Updated last month
Alternatives and similar repositories for worker-insanely-fast-whisper
Users that are interested in worker-insanely-fast-whisper are comparing it to the libraries listed below
Sorting:
- Second attempt at AI webcam, this time with OpenAI API☆39Updated last year
- faster-whisper as serverless endpoint☆105Updated 2 weeks ago
- Instant voice cloning by MyShell.☆25Updated last year
- Create your own RVC v2 dataset from a youtube video☆27Updated last year
- LoRA Explorer model to test with LoRAs using Flux.1[Dev] as the base model☆48Updated 7 months ago
- ☆39Updated 3 weeks ago
- Cog wrapper for collabora/WhisperSpeech☆25Updated last year
- ☆28Updated last year
- Style-Transfer: Apply the style of an image to another image☆52Updated last year
- Cog wrapper for Coqui / xtts-v2☆74Updated 6 months ago
- ☆10Updated last year
- ☆25Updated last year
- ☆30Updated last year
- Starting point to build your own custom serverless endpoint☆107Updated 3 weeks ago
- A package for analyzing content readability and virality potential.☆14Updated last year
- Talk to GPT-4 and create a story together.☆90Updated last year
- A pipeline to generate user-preferred photo-realistic avatars using stable-diffusion and bayesian-optimization.☆18Updated 3 weeks ago
- A template repo for training and publishing your own custom Stable Diffusion model using https://replicate.com/replicate/dreambooth☆50Updated last year
- Whisper from OpenAi and diarization with Pyannote☆43Updated last year
- Free AI Youtube Summarizer on your computer using mistral-instruct-v0.2, langchain and llama_index☆18Updated last year
- Simulates talk with an AI that can express emotions☆69Updated 10 months ago
- Create videos formatted specifically for short-form video websites.☆12Updated last year
- (CVPR 2023)SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation☆29Updated last year
- Build Web Datasets with Ease☆33Updated 11 months ago
- ☆18Updated 3 years ago
- Shush is an app that deploys a WhisperV3 model with Flash Attention v2 on Modal and makes requests to it via a NextJS app☆205Updated last year
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.☆45Updated last year
- The purpose of this repository is to discuss on Audio transformers☆12Updated 3 weeks ago
- AI Lip Syncing application, deployed on Streamlit☆41Updated last year
- Whiteboard animation generator☆33Updated last year