GodModed / ai-captions
This small project uses OpenAI's Whisper model to generate captions for videos.
☆17Updated 3 years ago
Alternatives and similar repositories for ai-captions
Users who are interested in ai-captions are comparing it to the libraries listed below.
- openai/whisper + extra features☆89Updated 3 years ago
- A simple unofficial Python3 library to interface with elevenlabs.io.☆17Updated 2 years ago
- The primary backend service for Atila apps.☆40Updated 11 months ago
- Speech to text to speech using Elevenlabs☆28Updated 2 years ago
- ☆18Updated last year
- A no-code application that enables companies to create intelligent digital assistants.☆13Updated 2 years ago
- The code for some apps built with Sieve.☆85Updated last year
- Play.ht's Text to Speech API☆92Updated 5 months ago
- An easy-to-use library and command-line tool for TTS☆15Updated 9 months ago
- An automatic movie trailer generator.☆42Updated 3 years ago
- Gradio_demo.py with Blinking on Still Mode Video Creation☆12Updated 2 years ago
- On-device noise suppression powered by deep learning☆82Updated 2 weeks ago
- Teach ChatGPT the Alda music programming language, show it some superb code, and consult with it to compose a melody.☆48Updated 2 years ago
- A curated list of awesome OpenAI's Whisper☆102Updated 2 years ago
- Listen to any audio stream on your machine and print out the transcribed or translated audio.☆119Updated 2 years ago
- Imagine translating your speech or anybody's speech to any language you want within minutes. check this out...☆50Updated last year
- Conversion of audio files to text using whisper from OpenAI with a simple tkinter GUI☆10Updated 2 years ago
- ☆54Updated 3 years ago
- PlayHT Python SDK - AI Text-to-Speech Streaming & Voice Cloning API☆220Updated last month
- Google Colab Notebooks for Transcription with Whisper☆24Updated 9 months ago
- Open Server is an OpenAI API Compatible Server for generating text, images, embeddings, and storing them in vector databases. It also inc…☆17Updated 2 years ago
- Code for the paper "Free-View Expressive Talking Head Video Editing" (ICASSP 2023)☆12Updated last year
- Translate any text using GPT.☆17Updated 2 years ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆22Updated last year
- Input a YouTube video link or upload a video file and get a video with subtitles.☆124Updated last year
- text-to-audio-latent-diffusion☆37Updated 2 years ago
- Deploy your GGML models to HuggingFace Spaces with Docker and gradio☆38Updated 2 years ago
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.☆46Updated 2 years ago
- 🤖 Quantum-powered excuse generator for developers. Blame bugs on cosmic rays, AI sentience, or Schrödinger’s intern.☆28Updated 5 months ago
- Chat with an AI simulation of anyone as easily as copy-pasting text into a folder!☆18Updated 2 years ago