winstxnhdw / CapGen
A fast CPU-first video/audio transcriber for generating caption files with Whisper and CTranslate2, hosted on Hugging Face Spaces.
☆11Updated this week
Alternatives and similar repositories for CapGen
Users who are interested in CapGen are comparing it to the libraries listed below:
- The Full-stack web framework to meet the developer's expectation.☆16Updated 2 years ago
- Code for the paper "Free-View Expressive Talking Head Video Editing" (ICASSP 2023)☆12Updated last year
- Automatically generate a lip-synced avatar based off of a transcript and audio☆14Updated 2 years ago
- This repository is the project page for "Point Anywhere: Directed Object Estimation from Omnidirectional Images", including source code …☆12Updated 2 years ago
- A python library to find differences between audio and transcriptions☆19Updated 2 years ago
- Modify-Anything is based on yolov5, yolov8 for video and image detection. Segment-anything, lama_cleaner is applied to segment, modify, era…☆17Updated 2 years ago
- Sample and Computation Redistribution for Efficient Face Detection☆15Updated last year
- Redis Queue Dashboard based on FastAPI☆121Updated 3 weeks ago
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning …☆26Updated 2 years ago
- A simple uv workspace☆18Updated 9 months ago
- ☆17Updated 2 years ago
- ☆16Updated last year
- An open source NLP as a service project focused on providing state of the art systems with ease. Training and inference by simple docker …☆20Updated last year
- FastAPI backend to upload files to S3☆27Updated 5 years ago
- ☆12Updated last year
- WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription.☆13Updated last year
- Reflex select component which allows the user to search for options and create new ones.☆14Updated last year
- Talking Face Generation system☆19Updated 2 years ago
- python GET raw or rendered HTML (for humans)☆13Updated 5 years ago
- Implementation of SoundStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"☆13Updated 11 months ago
- Video chat apps with computer vision filters built on top of Streamlit☆50Updated 2 years ago
- Convert any image into a Region Adjacency Graph (RAG)☆12Updated 5 years ago
- Python wrapper for the Lago Rest API☆25Updated this week
- key/value store for Python based on Cloudflare workers☆33Updated 6 months ago
- Tool4AI: A model agnostic, LLM friendly router for tool/function call☆19Updated last year
- Self-supervised neural network for music recommendations.☆18Updated 2 years ago
- A pipeline to generate user-preferred photo-realistic avatars using stable-diffusion and bayesian-optimization.☆18Updated 7 months ago
- ☆16Updated 2 years ago
- A simple Python wrapper for the TikTok API.☆24Updated 7 months ago
- Interactive chat application leveraging OpenAI's GPT-4 for real-time conversation simulations. Built with Flask, this project showcases s…☆25Updated last year