A Python tool that uses Google Gemini API to transcribe video or audio files into SRT subtitle files.
☆18Jan 2, 2026Updated 2 months ago
Alternatives and similar repositories for GeminiASR
Users that are interested in GeminiASR are comparing it to the libraries listed below
Sorting:
- Scripts to help easily create image pair datasets for super resolution models☆14Sep 1, 2025Updated 6 months ago
- Bringing the elasticity and convenience of hosted build, to private build servers☆12Jun 12, 2023Updated 2 years ago
- ☆10Jul 13, 2024Updated last year
- ☆25Aug 29, 2025Updated 6 months ago
- ☆25Oct 23, 2025Updated 4 months ago
- Simple Video Summarization using Text-to-Segment Anything (Florence2 + SAM2) This project provides a video processing tool that utilizes…☆10Feb 20, 2025Updated last year
- ☆34Updated this week
- AI-powered video object removal (diffusion inpainting under the hood).☆21Mar 9, 2026Updated last week
- 短视频去水印微信小程序源码前端 后端☆10Dec 5, 2022Updated 3 years ago
- Code for paper "Unsupervised Noise adaptation using Data Simulation"☆14May 16, 2024Updated last year
- This project aims to develop a user-friendly graphical interface application for the quick cutting and merging of video intros and outros…☆16Jun 8, 2024Updated last year
- Python AOT Obfuscator / Python 混淆器;比字节码更好,应该。☆18Dec 28, 2024Updated last year
- ☆12Apr 18, 2025Updated 11 months ago
- [AAAI 2025] Video Diffusion Models are Strong Video Inpainter☆17Jul 21, 2025Updated 8 months ago
- Source code of the paper: Video Inpainting Localization with Contrastive Learning, IEEE SPL 2025.☆12Aug 9, 2025Updated 7 months ago
- ☆19Jul 12, 2020Updated 5 years ago
- A collection of python scripts to make minor edits to video, audio, and transcription files.☆14Mar 28, 2024Updated last year
- Diffusers Image Fill v3 -- Inpaint or Remove objects from an image - or Outpaint - or Outpaint Video Zoom: 16GB+ GPU | 32GB+ RAM | 20GB+…☆16Nov 9, 2024Updated last year
- ☆15Feb 5, 2023Updated 3 years ago
- G2LP-Net: Global to Local Progressive Video Inpainting Network Dataset☆11Feb 21, 2025Updated last year
- V免签监控端☆23Oct 24, 2025Updated 4 months ago
- AI Manga Editor capable of text recognition, translation, inpainting and editing.☆21Mar 25, 2025Updated 11 months ago
- Unofficial implementation of SCP-GAN☆18Jul 4, 2023Updated 2 years ago
- Vapoursynth filter using ProPainter: Improving Propagation and Transformer for Video Inpainting☆17Jan 2, 2026Updated 2 months ago
- Zero-Shot Blind Audio Bandwidth Extension☆26May 25, 2023Updated 2 years ago
- Bộ gõ tiếng Việt nguồn mở cho Windows 10/11☆48Updated this week
- ☆30Feb 8, 2026Updated last month
- Internal diffusion for video inpainting☆15May 19, 2025Updated 10 months ago
- End-to-End Speech Processing Toolkit☆15Jan 20, 2025Updated last year
- ☆85Oct 10, 2025Updated 5 months ago
- An Android client app that allows you to edit images using AI inpainting models.☆21Jun 1, 2024Updated last year
- ☆23Feb 20, 2026Updated last month
- Code for paper "RealSR-R1: Reinforcement Learning for Real-World Image Super-Resolution with Vision-Language Chain-of-Thought"☆103Jan 24, 2026Updated last month
- A pipeline focused on the in-painting of text in images. For example the removal of subtitles in a screenshot of a movie.☆16Jun 30, 2022Updated 3 years ago
- Code for the Interspeech 2023 paper "A Joint Model for Pronunciation Assessment and Mispronunciation Detection and Diagnosis with Multi-t…☆25Nov 9, 2023Updated 2 years ago
- FakePartsBench: 25K+ AI-generated videos with pixel- and frame-level annotations of full and partial deepfakes.☆24Aug 31, 2025Updated 6 months ago
- ☆14Feb 19, 2025Updated last year
- Blind&Invisible Watermark (图片盲水印,提取水印无须原图!) 增加图形界面☆17Feb 26, 2022Updated 4 years ago
- QRCode scanner via WebRTC☆15Mar 11, 2014Updated 12 years ago