chigozienri / cog-autocaptionView external linksLinks
Add caption to any video
☆49Feb 19, 2024Updated last year
Alternatives and similar repositories for cog-autocaption
Users that are interested in cog-autocaption are comparing it to the libraries listed below
Sorting:
- ☆19Mar 27, 2024Updated last year
- ☆24Sep 5, 2025Updated 5 months ago
- ☆31Jan 7, 2024Updated 2 years ago
- ☆12Sep 26, 2023Updated 2 years ago
- ☆12Mar 25, 2024Updated last year
- BH hackathon☆14Apr 4, 2024Updated last year
- Seamless Voice Interactions with LLMs☆12Oct 28, 2023Updated 2 years ago
- ☆26Mar 18, 2024Updated last year
- Text-Guided Generation of Full-Body Image with Preserved Reference Face for Customized Animation☆24Jun 24, 2024Updated last year
- Instant voice cloning by MyShell. Join our Discord community https://discord.gg/myshell and select the Developer role upon joining to gai…☆10Dec 4, 2025Updated 2 months ago
- ☆14Feb 8, 2024Updated 2 years ago
- ☆13Oct 12, 2023Updated 2 years ago
- UnrealBakedSDF is a sample Unreal project for importing and visualizing BakedSDF meshes.☆15Jun 14, 2023Updated 2 years ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆16Apr 18, 2024Updated last year
- Add caption to any video☆209Jan 15, 2024Updated 2 years ago
- Jupyter notebooks for Inpainting | Outpainting with Flux.1 Fill dev. Able to run on Google Colab Free Tier☆34Dec 15, 2024Updated last year
- ☆12Mar 18, 2024Updated last year
- ImageBind One Embedding Space to Bind Them All☆26May 19, 2023Updated 2 years ago
- Convert an audio file to a waveform video☆11Nov 10, 2023Updated 2 years ago
- ☆17Feb 1, 2024Updated 2 years ago
- [ICLR 2024] Code for FreeNoise based on LaVie☆34Jan 28, 2024Updated 2 years ago
- ☆16Dec 18, 2023Updated 2 years ago
- Taming Stable Diffusion for Lip Sync!☆14Mar 18, 2025Updated 10 months ago
- Use miniGPT-4 batch to generate captions for a lot of images! You should be able to create the best captions you always wanted!☆18Jul 20, 2023Updated 2 years ago
- ☆17Jan 2, 2024Updated 2 years ago
- ☆15Dec 11, 2024Updated last year
- Speech AI training and inference tools☆36Jun 25, 2023Updated 2 years ago
- ☆14Oct 16, 2023Updated 2 years ago
- ☆17Dec 5, 2023Updated 2 years ago
- animatediff prompt travel☆19Jan 27, 2024Updated 2 years ago
- MMD viewer powered by Babylon.js and babylon-mmd☆16Aug 2, 2025Updated 6 months ago
- ☆15Jan 8, 2024Updated 2 years ago
- implementation of https://arxiv.org/pdf/2312.09299☆21Jul 3, 2024Updated last year
- ☆15Mar 12, 2024Updated last year
- ☆15Sep 15, 2023Updated 2 years ago
- ☆17Jan 10, 2024Updated 2 years ago
- A notebook running TensorRT's StableDiffusion demo on Google Colaboratory☆18Feb 1, 2023Updated 3 years ago
- MJCF Importer Extension☆18Jul 24, 2025Updated 6 months ago
- ☆13Oct 30, 2023Updated 2 years ago