☆10Sep 12, 2024Updated last year
Alternatives and similar repositories for cmota
Users that are interested in cmota are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official Implementation of ISR-DPO:Aligning Large Multimodal Models for Videos by Iterative Self-Retrospective DPO (AAAI'25)☆23Nov 25, 2025Updated 5 months ago
- Code Release for the paper "Make-A-Story: Visual Memory Conditioned Consistent Story Generation" in CVPR 2023☆43Jun 27, 2023Updated 2 years ago
- Official code repository for the EMNLP 2021 paper☆26Jan 30, 2022Updated 4 years ago
- ACL'24 (Oral) Tuning Large Multimodal Models for Videos using Reinforcement Learning from AI Feedback☆77Sep 12, 2024Updated last year
- Reinforcement Learning example in Nim, playing tic tac toe. Based off original C version from the great Antirez☆15Apr 2, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- CAD models at the speed of thought (or, you know, GPT-4)☆19May 14, 2024Updated last year
- Associative scan package for DRYing some code between repos☆18Jan 5, 2026Updated 4 months ago
- Official Implementation of HIMA (COLM'25)☆19Nov 25, 2025Updated 5 months ago
- Official Implementation of CAPEAM (ICCV'23)☆16Nov 30, 2024Updated last year
- A repository for the updated version of CoinRun used to collect MUGEN, a multimodal video-audio-text dataset. This repo contains scripts …☆13Jul 13, 2022Updated 3 years ago
- ☆335Feb 14, 2023Updated 3 years ago
- Official Pytorch Implementation of Synthesizing Coherent Story with Auto-Regressive Latent Diffusion Models☆202Jul 9, 2023Updated 2 years ago
- [Unofficial Implementation] Subject-driven Video Generation via Disentangled Identity and Motion☆58Jan 5, 2026Updated 4 months ago
- Implementation of "Conditional Score Guidance for Text-Driven Image-to-Image Translation" (NeurIPS 2023).☆11Jul 19, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆11Nov 30, 2024Updated last year
- Code for our IJCAI 2019 paper entitled "Conditional GAN with Discriminative Filter Generation for Text-to-Video Synthesis"☆14Mar 29, 2022Updated 4 years ago
- ☆14Nov 15, 2024Updated last year
- 💀 UNMAINTAINED 💀 Rust bindings to dlibs face recognition tools☆28Aug 4, 2021Updated 4 years ago
- Official repository for the ECCV 2024 paper, "CMTA: Cross-Modal Temporal Alignment for Event-guided Video Deblurring", ECCV 2024☆16Jul 16, 2025Updated 9 months ago
- Unofficial implementation of DragDiffusion☆37Jul 7, 2023Updated 2 years ago
- Official Repository of Recovering Dynamic 3D Sketches from Videos (CVPR 2025)☆14Mar 2, 2026Updated 2 months ago
- ☆13Nov 29, 2024Updated last year
- Official PyTorch implementation of paper “InsViE-1M: Effective Instruction-based Video Editing with Elaborate Dataset Construction”☆33Apr 3, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- PiX: Dynamic Channel Sampling for ConvNets (CVPR 2024)☆14Jun 14, 2024Updated last year
- Unofficial implementation of E-LatentLPIPS in Diffusion2GAN☆20Sep 5, 2024Updated last year
- My implementation of the model KosmosG from "KOSMOS-G: Generating Images in Context with Multimodal Large Language Models"☆14Nov 11, 2024Updated last year
- ☆24May 28, 2023Updated 2 years ago
- MimicDroid: In-Context Learning for Humanoid Robot Manipulation from Human Play Videos☆47Feb 10, 2026Updated 2 months ago
- ☆17Jan 19, 2026Updated 3 months ago
- A comprehensive SD v3.5 fine-tuning toolkit covering LoRA, DreamBooth, full fine-tuning, DDPO, GRPO, DPO and ReFL with aesthetic/text-ima…☆24Apr 4, 2025Updated last year
- ☆17Aug 8, 2024Updated last year
- Code for paper "Point and Ask: Incorporating Pointing into Visual Question Answering"☆19Oct 4, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- This is an official repository for the paper, NoiseCollage, which is a revolutionary extension of text-to-image diffusion models for layo…☆63May 16, 2024Updated last year
- ☆15Aug 4, 2024Updated last year
- Pytorch implementation of paper "DynaST: Dynamic Sparse Transformer for Exemplar-Guided Image Generation", ECCV 2022.☆56Jul 14, 2022Updated 3 years ago
- A miniaturized version of the Kimi-K2 model optimized for deployment on single H100 GPUs.☆35Jul 16, 2025Updated 9 months ago
- ☆21Mar 3, 2026Updated 2 months ago
- An efficient distillation method for flow matching models☆26Feb 1, 2026Updated 3 months ago
- Finetuning Stable Diffusion from Diffusers☆11Mar 11, 2024Updated 2 years ago