yonseivnl/cmota

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yonseivnl/cmota)

yonseivnl / cmota

☆10

Alternatives and similar repositories for cmota

Users that are interested in cmota are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ubc-vision / Make-A-Story
View on GitHub
Code Release for the paper "Make-A-Story: Visual Memory Conditioned Consistent Story Generation" in CVPR 2023
☆43Jun 27, 2023Updated 3 years ago
snumprlab / isr-dpo
View on GitHub
Official Implementation of ISR-DPO:Aligning Large Multimodal Models for Videos by Iterative Self-Retrospective DPO (AAAI'25)
☆23Nov 25, 2025Updated 8 months ago
adymaharana / VLCStoryGan
View on GitHub
Official code repository for the EMNLP 2021 paper
☆26Jan 30, 2022Updated 4 years ago
carpedkm / CustoMDiT
View on GitHub
PexelsCustom-1M: A Comprehensive Ecosystem for Open-Domain Customized Video Generation
☆19Jun 30, 2026Updated 3 weeks ago
lucidrains / ttt-rl
View on GitHub
Reinforcement Learning example in Nim, playing tic tac toe. Based off original C version from the great Antirez
☆15Apr 2, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
yonseivnl / vlm-rlaif
View on GitHub
ACL'24 (Oral) Tuning Large Multimodal Models for Videos using Reinforcement Learning from AI Feedback
☆77Sep 12, 2024Updated last year
lucidrains / assoc-scan
View on GitHub
Associative scan package for DRYing some code between repos
☆18Jan 5, 2026Updated 6 months ago
snumprlab / capeam
View on GitHub
Official Implementation of CAPEAM (ICCV'23)
☆16Nov 30, 2024Updated last year
ennucore / cadmium
View on GitHub
CAD models at the speed of thought (or, you know, GPT-4)
☆22May 14, 2024Updated 2 years ago
mugen-org / MUGEN_coinrun
View on GitHub
A repository for the updated version of CoinRun used to collect MUGEN, a multimodal video-audio-text dataset. This repo contains scripts …
☆13Jul 13, 2022Updated 4 years ago
minrq / CGAN_Text2Video
View on GitHub
Code for our IJCAI 2019 paper entitled "Conditional GAN with Discriminative Filter Generation for Text-to-Video Synthesis"
☆14Mar 29, 2022Updated 4 years ago
xichenpan / ARLDM
View on GitHub
Official Pytorch Implementation of Synthesizing Coherent Story with Auto-Regressive Latent Diffusion Models
☆203Jul 9, 2023Updated 3 years ago
adymaharana / storydalle
View on GitHub
☆336Feb 14, 2023Updated 3 years ago
carpedkm / disentangled-subject-to-vid
View on GitHub
Learning Zero-Shot Subject-Driven Video Generation Using 1% Compute
☆59Jul 9, 2026Updated 2 weeks ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
Hleephilip / CSG
View on GitHub
Official implementation of "Conditional Score Guidance for Text-Driven Image-to-Image Translation" (NeurIPS 2023)
☆11Jul 19, 2023Updated 3 years ago
jbistanbul / hieramamba
View on GitHub
Official Code for the paper "HieraMamba: Video Temporal Grounding via Hierarchical Anchor-Mamba Pooling"
☆15Apr 30, 2026Updated 2 months ago
expenses / face_recognition
View on GitHub
💀 UNMAINTAINED 💀 Rust bindings to dlibs face recognition tools
☆28Aug 4, 2021Updated 4 years ago
Multimodal-Commonsense-and-Task / CommonSense-Tasks
View on GitHub
☆11Nov 30, 2024Updated last year
Multimodal-Commonsense-and-Task / Multimodal-Representation
View on GitHub
☆14Nov 15, 2024Updated last year
Advocate99 / DragDiffusion
View on GitHub
Unofficial implementation of DragDiffusion
☆37Jul 7, 2023Updated 3 years ago
SNU-VGILab / Liv3Stroke
View on GitHub
Official Repository of Recovering Dynamic 3D Sketches from Videos (CVPR 2025)
☆14Mar 2, 2026Updated 4 months ago
rohandkn / skribble2vid
View on GitHub
☆24May 28, 2023Updated 3 years ago
SNU-VGILab / e-latentlpips
View on GitHub
Unofficial implementation of E-LatentLPIPS in Diffusion2GAN
☆20Sep 5, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ashishkumar822 / PiX
View on GitHub
PiX: Dynamic Channel Sampling for ConvNets (CVPR 2024)
☆13Jun 14, 2024Updated 2 years ago
intelpro / CMTA
View on GitHub
Official repository for the ECCV 2024 paper, "CMTA: Cross-Modal Temporal Alignment for Event-guided Video Deblurring", ECCV 2024
☆17Jul 16, 2025Updated last year
Multimodal-Commonsense-and-Task / Knowledge-Base-and-NLP
View on GitHub
☆13Nov 29, 2024Updated last year
princetonvisualai / pointingqa
View on GitHub
Code for paper "Point and Ask: Incorporating Pointing into Visual Question Answering"
☆19Oct 4, 2022Updated 3 years ago
snumprlab / scale
View on GitHub
Official Implementation of SCALE: Self-uncertainty Conditioned Adaptive Looking and Execution for Vision-Language-Action Models (ICML'26 …
☆15Jun 29, 2026Updated 3 weeks ago
kyegomez / KosmosG
View on GitHub
My implementation of the model KosmosG from "KOSMOS-G: Generating Images in Context with Multimodal Large Language Models"
☆13Nov 11, 2024Updated last year
yonseivnl / earl
View on GitHub
☆17Jan 19, 2026Updated 6 months ago
Papple-F / csg
View on GitHub
☆17Aug 8, 2024Updated last year
Huage001 / DynaST
View on GitHub
Pytorch implementation of paper "DynaST: Dynamic Sparse Transformer for Exemplar-Guided Image Generation", ECCV 2022.
☆56Jul 14, 2022Updated 4 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
snumprlab / hima
View on GitHub
Official Implementation of HIMA (COLM'25)
☆21Nov 25, 2025Updated 8 months ago
univ-esuty / noisecollage
View on GitHub
This is an official repository for the paper, NoiseCollage, which is a revolutionary extension of text-to-image diffusion models for layo…
☆63May 16, 2024Updated 2 years ago
pablovela5620 / mini-nvs-solver
View on GitHub
☆15Aug 4, 2024Updated last year
banxxx / codex-console
View on GitHub
codex-console 是一个集成化控制台项目，支持任务管理、批量处理、数据导出、自动上传、日志查看与打包支持。
☆16Apr 8, 2026Updated 3 months ago
hoang1007 / finetuning-diffusers
View on GitHub
Finetuning Stable Diffusion from Diffusers
☆11Mar 11, 2024Updated 2 years ago
YeLuoSuiYou / openstorypp
View on GitHub
We introduce OpenStory++, a large-scale open-domain dataset focusing on enabling MLLMs to perform storytelling generation tasks.
☆18Aug 30, 2024Updated last year
GX77 / LCVSL
View on GitHub
☆14Sep 28, 2023Updated 2 years ago