☆10Sep 12, 2024Updated last year
Alternatives and similar repositories for cmota
Users that are interested in cmota are comparing it to the libraries listed below
- Official Implementation of ISR-DPO: Aligning Large Multimodal Models for Videos by Iterative Self-Retrospective DPO (AAAI'25)☆23Nov 25, 2025Updated 3 months ago
- Code Release for the paper "Make-A-Story: Visual Memory Conditioned Consistent Story Generation" in CVPR 2023☆43Jun 27, 2023Updated 2 years ago
- Official code repository for the EMNLP 2021 paper☆26Jan 30, 2022Updated 4 years ago
- ACL'24 (Oral) Tuning Large Multimodal Models for Videos using Reinforcement Learning from AI Feedback☆77Sep 12, 2024Updated last year
- Unofficial implementation of DragDiffusion☆37Jul 7, 2023Updated 2 years ago
- Repository for the paper 'Medical diffusion on a budget: textual inversion for medical image generation'☆12Dec 11, 2024Updated last year
- Official Implementation of HIMA (COLM'25)☆19Nov 25, 2025Updated 3 months ago
- My implementation of the model KosmosG from "KOSMOS-G: Generating Images in Context with Multimodal Large Language Models"☆14Nov 11, 2024Updated last year
- Python Cloud generation with Pyglet and Perlin Noise☆11May 24, 2014Updated 11 years ago
- Finetuning Stable Diffusion from Diffusers☆12Mar 11, 2024Updated last year
- An efficient distillation method for flow matching models☆22Feb 1, 2026Updated last month
- Open ChatGLM Eyes to See the World☆13Mar 30, 2023Updated 2 years ago
- MedicalGPT-zh: a Chinese medical dialogue language model based on ChatGLM, fine-tuned on a high-quality instruction dataset☆11Apr 9, 2023Updated 2 years ago
- Implementation of "Conditional Score Guidance for Text-Driven Image-to-Image Translation" (NeurIPS 2023).☆11Jul 19, 2023Updated 2 years ago
- A LoRA for AI drawing that lets a model draw pixel-art-style characters and scenes☆12Apr 28, 2023Updated 2 years ago
- A repository for the updated version of CoinRun used to collect MUGEN, a multimodal video-audio-text dataset. This repo contains scripts …☆13Jul 13, 2022Updated 3 years ago
- ☆11Nov 30, 2024Updated last year
- ☆13Nov 15, 2024Updated last year
- Official Pytorch Implementation of Synthesizing Coherent Story with Auto-Regressive Latent Diffusion Models☆202Jul 9, 2023Updated 2 years ago
- ☆22Apr 4, 2025Updated 11 months ago
- Official Repository of Recovering Dynamic 3D Sketches from Videos (CVPR 2025)☆14Updated this week
- ☆15Aug 4, 2024Updated last year
- Code for our IJCAI 2019 paper entitled "Conditional GAN with Discriminative Filter Generation for Text-to-Video Synthesis"☆14Mar 29, 2022Updated 3 years ago
- CAD models at the speed of thought (or, you know, GPT-4)☆18May 14, 2024Updated last year
- Official Implementation of CAPEAM (ICCV'23)☆16Nov 30, 2024Updated last year
- Official PyTorch implementation of the paper Transformer-Based Image Generation from Scene Graphs https://arxiv.org/abs/2303.04634☆19Jan 30, 2024Updated 2 years ago
- [Unofficial Implementation] Subject-driven Video Generation via Disentangled Identity and Motion☆58Jan 5, 2026Updated 2 months ago
- ☆13Nov 29, 2024Updated last year
- Official implementation of "Positional-encoding Image Prior" (PIP)☆16Mar 1, 2023Updated 3 years ago
- PiX: Dynamic Channel Sampling for ConvNets (CVPR 2024)☆14Jun 14, 2024Updated last year
- This is an official repository for the paper, NoiseCollage, which is a revolutionary extension of text-to-image diffusion models for layo…☆63May 16, 2024Updated last year
- ☆13Mar 22, 2022Updated 3 years ago
- Unofficial implementation of E-LatentLPIPS in Diffusion2GAN☆19Sep 5, 2024Updated last year
- code of [CVPR22] CodedVTR: Codebook-based Sparse Voxel Transformer with Geometric Guidance☆17Jul 10, 2022Updated 3 years ago
- Official PyTorch implementation of paper “InsViE-1M: Effective Instruction-based Video Editing with Elaborate Dataset Construction”☆33Jul 28, 2025Updated 7 months ago
- Official Python codes for the paper "Sentinel-2 Sharpening Using a Single Unsupervised Convolutional Neural Network With MTF-Based Degrad…☆16Oct 19, 2022Updated 3 years ago
- ☆15Jul 9, 2024Updated last year
- We introduce OpenStory++, a large-scale open-domain dataset focusing on enabling MLLMs to perform storytelling generation tasks.☆16Aug 30, 2024Updated last year
- Joint image and Depth inpainting, ldm3d☆16Apr 28, 2024Updated last year