A simple and flexible PyTorch implementation of Video StableDiffusion (ZeroScope_v2) based on diffusers.
☆20Feb 15, 2024Updated 2 years ago
Alternatives and similar repositories for SimpleSDM-Video
Users that are interested in SimpleSDM-Video are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A simple and flexible PyTorch implementation of StableDiffusion based on diffusers.☆25Sep 23, 2024Updated last year
- A simple and flexible PyTorch implementation of StableDiffusion-3 based on diffusers for DIY and finetuning.☆27May 28, 2025Updated 11 months ago
- [ICCV 2025] MRGen: Segmentation Data Engine for Underrepresented MRI Modalities☆39Sep 26, 2025Updated 7 months ago
- A simple and flexible PyTorch implementation of StableDiffusion-XL based on diffusers.☆20Sep 2, 2024Updated last year
- Official PyTorch code of GroundVQA (CVPR'24)☆64Sep 13, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [AAAI 2025] Grounded Multi-Hop VideoQA in Long-Form Egocentric Videos☆35May 27, 2025Updated 11 months ago
- ☆28Jul 18, 2025Updated 9 months ago
- code for A Large-scale Dataset for Audio-Language Representation Learning☆14Sep 18, 2024Updated last year
- Code implementation of RP3D-Diag☆17Nov 25, 2024Updated last year
- [EMNLP 2024] RaTEScore: A Metric for Radiology Report Generation☆65May 18, 2025Updated 11 months ago
- Universal Video Temporal Grounding with Generative Multi-modal Large Language Models☆52Mar 20, 2026Updated last month
- [ICCV 2025 Oral] Official implementation of Learning Streaming Video Representation via Multitask Training.☆89Dec 24, 2025Updated 4 months ago
- [CVPR 2026] SpatialScore: Towards Comprehensive Evaluation for Spatial Intelligence☆68Apr 17, 2026Updated 2 weeks ago
- Recent Advances on MLLM's Reasoning Ability☆26Apr 11, 2025Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- This is the offical repository of LLAVIDAL☆24Oct 4, 2025Updated 7 months ago
- The official codes for "M^3Builder: A Multi-Agent System for Automated Machine Learning in Medical Imaging"☆43Jul 28, 2025Updated 9 months ago
- ☆12Dec 6, 2024Updated last year
- [npj digital medicine] The official codes for "Towards Evaluating and Building Versatile Large Language Models for Medicine"☆78May 5, 2025Updated 11 months ago
- Official repo for the TMLR paper "Discffusion: Discriminative Diffusion Models as Few-shot Vision and Language Learners"☆29Apr 27, 2024Updated 2 years ago
- [ICLR'25] Streaming Video Question-Answering with In-context Video KV-Cache Retrieval☆114Nov 4, 2025Updated 6 months ago
- [CVPR 2025] VISCO: Benchmarking Fine-Grained Critique and Correction Towards Self-Improvement in Visual Reasoning☆13Jun 7, 2025Updated 10 months ago
- [CVPR 2025] LamRA: Large Multimodal Model as Your Advanced Retrieval Assistant☆181Jul 7, 2025Updated 9 months ago
- ☆18Oct 28, 2025Updated 6 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆16Jul 26, 2023Updated 2 years ago
- A Holistic Embodied Cognition Benchmark☆19Apr 3, 2025Updated last year
- Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment☆65Jul 22, 2025Updated 9 months ago
- Generative Models for Low Rank Video Representation and Reconstruction☆10May 20, 2019Updated 6 years ago
- [HVEI 2018] Colorizing Color Images☆12Nov 22, 2018Updated 7 years ago
- [CVPR'23 Highlight] AutoAD: Movie Description in Context.☆102Nov 6, 2024Updated last year
- VORNet: Spatio-temporally Consistent Video Inpainting for Object Removal, CVPRW 2019☆12Jul 18, 2019Updated 6 years ago
- Emotion Classification on FerPlus Dataset☆10Dec 10, 2018Updated 7 years ago
- [CVPR 2024] "Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition"☆12Feb 27, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆10Aug 20, 2023Updated 2 years ago
- ICME'19: Removing Rain in Videos: A Large-scale Database and A Two-stream ConvLSTM Approach☆12Jul 4, 2022Updated 3 years ago
- [EMNLP 2023] Official implementation of the algorithm ETSC: Exact Toeplitz-to-SSM Conversion our EMNLP 2023 paper - Accelerating Toeplitz…☆14Oct 17, 2023Updated 2 years ago
- Visual Instruction-guided Explainable Metric. Code for "Towards Explainable Metrics for Conditional Image Synthesis Evaluation" (ACL 2024…☆67Nov 19, 2024Updated last year
- The official repository for "One Model to Rule them All: Towards Universal Segmentation for Medical Images with Text Prompts"☆10Aug 16, 2024Updated last year
- ☆14Sep 4, 2020Updated 5 years ago
- CounTR: Transformer-based Generalised Visual Counting☆124Jul 11, 2024Updated last year