showlab/cosmo

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/showlab/cosmo)

showlab / cosmo

☆75

Alternatives and similar repositories for cosmo

Users that are interested in cosmo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

TencentARC / HOSNeRF
View on GitHub
HOSNeRF: Dynamic Human-Object-Scene Neural Radiance Fields from a Single Video
☆69Dec 12, 2023Updated 2 years ago
showlab / Efficient-CLS
View on GitHub
[ICCV 2023] Label-Efficient Online Continual Object Detection in Streaming Video
☆23Jan 8, 2024Updated 2 years ago
showlab / VisInContext
View on GitHub
Official implementation of Leveraging Visual Tokens for Extended Text Contexts in Multi-Modal Learning
☆28Oct 30, 2024Updated last year
showlab / ROICtrl
View on GitHub
Code for [CVPR 2025] ROICtrl: Boosting Instance Control for Visual Generation
☆110Apr 16, 2025Updated last year
showlab / FQGAN
View on GitHub
FQGAN: Factorized Visual Tokenization and Generation
☆59Mar 29, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
opendatalab / CLIP-Parrot-Bias
View on GitHub
ECCV2024_Parrot Captions Teach CLIP to Spot Text
☆66Sep 6, 2024Updated last year
showlab / EvolveDirector
View on GitHub
[NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models.
☆52Oct 14, 2024Updated last year
CSU-JPG / TextAtlas
View on GitHub
[ICML 2026]A Large-scale Dataset for training and evaluating model's ability on Dense Text Image Generation
☆93Sep 27, 2025Updated 9 months ago
showlab / Show-Anything-3D
View on GitHub
Edit and Generate Anything in 3D world!
☆13Apr 15, 2023Updated 3 years ago
showlab / all-in-one
View on GitHub
[CVPR2023] All in One: Exploring Unified Video-Language Pre-training
☆281Mar 25, 2023Updated 3 years ago
showlab / ShowAnything
View on GitHub
☆83Aug 1, 2023Updated 2 years ago
showlab / Exo2Ego-V
View on GitHub
☆61Apr 28, 2025Updated last year
showlab / CLVQA
View on GitHub
[AAAI2023] Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task (Oral)
☆42Mar 23, 2024Updated 2 years ago
showlab / Q2A
View on GitHub
[ECCV 2022] AssistQ: Affordance-centric Question-driven Task Completion for Egocentric Assistant
☆23Jan 30, 2026Updated 5 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
video-reality-test / video-reality-test
View on GitHub
☆23May 5, 2026Updated 2 months ago
showlab / afformer
View on GitHub
Affordance Grounding from Demonstration Video to Target Image (CVPR 2023)
☆46Jul 26, 2024Updated last year
showlab / DoraCycle
View on GitHub
DoraCycle: Domain-Oriented Adaptation of Unified Generative Model in Multimodal Cycles
☆31Mar 8, 2026Updated 4 months ago
youthhoo / AFA_For_Few_shot_learning
View on GitHub
☆22Jan 8, 2023Updated 3 years ago
showlab / FAR
View on GitHub
Code for: "Long-Context Autoregressive Video Modeling with Next-Frame Prediction"
☆311Apr 23, 2025Updated last year
showlab / VideoLISA
View on GitHub
[NeurlPS 2024] One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos
☆148Dec 26, 2024Updated last year
showlab / SMS
View on GitHub
[ICCV 2025] Balanced Image Stylization with Style Matching Score
☆69Mar 9, 2026Updated 4 months ago
showlab / UniRL
View on GitHub
The code repository of UniRL
☆53May 30, 2025Updated last year
zhaohengyuan1 / Genixer
View on GitHub
(ECCV 2024) Empowering Multimodal Large Language Model as a Powerful Data Generator
☆116Mar 21, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
weijiawu / ParaDiffusion
View on GitHub
[IJCV 2025] Paragraph-to-Image Generation with Information-Enriched Diffusion Model
☆107Mar 24, 2025Updated last year
showlab / Impossible-Videos
View on GitHub
ICML 2025 - Impossible Videos
☆81Jul 23, 2025Updated last year
showlab / UniVTG
View on GitHub
[ICCV 2023] UniVTG: Towards Unified Video-Language Temporal Grounding
☆380May 8, 2024Updated 2 years ago
showlab / D-AR
View on GitHub
the official repo for "D-AR: Diffusion via Autoregressive Models"
☆138Jan 29, 2026Updated 5 months ago
showlab / Image2Paragraph
View on GitHub
[Image 2 Text Para] Transform Image into Unique Paragraph with ChatGPT, BLIP2, OFA, GRIT, Segment Anything, ControlNet.
☆822Apr 28, 2023Updated 3 years ago
RUCAIBox / Event-Bench
View on GitHub
Official code of *Towards Event-oriented Long Video Understanding*
☆12Jul 26, 2024Updated last year
showlab / T2VScore
View on GitHub
T2VScore: Towards A Better Metric for Text-to-Video Generation
☆81Apr 10, 2024Updated 2 years ago
FingerRec / OA-Transformer
View on GitHub
[CVPR 2022] The code for our paper 《Object-aware Video-language Pre-training for Retrieval》
☆61May 25, 2022Updated 4 years ago
showlab / AUI
View on GitHub
Computer-Use Agents as Judges for Generative UI
☆44Nov 27, 2025Updated 7 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
Yanqing0327 / MLLMs-Augmented
View on GitHub
The official implementation of 《MLLMs-Augmented Visual-Language Representation Learning》
☆31Mar 12, 2024Updated 2 years ago
sail-sg / ptp
View on GitHub
[CVPR2023] The code for 《Position-guided Text Prompt for Vision-Language Pre-training》
☆150Jun 7, 2023Updated 3 years ago
TencentARC / Mix-of-Show
View on GitHub
NeurIPS 2023, Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models
☆427May 14, 2024Updated 2 years ago
jy0205 / LaVIT
View on GitHub
LaVIT: Empower the Large Language Model to Understand and Generate Visual Content
☆603Oct 6, 2024Updated last year
showlab / EgoVLP
View on GitHub
[NeurIPS 2022] Egocentric Video-Language Pretraining
☆261May 9, 2024Updated 2 years ago
showlab / datacentric.vlp
View on GitHub
Compress conventional Vision-Language Pre-training data
☆52Sep 22, 2023Updated 2 years ago
showlab / VLog
View on GitHub
[CVPR 2025] Video Narration as Vocabulary & Video as Long Document
☆587Mar 13, 2025Updated last year