kyegomez/movie-gen

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/kyegomez/movie-gen)

kyegomez / movie-gen

An open source community implementation of the model from the paper: "Movie Gen: A Cast of Media Foundation Models". Join our community to help implement this model!

☆59

Alternatives and similar repositories for movie-gen

Users that are interested in movie-gen are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

kyegomez / CogNetX
View on GitHub
CogNetX is an advanced, multimodal neural network architecture inspired by human cognition. It integrates speech, vision, and video proce…
☆20Updated this week
kyegomez / OmniByteFormer
View on GitHub
OmniByteFormer is a generalized Transformer model that can process any type of data by converting it into byte sequences, bypassing tradi…
☆15Jul 13, 2026Updated last week
kyegomez / MLXTransformer
View on GitHub
Simple Implementation of a Transformer in the new framework MLX by Apple
☆19Nov 18, 2024Updated last year
kyegomez / dev-swarm
View on GitHub
A swarm of LLM agents that will help you test, document, and productionize your code!
☆19Updated this week
Corleone-Huang / RealCustomProject
View on GitHub
☆19Apr 16, 2025Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
facebookresearch / MovieGenBench
View on GitHub
Movie Gen Bench - two media generation evaluation benchmarks released with Meta Movie Gen
☆443Mar 8, 2025Updated last year
The-Swarm-Corporation / HTX-Swarm
View on GitHub
A sophisticated multi-agent system designed for real-time market analysis of HTX (formerly Huobi) exchange data. This swarm combines spec…
☆11Mar 18, 2025Updated last year
kyegomez / Mirasol
View on GitHub
Pytorch Implementation of the Model from "MIRASOL3B: A MULTIMODAL AUTOREGRESSIVE MODEL FOR TIME-ALIGNED AND CONTEXTUAL MODALITIES"
☆26Jan 27, 2025Updated last year
xiefan-guo / i4vgen
View on GitHub
[arXiv 2024] I4VGen: Image as Free Stepping Stone for Text-to-Video Generation
☆24Oct 6, 2024Updated last year
WaterPistolAI / Awesome-Local-LLM
View on GitHub
A curated list of resources, libraries, tools, and communities for working with Local Large Language Models (LLMs).
☆11Dec 20, 2024Updated last year
kyegomez / Audio-xLSTMs
View on GitHub
Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch
☆19Updated this week
SuperShivam5000 / windows-walker
View on GitHub
☆16Jul 4, 2025Updated last year
The-Swarm-Corporation / SwarmOS
View on GitHub
Traditional operating systems are reactive - they wait for user input or system events before taking action. SwarmOS breaks this paradigm…
☆15Dec 6, 2024Updated last year
teowu / DOVER-Dev
View on GitHub
This is a [forked version] for author's debugging. Please jump to https://github.com/QualityAssessment/DOVER for stable version to use.
☆14Oct 29, 2023Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
kyegomez / Qwen-VL
View on GitHub
My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't rel…
☆13Jan 29, 2024Updated 2 years ago
Agora-Lab-AI / OmegaViT
View on GitHub
OmegaViT (ΩViT) is a cutting-edge vision transformer architecture that combines multi-query attention, rotary embeddings, state space mod…
☆15Updated this week
Atqarana / AI-Voicebot-for-Kids
View on GitHub
An interactive companion toy that engages kids with storytelling, singing, and encouragement for physical activities using advanced AI t…
☆10Oct 15, 2024Updated last year
yannqi / Draw-an-Audio-Code
View on GitHub
Official code of the paper: Draw an Audio: Leveraging Multi-Instruction for Video-to-Audio Synthesis.
☆45Sep 11, 2024Updated last year
Q-Future / Visual-Question-Answering-for-Video-Quality-Assessment
View on GitHub
[ACMMM2025] Official released code for VQA² series models
☆68Apr 21, 2026Updated 3 months ago
zhang-haojie / MuSS
View on GitHub
A Large-Scale Dataset and Cinematic Narrative Benchmark for Multi-Shot Subject-to-Video Generation
☆28Jun 9, 2026Updated last month
errorcalc / ESLowGraphicsLibrary
View on GitHub
Low level software graphics library by ErrorSoft (ESLGL)
☆19Apr 7, 2018Updated 8 years ago
YannDubs / Mini_Decodable_Information_Bottleneck
View on GitHub
Minimum viable code for the Decodable Information Bottleneck paper. Pytorch Implementation.
☆12Oct 20, 2020Updated 5 years ago
Ji4chenLi / rg-lcd
View on GitHub
Reward Guided Latent Consistency Distillation
☆26Oct 9, 2024Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
tyuxie / RFM
View on GitHub
The official codebase for Reflected Flow Matching (ICML 2024)
☆24Jun 19, 2024Updated 2 years ago
dogmanet / SkyXEngine
View on GitHub
SkyXEngine - движок для создания 3D игр с real-time рендером, использует технологии DirectX 11.
☆16Updated this week
agentsea / agentd
View on GitHub
A daemon that makes a desktop OS accessible to AI agents
☆42May 29, 2025Updated last year
tocubed / ComfyUI-EvTexture
View on GitHub
☆18Jan 5, 2025Updated last year
kyegomez / BRAVE-ViT-Swarm
View on GitHub
Implementation of the paper: "BRAVE : Broadening the visual encoding of vision-language models"
☆26Jun 22, 2026Updated last month
HUIZ-A / SVA
View on GitHub
☆20Apr 26, 2024Updated 2 years ago
Vchitect / LiteGen
View on GitHub
A light-weight and high-efficient training framework for accelerating diffusion tasks.
☆53Apr 23, 2026Updated 2 months ago
kbmurali / som-driven-qa-rag
View on GitHub
Self Organizing Maps (SOM) ML model can be used to conduct semantic search to populate context required for Retrieval Augmented Generatio…
☆15Mar 16, 2024Updated 2 years ago
facebookresearch / MultiModalExplorer
View on GitHub
Visualize multi-model embedding spaces. The first goal is to quickly get a lay of the land of any embedding space. Then be able to scroll…
☆28May 16, 2024Updated 2 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
kyegomez / Ocean
View on GitHub
Ultra Fast Multi-Modality Vector Database
☆18Feb 21, 2024Updated 2 years ago
PolyPerceiver-Lab / STAV2A
View on GitHub
☆20Aug 11, 2025Updated 11 months ago
yunyikristy / skipNet
View on GitHub
☆12Oct 21, 2019Updated 6 years ago
h0x91b / gta2-resurection
View on GitHub
Only for education purposes
☆23Nov 15, 2021Updated 4 years ago
EmPACTLab / Awesome-Neuroscience-Agent-Reasoning
View on GitHub
Neuroscience Inspired Agent Reasoning Framework
☆31May 19, 2025Updated last year
kyegomez / MELLE
View on GitHub
An open source community implementation of the model MELLE from the paper: "Autoregressive Speech Synthesis without Vector Quantization"
☆16Updated this week
vicksEmmanuel / latent-gemma
View on GitHub
☆27Jan 14, 2025Updated last year