An open source community implementation of the model from the paper: "Movie Gen: A Cast of Media Foundation Models". Join our community to help implement this model!
☆58Mar 16, 2026Updated this week
Alternatives and similar repositories for movie-gen
Users that are interested in movie-gen are comparing it to the libraries listed below
Sorting:
- Unofficial implementation of Meta's MovieGen models☆16Nov 25, 2025Updated 3 months ago
- OmniByteFormer is a generalized Transformer model that can process any type of data by converting it into byte sequences, bypassing tradi…☆15Updated this week
- The code and weight for LoVA. LoVA is a novel model for Long-form Video-to-Audio generation. Based on the Diffusion Transformer (DiT) arc…☆15Feb 27, 2025Updated last year
- ☆19Apr 16, 2025Updated 11 months ago
- Simple Implementation of a Transformer in the new framework MLX by Apple☆19Nov 18, 2024Updated last year
- ☆13Nov 21, 2025Updated 4 months ago
- A swarm of LLM agents that will help you test, document, and productionize your code!☆16Feb 16, 2026Updated last month
- [arXiv 2024] I4VGen: Image as Free Stepping Stone for Text-to-Video Generation☆24Oct 6, 2024Updated last year
- Pytorch Implementation of the Model from "MIRASOL3B: A MULTIMODAL AUTOREGRESSIVE MODEL FOR TIME-ALIGNED AND CONTEXTUAL MODALITIES"☆26Jan 27, 2025Updated last year
- A sophisticated multi-agent system designed for real-time market analysis of HTX (formerly Huobi) exchange data. This swarm combines spec…☆10Mar 18, 2025Updated last year
- A curated list of resources, libraries, tools, and communities for working with Local Large Language Models (LLMs).☆10Dec 20, 2024Updated last year
- Neuroscience Inspired Agent Reasoning Framework☆29May 19, 2025Updated 10 months ago
- An open source community implementation of the model MELLE from the paper: "Autoregressive Speech Synthesis without Vector Quantization"☆14Mar 16, 2026Updated last week
- ☆16Jul 4, 2025Updated 8 months ago
- This is a [forked version] for author's debugging. Please jump to https://github.com/QualityAssessment/DOVER for stable version to use.☆14Oct 29, 2023Updated 2 years ago
- Traditional operating systems are reactive - they wait for user input or system events before taking action. SwarmOS breaks this paradigm…☆15Dec 6, 2024Updated last year
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch☆19Updated this week
- ☆19Aug 11, 2025Updated 7 months ago
- OnlyFlow: Optical Flow based Motion Conditioning for Video Diffusion Models☆19Feb 20, 2025Updated last year
- Official code of the paper: Draw an Audio: Leveraging Multi-Instruction for Video-to-Audio Synthesis.☆45Sep 11, 2024Updated last year
- OmegaViT (ΩViT) is a cutting-edge vision transformer architecture that combines multi-query attention, rotary embeddings, state space mod…☆14Updated this week
- Reward Guided Latent Consistency Distillation☆27Oct 9, 2024Updated last year
- [ACMMM2025] Official released code for VQA² series models☆61Oct 19, 2025Updated 5 months ago
- A small tool to help check NAT issues☆12Dec 4, 2018Updated 7 years ago
- ☆24Aug 5, 2025Updated 7 months ago
- My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't rel…☆12Jan 29, 2024Updated 2 years ago
- This comprehensive course teaches students how to build, deploy, and manage autonomous agents for enterprise workflows using the Swarms l…☆17Dec 22, 2025Updated 3 months ago
- A light-weight and high-efficient training framework for accelerating diffusion tasks.☆52Sep 14, 2024Updated last year
- A Matlab toolbox for examining the quality of structural (SNR) and functional (tSNR, SFNR) MRI☆13Apr 1, 2020Updated 5 years ago
- Analysis of video quality datasets via design of minimalistic video quality models☆24Jul 15, 2024Updated last year
- Low level software graphics library by ErrorSoft (ESLGL)☆19Apr 7, 2018Updated 7 years ago
- A WordPress plugin starter template for coding with AI IDEs, like; Augment Code, Cursor, Windsurf, Loveable, Bolt, Cline, Roo Code, etc☆14Updated this week
- SkyXEngine - движок для создания 3D игр с real-time рендером, использует технологии DirectX 11.☆16Updated this week
- This repository contains the data and code of the paper titled "IllusionVQA: A Challenging Optical Illusion Dataset for Vision Language M…☆24Apr 27, 2025Updated 10 months ago
- ☆18Jan 5, 2025Updated last year
- ☆20Apr 26, 2024Updated last year
- Multi-Modal Tree of thoughts for DALLE-3 like auto self improvement☆17Nov 11, 2024Updated last year
- ☆12Oct 21, 2019Updated 6 years ago
- Load FLEX by pressing down both volume buttons.☆16Apr 30, 2024Updated last year