Yuan-ManX / ai-multimodal-timelineLinks
Here we will track the latest AI Multimodal Models, including Multimodal Foundation Models, LLM, Agent, Audio, Image, Video, Music and 3D content. π₯
β37Updated 6 months ago
Alternatives and similar repositories for ai-multimodal-timeline
Users that are interested in ai-multimodal-timeline are comparing it to the libraries listed below
Sorting:
- [ACL2025 Oral] Evaluate Image/Video Generation like Humans - Fast, Explainable, Flexibleβ87Updated last month
- Implementation of the premier Text to Video model from OpenAIβ56Updated 9 months ago
- Anim-Director: Controllable Animation Video Generation with Large Models-based Multimodal Agentsβ84Updated last month
- Live2Diff: A Pipeline that processes Live video streams by a uni-directional video Diffusion model.β190Updated last year
- A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integratβ¦β64Updated 10 months ago
- [ICCV2025] WikiAutoGen offical pageβ17Updated last month
- Implementation for the paper "ComfyBench: Benchmarking LLM-based Agents in ComfyUI for Autonomously Designing Collaborative AI Systems".β180Updated 5 months ago
- Video-Infinity generates long videos quickly using multiple GPUs without extra training.β185Updated last year
- β193Updated last year
- Interface for GenAI-Arenaβ14Updated last year
- Official implementation of "PyVision: Agentic Vision with Dynamic Tooling."β108Updated 2 weeks ago
- A one-stop library to standardize the inference and evaluation of all the conditional video generation models.β49Updated 5 months ago
- β34Updated 6 months ago
- InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructionsβ128Updated last year
- Synthetic data generator for image, video and 3D modelsβ30Updated last year
- An open source community implementation of the model from the paper: "Movie Gen: A Cast of Media Foundation Models". Join our community β¦β61Updated last week
- β35Updated 2 years ago
- Official PyTorch implementation of TokenSet.β121Updated 4 months ago
- β16Updated last year
- β69Updated last year
- β29Updated last year
- Fashion-VDM: Video Diffusion Model for Virtual Try-Onβ20Updated 9 months ago
- The official GitHub Page for MiniMaxβ49Updated last month
- β18Updated 3 months ago
- β85Updated 11 months ago
- Official GPU implementation of the paper "PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance"β130Updated 8 months ago
- β56Updated 8 months ago
- Community ComfyUI workflows running on fal.aiβ58Updated 11 months ago
- β66Updated 4 months ago
- β204Updated last year