Yuan-ManX / ai-multimodal-timeline

Here we will track the latest AI Multimodal Models, including Multimodal Foundation Models, LLM, Agent, Audio, Image, Video, Music and 3D content. 🔥

☆36

Alternatives and similar repositories for ai-multimodal-timeline

Users that are interested in ai-multimodal-timeline are comparing it to the libraries listed below

kyegomez / Sora
Implementation of the premier Text to Video model from OpenAI
☆57 Updated 8 months ago
okaris / grounded-segmentation
A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integrat…
☆64 Updated 9 months ago
camenduru / MoE-LLaVA-jupyter
☆16 Updated last year
HITsz-TMG / Anim-Director
Anim-Director: Controllable Animation Video Generation with Large Models-based Multimodal Agents
☆81 Updated last month
Binxly / sd3-training
sd3 dreambooth lora training book, adapted from the diffusers doc
☆45 Updated last year
martintomov / comfy-anything
Community ComfyUI workflows running on fal.ai
☆58 Updated 10 months ago
camenduru / Multi-LoRA-Composition-jupyter
☆13 Updated last year
camenduru / diffusers-image-outpaint-jupyter
☆16 Updated 9 months ago
camenduru / FluxMusic-jupyter
☆19 Updated 10 months ago
AIAnytime / Small-Multimodal-Vision-Model
Small Multimodal Vision Model "Imp-v1-3b" trained using Phi-2 and Siglip.
☆17 Updated last year
open-mmlab / Live2Diff
Live2Diff: A Pipeline that processes Live video streams by a uni-directional video Diffusion model.
☆186 Updated 11 months ago
camenduru / playground-colab
☆17 Updated last year
TIGER-AI-Lab / VideoGenHub
A one-stop library to standardize the inference and evaluation of all the conditional video generation models.
☆48 Updated 5 months ago
MetabrainAGI / Awaker2.5-VL
☆33 Updated 5 months ago
Gengzigang / TokenSet
Official PyTorch implementation of TokenSet.
☆121 Updated 3 months ago
camenduru / PIA-colab
☆24 Updated last year
superagi / Veagle
Enhancement in Multimodal Representation Learning.
☆40 Updated last year
camenduru / MiniGPT-v2-colab
☆29 Updated last year
DeepAI-Research / Simverse
Synthetic data generator for image, video and 3D models
☆30 Updated 11 months ago
CogNLP / CogAGENT
☆35 Updated 2 years ago
01yzzyu / wikiautogen
This is the official page of WikiAutoGen, ICCV 2025
☆15 Updated 2 weeks ago
camenduru / Open-Sora-jupyter
☆13 Updated last year
camenduru / Depth-Anything-jupyter
☆11 Updated last year
xxyQwQ / ComfyBench
Implementation for the paper "ComfyBench: Benchmarking LLM-based Agents in ComfyUI for Autonomously Designing Collaborative AI Systems".
☆177 Updated 4 months ago
SLIT-AI / FuseChat-3.0
☆17 Updated 2 months ago
zer0int / CLIP-SAE-finetune
Sparse Autoencoders (SAE) vs CLIP fine-tuning fun.
☆15 Updated 6 months ago
SinclairHudson / traccc
Gradio app to track objects in video and add visual effects
☆17 Updated 2 weeks ago
camenduru / SSD-1B-colab
☆25 Updated last year
poloclub / ClickDiffusion
ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing
☆69 Updated last year
camenduru / YoloWorld-EfficientSAM-jupyter
☆46 Updated last year