apple/ml-mgie

☆3,874

Alternatives and similar repositories for ml-mgie

Users who are interested in ml-mgie are comparing it to the libraries listed below.

tsujuifu / pytorch_mgie
A Gradio demo of MGIE
☆346 · Feb 23, 2024 · Updated 2 years ago
apple / ml-ferret
☆8,678 · Oct 9, 2024 · Updated last year
GoogleCloudPlatform / localllm
☆1,548 · Apr 25, 2024 · Updated 2 years ago
ml-explore / mlx
MLX: An array framework for Apple silicon
☆27,648 · Updated this week
Stability-AI / StableCascade
Official Code for Stable Cascade
☆6,544 · Jul 25, 2024 · Updated last year
haotian-liu / LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
☆24,937 · Aug 12, 2024 · Updated last year
FujiwaraChoki / MoneyPrinter
Automate Creation of YouTube Shorts using MoviePy.
☆13,777 · Mar 26, 2026 · Updated 3 months ago
metavoiceio / metavoice-src
Foundational model for human-like, expressive TTS
☆4,203 · Jul 30, 2024 · Updated last year
LargeWorldModel / LWM
Large World Model -- Modeling Text and Video with Millions Context
☆7,427 · Oct 19, 2024 · Updated last year
apple / corenet
CoreNet: A library for training deep neural networks
☆7,003 · Oct 9, 2025 · Updated 9 months ago
TencentARC / PhotoMaker
PhotoMaker [CVPR 2024]
☆10,099 · Oct 31, 2024 · Updated last year
YangLing0818 / RPG-DiffusionMaster
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)
☆1,838 · Feb 1, 2025 · Updated last year
ml-explore / mlx-examples
Examples in the MLX framework
☆8,844 · Apr 6, 2026 · Updated 3 months ago
apple / ml-stable-diffusion
Stable Diffusion with Core ML on Apple Silicon
☆17,948 · Jul 3, 2025 · Updated last year
argmaxinc / argmax-oss-swift
On-device Speech AI for Apple Silicon
☆6,281 · Jul 13, 2026 · Updated last week
jasonppy / VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
☆8,501 · May 30, 2026 · Updated last month
instantX-research / InstantID
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
☆11,976 · Jul 18, 2024 · Updated 2 years ago
myshell-ai / OpenVoice
Instant voice cloning by MIT and MyShell. Audio foundation model.
☆36,998 · Apr 19, 2025 · Updated last year
PKU-YuanGroup / Open-Sora-Plan
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.
☆12,153 · Mar 8, 2026 · Updated 4 months ago
ali-vilab / AnyDoor
Official implementations for paper: Anydoor: zero-shot object-level image customization
☆4,234 · Apr 8, 2024 · Updated 2 years ago
hpcaitech / Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
☆29,212 · Apr 9, 2026 · Updated 3 months ago
m87-labs / moondream
tiny vision language model
☆9,875 · Apr 20, 2026 · Updated 3 months ago
cumulo-autumn / StreamDiffusion
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
☆10,787 · Dec 4, 2024 · Updated last year
facebookresearch / audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…
☆23,512 · Mar 3, 2026 · Updated 4 months ago
TencentARC / SmartEdit
Official code of SmartEdit [CVPR-2024 Highlight]
☆374 · Jun 21, 2024 · Updated 2 years ago
Doubiiu / DynamiCrafter
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
☆3,006 · Sep 8, 2024 · Updated last year
ali-vilab / VGen
Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models
☆3,156 · Jan 10, 2025 · Updated last year
Stability-AI / generative-models
Generative Models by Stability AI
☆27,234 · Dec 16, 2025 · Updated 7 months ago
facebookresearch / audio2photoreal
Code and dataset for photorealistic Codec Avatars driven from audio
☆2,855 · Sep 15, 2024 · Updated last year
luosiallen / latent-consistency-model
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
☆4,615 · Jun 14, 2024 · Updated 2 years ago
google / gemma.cpp
lightweight, standalone C++ inference engine for Google's Gemma models.
☆6,989 · Updated this week
mozilla-ai / llamafile
Distribute and run LLMs with a single file.
☆25,416 · Updated this week
facebookresearch / seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
☆11,817 · Apr 8, 2026 · Updated 3 months ago
AILab-CVC / YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
☆6,476 · Feb 26, 2025 · Updated last year
karpathy / minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
☆10,633 · Jul 1, 2024 · Updated 2 years ago
lllyasviel / sd-forge-layerdiffuse
[WIP] Layer Diffusion for WebUI (via Forge)
☆4,116 · Aug 30, 2024 · Updated last year
AILab-CVC / VideoCrafter
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
☆5,068 · Jan 9, 2026 · Updated 6 months ago
Alpha-VLLM / Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
☆2,247 · Feb 16, 2025 · Updated last year
meta-llama / llama-cookbook
Welcome to the Llama Cookbook! This is your go-to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als…
☆18,521 · May 19, 2026 · Updated 2 months ago