apple / ml-mgieLinks
☆3,891Updated last year
Alternatives and similar repositories for ml-mgie
Users that are interested in ml-mgie are comparing it to the libraries listed below
Sorting:
- ☆8,646Updated 10 months ago
- CoreNet: A library for training deep neural networks☆7,017Updated 3 months ago
- Examples using MLX Swift☆2,001Updated 2 weeks ago
- Mora: More like Sora for Generalist Video Generation☆1,566Updated 10 months ago
- On-device Speech Recognition for Apple Silicon☆4,907Updated 2 weeks ago
- Examples in the MLX framework☆7,733Updated 2 months ago
- An Extensible Deep Learning Library☆2,227Updated this week
- Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models☆3,130Updated 7 months ago
- Official Code for Stable Cascade☆6,592Updated last year
- 【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection☆3,334Updated 8 months ago
- Code and dataset for photorealistic Codec Avatars driven from audio☆2,828Updated 11 months ago
- Foundational model for human-like, expressive TTS☆4,146Updated last year
- A Gradio demo of MGIE☆346Updated last year
- ☆1,555Updated last year
- Let us democratise high-resolution generation! (CVPR 2024)☆2,023Updated last year
- Official implementation of DreaMoving☆1,802Updated last year
- Swift app demonstrating Core ML Stable Diffusion☆2,704Updated last year
- Run Mixtral-8x7B models in Colab or consumer desktops☆2,317Updated last year
- 4M: Massively Multimodal Masked Modeling☆1,756Updated 2 months ago
- Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"☆3,303Updated last year
- Character Animation (AnimateAnyone, Face Reenactment)☆3,430Updated last year
- Consistency Distilled Diff VAE☆2,195Updated last year
- The #1 open-source voice interface for desktop, mobile, and ESP32 chips.☆5,084Updated 9 months ago
- llama and other large language models on iOS and MacOS offline using GGML library.☆1,844Updated last week
- A MLX port of FLUX based on the Huggingface Diffusers implementation.☆1,521Updated last week
- On-device Image Generation for Apple Silicon☆642Updated 4 months ago
- [ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors☆2,922Updated 11 months ago
- Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.☆2,047Updated last year
- [SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation☆2,985Updated last year
- [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)☆1,823Updated 6 months ago