apple / ml-mgie
☆3,888 Updated last year
Alternatives and similar repositories for ml-mgie
Users who are interested in ml-mgie are comparing it to the repositories listed below.
- ☆8,672 Updated last year
- Official Code for Stable Cascade ☆6,581 Updated last year
- ☆1,549 Updated last year
- Examples in the MLX framework ☆8,216 Updated last week
- CoreNet: A library for training deep neural networks ☆7,016 Updated 4 months ago
- 4M: Massively Multimodal Masked Modeling ☆1,789 Updated 8 months ago
- Mora: More like Sora for Generalist Video Generation ☆1,584 Updated last year
- An Extensible Deep Learning Library ☆2,319 Updated this week
- Examples using MLX Swift ☆2,413 Updated 2 weeks ago
- On-device Speech Recognition for Apple Silicon ☆5,574 Updated 2 weeks ago
- [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG) ☆1,843 Updated last year
- Code and dataset for photorealistic Codec Avatars driven from audio ☆2,855 Updated last year
- Official code for "Style Aligned Image Generation via Shared Attention" ☆1,314 Updated 2 years ago
- Let us democratise high-resolution generation! (CVPR 2024) ☆2,046 Updated 4 months ago
- OpenCodeInterpreter is a suite of open-source code generation systems aimed at bridging the gap between large language models and sophist… ☆1,703 Updated last year
- Official implementation of DreaMoving ☆1,802 Updated 2 years ago
- Consistency Distilled Diff VAE ☆2,207 Updated 2 years ago
- ☆2,551 Updated last year
- 【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection ☆3,448 Updated last year
- Official implementations for paper: Anydoor: zero-shot object-level image customization ☆4,217 Updated last year
- Swift app demonstrating Core ML Stable Diffusion ☆2,735 Updated 3 months ago
- Inference Llama 2 in one file of pure 🔥 ☆2,116 Updated 2 months ago
- Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR. ☆2,085 Updated last year
- Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models ☆3,153 Updated last year
- Mac app for Ollama ☆1,897 Updated 10 months ago
- TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones ☆1,307 Updated this week
- PyTorch code and models for V-JEPA self-supervised learning from video. ☆3,499 Updated 11 months ago
- Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models" ☆3,334 Updated last year
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis ☆3,275 Updated last year
- Large World Model -- Modeling Text and Video with Millions Context ☆7,393 Updated last year