apple / ml-mgie
☆3,877Updated last year
Alternatives and similar repositories for ml-mgie:
Users that are interested in ml-mgie are comparing it to the libraries listed below
- ☆5Updated 2 years ago
- ☆4Updated 2 years ago
- Examples in the MLX framework☆7,306Updated 3 weeks ago
- On-device Speech Recognition for Apple Silicon☆4,510Updated this week
- ☆8,610Updated 6 months ago
- Examples using MLX Swift☆1,674Updated this week
- ☆1,541Updated 11 months ago
- Foundational model for human-like, expressive TTS☆4,094Updated 8 months ago
- Foundational Models for State-of-the-Art Speech and Text Translation☆11,485Updated 5 months ago
- MLX: An array framework for Apple silicon☆20,212Updated this week
- Code and dataset for photorealistic Codec Avatars driven from audio☆2,789Updated 7 months ago
- [being rewritten] Cross-platform iMessage POC☆3,624Updated 10 months ago
- Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models☆3,100Updated 3 months ago
- Mora: More like Sora for Generalist Video Generation☆1,557Updated 6 months ago
- [ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors☆2,834Updated 7 months ago
- Swift app demonstrating Core ML Stable Diffusion☆2,663Updated 9 months ago
- An Extensible Deep Learning Library☆2,020Updated this week
- 【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection☆3,225Updated 4 months ago
- A Gradio demo of MGIE☆346Updated last year
- Official Code for Stable Cascade☆6,595Updated 8 months ago
- Official implementations for paper: Anydoor: zero-shot object-level image customization☆4,129Updated last year
- CoreNet: A library for training deep neural networks☆7,006Updated 6 months ago
- Code for the paper "Visual Anagrams: Generating Multi-View Optical Illusions with Diffusion Models"☆916Updated 10 months ago
- llama and other large language models on iOS and MacOS offline using GGML library.☆1,731Updated last month
- tiny vision language model☆7,796Updated last week
- PyTorch code and models for V-JEPA self-supervised learning from video.☆2,906Updated last month
- [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)☆1,792Updated 2 months ago
- Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"☆3,270Updated 11 months ago
- Run Mixtral-8x7B models in Colab or consumer desktops☆2,306Updated last year
- StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation☆10,148Updated 4 months ago