apple / ml-mgieLinks
☆3,891Updated last year
Alternatives and similar repositories for ml-mgie
Users that are interested in ml-mgie are comparing it to the libraries listed below
Sorting:
- ☆8,650Updated 10 months ago
- Examples in the MLX framework☆7,789Updated last week
- An Extensible Deep Learning Library☆2,233Updated last week
- ☆1,556Updated last year
- Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models☆3,131Updated 7 months ago
- A Gradio demo of MGIE☆347Updated last year
- Examples using MLX Swift☆2,029Updated this week
- Let us democratise high-resolution generation! (CVPR 2024)☆2,028Updated last year
- Official implementations for paper: Anydoor: zero-shot object-level image customization☆4,181Updated last year
- Swift app demonstrating Core ML Stable Diffusion☆2,707Updated last year
- Code and dataset for photorealistic Codec Avatars driven from audio☆2,836Updated 11 months ago
- Generate and auto-execute Python scripts in the cli☆1,808Updated 2 weeks ago
- [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)☆1,821Updated 7 months ago
- Official code for "Style Aligned Image Generation via Shared Attention"☆1,297Updated last year
- An intuitive GUI for GLIGEN that uses ComfyUI in the backend☆2,046Updated last year
- Clarity AI | AI Image Upscaler & Enhancer - free and open-source Magnific Alternative☆4,717Updated 6 months ago
- Run Mixtral-8x7B models in Colab or consumer desktops☆2,316Updated last year
- CoreNet: A library for training deep neural networks☆7,021Updated 2 weeks ago
- Mora: More like Sora for Generalist Video Generation☆1,568Updated 10 months ago
- Llama-3 agents that can browse the web by following instructions and talking to you☆1,413Updated 8 months ago
- ☆2,531Updated last year
- Foundational model for human-like, expressive TTS☆4,158Updated last year
- 4M: Massively Multimodal Masked Modeling☆1,763Updated 3 months ago
- On-device Speech Recognition for Apple Silicon☆4,986Updated last week
- Code for the paper "Visual Anagrams: Generating Multi-View Optical Illusions with Diffusion Models"☆943Updated last year
- Official implementation of DreaMoving☆1,802Updated last year
- lightweight, standalone C++ inference engine for Google's Gemma models.☆6,556Updated this week
- PyTorch code and models for V-JEPA self-supervised learning from video.☆3,184Updated 6 months ago
- MLX: An array framework for Apple silicon☆22,094Updated this week
- Foundational Models for State-of-the-Art Speech and Text Translation☆11,654Updated 9 months ago