apple / ml-mgie
☆3,884 Updated last year
Alternatives and similar repositories for ml-mgie
Users who are interested in ml-mgie are comparing it to the libraries listed below.
- ☆8,632 Updated 8 months ago
- ☆1,556 Updated last year
- A Gradio demo of MGIE ☆347 Updated last year
- Official Code for Stable Cascade ☆6,589 Updated 11 months ago
- Foundational model for human-like, expressive TTS ☆4,132 Updated 10 months ago
- CoreNet: A library for training deep neural networks ☆7,016 Updated last month
- Consistency Distilled Diff VAE ☆2,190 Updated last year
- Accepted as [NeurIPS 2024] Spotlight Presentation Paper ☆6,307 Updated 9 months ago
- Official code for "Style Aligned Image Generation via Shared Attention" ☆1,290 Updated last year
- Mora: More like Sora for Generalist Video Generation ☆1,561 Updated 8 months ago
- [SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation ☆5,844 Updated 3 months ago
- Large Action Model framework to develop AI Web Agents ☆6,080 Updated 5 months ago
- Code for the paper "Visual Anagrams: Generating Multi-View Optical Illusions with Diffusion Models" ☆930 Updated last year
- [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG) ☆1,814 Updated 4 months ago
- 【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection ☆3,287 Updated 6 months ago
- TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones ☆1,290 Updated last year
- Llama-3 agents that can browse the web by following instructions and talking to you ☆1,404 Updated 6 months ago
- Swift app demonstrating Core ML Stable Diffusion ☆2,688 Updated last year
- PyTorch code and models for V-JEPA self-supervised learning from video. ☆3,102 Updated 4 months ago
- Examples in the MLX framework ☆7,555 Updated 2 weeks ago
- [ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors ☆2,886 Updated 9 months ago
- Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models ☆3,111 Updated 5 months ago
- On-device Speech Recognition for Apple Silicon ☆4,752 Updated this week
- 4M: Massively Multimodal Masked Modeling ☆1,740 Updated 3 weeks ago
- An intuitive GUI for GLIGEN that uses ComfyUI in the backend ☆2,039 Updated last year
- [ICCV 2023] StableVideo: Text-driven Consistency-aware Diffusion Video Editing ☆1,434 Updated last year
- This repository provides the code and model checkpoints for AIMv1 and AIMv2 research projects. ☆1,305 Updated 2 months ago
- Run Mixtral-8x7B models in Colab or consumer desktops ☆2,312 Updated last year
- Foundational Models for State-of-the-Art Speech and Text Translation ☆11,565 Updated 7 months ago
- OpenCodeInterpreter is a suite of open-source code generation systems aimed at bridging the gap between large language models and sophist… ☆1,666 Updated last year