apple / ml-mgieLinks
☆3,889Updated last year
Alternatives and similar repositories for ml-mgie
Users that are interested in ml-mgie are comparing it to the libraries listed below
Sorting:
- ☆8,675Updated last year
- On-device Speech Recognition for Apple Silicon☆5,418Updated last month
- Examples in the MLX framework☆8,120Updated 3 weeks ago
- An Extensible Deep Learning Library☆2,311Updated this week
- 4M: Massively Multimodal Masked Modeling☆1,780Updated 7 months ago
- CoreNet: A library for training deep neural networks☆7,022Updated 3 months ago
- Foundational model for human-like, expressive TTS☆4,196Updated last year
- Official Code for Stable Cascade☆6,586Updated last year
- Examples using MLX Swift☆2,371Updated last week
- Code and dataset for photorealistic Codec Avatars driven from audio☆2,850Updated last year
- ☆1,550Updated last year
- Consistency Distilled Diff VAE☆2,206Updated 2 years ago
- An intuitive GUI for GLIGEN that uses ComfyUI in the backend☆2,048Updated last year
- A Gradio demo of MGIE☆347Updated last year
- Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"☆3,331Updated last year
- PyTorch code and models for V-JEPA self-supervised learning from video.☆3,427Updated 10 months ago
- This repository provides the code and model checkpoints for AIMv1 and AIMv2 research projects.☆1,393Updated 5 months ago
- Official implementation of DreaMoving☆1,801Updated 2 years ago
- llama and other large language models on iOS and MacOS offline using GGML library.☆1,952Updated last month
- [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)☆1,840Updated 11 months ago
- Official code for "Style Aligned Image Generation via Shared Attention"☆1,312Updated 2 years ago
- Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.☆2,076Updated last year
- Mora: More like Sora for Generalist Video Generation☆1,585Updated last year
- A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM☆3,097Updated 9 months ago
- Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models☆3,149Updated last year
- TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones☆1,306Updated last year
- Run Mixtral-8x7B models in Colab or consumer desktops☆2,328Updated last year
- Let us democratise high-resolution generation! (CVPR 2024)☆2,038Updated 3 months ago
- Generate and auto-execute Python scripts in the cli☆1,809Updated 4 months ago
- Official implementations for paper: Anydoor: zero-shot object-level image customization☆4,210Updated last year