apple / ml-mgie
☆3,877Updated last year
Alternatives and similar repositories for ml-mgie
Users that are interested in ml-mgie are comparing it to the libraries listed below
Sorting:
- ☆8,617Updated 7 months ago
- CoreNet: A library for training deep neural networks☆7,007Updated this week
- PyTorch code and models for V-JEPA self-supervised learning from video.☆2,985Updated 2 months ago
- 【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection☆3,241Updated 5 months ago
- On-device Speech Recognition for Apple Silicon☆4,604Updated last week
- Foundational model for human-like, expressive TTS☆4,115Updated 9 months ago
- ☆1,544Updated last year
- An open source AI wearable device that captures what you say and hear in the real world and then transcribes and stores it on your own se…☆3,032Updated last year
- The official PyTorch implementation of Google's Gemma models☆5,441Updated last month
- A Gradio demo of MGIE☆347Updated last year
- An Extensible Deep Learning Library☆2,046Updated this week
- Run Mixtral-8x7B models in Colab or consumer desktops☆2,309Updated last year
- Consistency Distilled Diff VAE☆2,186Updated last year
- Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"☆3,272Updated last year
- Official Code for Stable Cascade☆6,592Updated 9 months ago
- Code and models for ICML 2024 paper, NExT-GPT: Any-to-Any Multimodal Large Language Model☆3,495Updated 6 months ago
- TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones☆1,284Updated last year
- lightweight, standalone C++ inference engine for Google's Gemma models.☆6,396Updated this week
- OpenCodeInterpreter is a suite of open-source code generation systems aimed at bridging the gap between large language models and sophist…☆1,651Updated last year
- ☆2,249Updated last year
- A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM☆2,958Updated last month
- Open-source and strong foundation image recognition models.☆3,226Updated 2 months ago
- [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)☆1,801Updated 3 months ago
- Foundational Models for State-of-the-Art Speech and Text Translation☆11,509Updated 6 months ago
- Llama-3 agents that can browse the web by following instructions and talking to you☆1,401Updated 5 months ago
- Large World Model -- Modeling Text and Video with Millions Context☆7,272Updated 6 months ago
- 4M: Massively Multimodal Masked Modeling☆1,721Updated 2 months ago
- Examples using MLX Swift☆1,774Updated last week
- Swift app demonstrating Core ML Stable Diffusion☆2,673Updated 10 months ago
- ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Expert…☆1,435Updated 2 months ago