kyegomez/Mirasol

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/kyegomez/Mirasol)

kyegomez / Mirasol

Pytorch Implementation of the Model from "MIRASOL3B: A MULTIMODAL AUTOREGRESSIVE MODEL FOR TIME-ALIGNED AND CONTEXTUAL MODALITIES"

☆26

Alternatives and similar repositories for Mirasol

Users that are interested in Mirasol are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

kyegomez / MobileVLM
View on GitHub
Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …
☆15Mar 11, 2024Updated 2 years ago
lucidrains / mirasol-pytorch
View on GitHub
Implementation of 🌻 Mirasol, SOTA Multimodal Autoregressive model out of Google Deepmind, in Pytorch
☆92Dec 22, 2023Updated 2 years ago
HUIZ-A / SVA
View on GitHub
☆20Apr 26, 2024Updated 2 years ago
kyegomez / forest-of-thoughts
View on GitHub
A forest of autonomous agents.
☆20Jan 27, 2025Updated last year
uw-mad-dash / decoding-speculative-decoding
View on GitHub
☆16Aug 19, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
kyegomez / dev-swarm
View on GitHub
A swarm of LLM agents that will help you test, document, and productionize your code!
☆20Updated this week
kyegomez / TeraGPT
View on GitHub
Train a production grade GPT in less than 400 lines of code. Better than Karpathy's verison and GIGAGPT
☆17Updated this week
schowdhury671 / meerkat
View on GitHub
☆35Jul 9, 2025Updated last year
kyegomez / PaLM2-VAdapter
View on GitHub
Implementation of "PaLM2-VAdapter:" from the multi-modal model paper: "PaLM2-VAdapter: Progressively Aligned Language Model Makes a Stron…
☆17Nov 11, 2024Updated last year
kyegomez / SelfExtend
View on GitHub
Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta
☆13Nov 11, 2024Updated last year
kyegomez / FastFF
View on GitHub
Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling"
☆16Nov 11, 2024Updated last year
kyegomez / Sora
View on GitHub
Implementation of the premier Text to Video model from OpenAI
☆57Nov 11, 2024Updated last year
The-Swarm-Corporation / HTX-Swarm
View on GitHub
A sophisticated multi-agent system designed for real-time market analysis of HTX (formerly Huobi) exchange data. This swarm combines spec…
☆11Mar 18, 2025Updated last year
DISL-Lab / BalanceMix
View on GitHub
☆15Dec 12, 2023Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
The-Swarm-Corporation / AgentParse
View on GitHub
AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…
☆18Oct 13, 2025Updated 9 months ago
The-Swarm-Corporation / Brainwave
View on GitHub
Brainwave is a state-of-the-art neural decoder that transforms electroencephalogram (EEG) and brain signals into multimodal outputs inclu…
☆14Oct 6, 2025Updated 9 months ago
kyegomez / SoundStream
View on GitHub
Implementation of SoundtStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"
☆13Jan 27, 2025Updated last year
kyegomez / USM
View on GitHub
Implementation of Google's USM speech model in Pytorch
☆36Jul 20, 2026Updated last week
borisdayma / sora-mini
View on GitHub
☆18Feb 16, 2024Updated 2 years ago
kyegomez / Pairformer
View on GitHub
Implementation of the Pairformer model used in AlphaFold 3
☆14Jul 20, 2026Updated last week
rikeilong / Bay-CAT
View on GitHub
[ECCV’24] Official Implementation for CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenario…
☆59Sep 4, 2024Updated last year
kyegomez / OmniByteFormer
View on GitHub
OmniByteFormer is a generalized Transformer model that can process any type of data by converting it into byte sequences, bypassing tradi…
☆16Jul 20, 2026Updated last week
kyegomez / TinyGPTV
View on GitHub
Simple Implementation of TinyGPTV in super simple Zeta lego blocks
☆16Nov 11, 2024Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
LAION-AI / worldsim
View on GitHub
☆13Aug 29, 2023Updated 2 years ago
LOVISHARYX / HRV-and-GSR-as-Viable-Physiological-Markers-for-Mental-Health-Recognition
View on GitHub
Mental stress has become a standard part of day-to-day life. However, experiencing long-term and high-level stress affects the daily life…
☆16Dec 8, 2022Updated 3 years ago
kyegomez / Audio-xLSTMs
View on GitHub
Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch
☆20Updated this week
kyegomez / Infini-attention
View on GitHub
Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO…
☆59Updated this week
YuxiangChai / AMEX-codebase
View on GitHub
☆33Sep 27, 2024Updated last year
model-similarity / lm-similarity
View on GitHub
☆21Feb 10, 2025Updated last year
AV-Odyssey / AV-Odyssey
View on GitHub
This repo contains evaluation code for the paper "AV-Odyssey: Can Your Multimodal LLMs Really Understand Audio-Visual Information?"
☆31Dec 23, 2024Updated last year
lzhangbj / ASVA
View on GitHub
[ECCV 2024 Oral] Audio-Synchronized Visual Animation
☆60Mar 15, 2026Updated 4 months ago
SwiftieH / SpGAT
View on GitHub
Spectral Graph Attention Network with Fast Eigen-approximation
☆11Dec 24, 2021Updated 4 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
zeyuxie29 / PicoAudio
View on GitHub
☆45Jan 13, 2025Updated last year
CASIA-IVA-Lab / VALOR
View on GitHub
[TPAMI2024] Codes and Models for VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset
☆311Dec 25, 2024Updated last year
abduallahmohamed / HAR-GCNN
View on GitHub
Code for HAR-GCNN: Deep Graph CNNs for Human Activity Recognition From Highly Unlabeled Mobile Sensor Data, IEEE PerCom CoMoRea 2022
☆13May 9, 2022Updated 4 years ago
ZackBradshaw / ikigAI
View on GitHub
☆13Mar 28, 2024Updated 2 years ago
GalateaWang / TSGN-master
View on GitHub
☆12Nov 14, 2024Updated last year
kyegomez / HSSS
View on GitHub
Implementation of a Hierarchical Mamba as described in the paper: "Hierarchical State Space Models for Continuous Sequence-to-Sequence Mo…
☆16Nov 11, 2024Updated last year
stoneMo / OneAVM
View on GitHub
Official Codebase of "A Unified Audio-Visual Learning Framework for Localization, Separation, and Recognition" (ICML 2023)
☆12Jun 1, 2023Updated 3 years ago