kakaobrain/magvlt

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/kakaobrain/magvlt)

kakaobrain / magvlt

The official implementation of MAGVLT: Masked Generative Vision-and-Language Transformer (CVPR'23)

☆28

Alternatives and similar repositories for magvlt

Users that are interested in magvlt are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ONground-Korea / 2023-AIKU_DeepLearning-Bootcamp
View on GitHub
2023-1 고려대학교 AIKU 딥러닝 방학 부트캠프: Deep into Deep
☆10Jul 10, 2023Updated 3 years ago
Vyvo-Labs / CodecHub
View on GitHub
CodecHub: A Unified Library for Codec Models
☆25Dec 24, 2025Updated 6 months ago
demegire / Parameterization-of-Hypercomplex-Multiplications
View on GitHub
This is a reproduction of the paper 'Beyond Fully-Connected Layers with Quaternions: Parameterization of Hypercomplex Multiplications wit…
☆13Aug 22, 2021Updated 4 years ago
genea-workshop / Speech_driven_gesture_generation_with_autoencoder
View on GitHub
This is the official implementation for IVA '19 paper "Analyzing Input and Output Representations for Speech-Driven Gesture Generation".
☆10Jul 12, 2022Updated 4 years ago
Yangyangii / AdvDCTTS
View on GitHub
Implementation of DCTTS with Adversarial Training
☆12Dec 30, 2019Updated 6 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ljuvela / multiscale-GAN
View on GitHub
Code for ICASSP 2019 paper
☆18Oct 29, 2018Updated 7 years ago
mcahny / rovit
View on GitHub
RO-ViT CVPR 2023 "Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers"
☆17Aug 24, 2023Updated 2 years ago
Seung-Hun-Lee / DRANet
View on GitHub
Official code for DRANet: Disentangling Representation and Adaptation Networks for Unsupervised Cross-Domain Adaptation
☆44Dec 20, 2022Updated 3 years ago
Lifelong-ML / LASEM
View on GitHub
Code for the ICML 2021 paper "Sharing Less is More: Lifelong Learning in Deep Networks with Selective Layer Transfer"
☆12Aug 17, 2021Updated 4 years ago
Miffyli / rl-human-prior-tricks
View on GitHub
Evaluating different engineering tricks that make RL work
☆15Jun 3, 2021Updated 5 years ago
WHU-ZQH / DUP
View on GitHub
☆16Mar 6, 2025Updated last year
liuhuang31 / Megatts2_HierSpeechpp
View on GitHub
Megatts2 use HierSpeechpp's vocoder
☆18Dec 2, 2024Updated last year
brentyi / transformer-exercises-jax
View on GitHub
☆18Apr 17, 2026Updated 3 months ago
beat2022dataset / beat
View on GitHub
☆13Mar 30, 2022Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Relph1119 / machine-learning-blueprints
View on GitHub
《Python机器学习实践指南》代码和笔记
☆12Aug 26, 2020Updated 5 years ago
cjerry1243 / Tacotron2-SpeechGesture
View on GitHub
This is the official repository for our publication "The IVI Lab entry to the GENEA Challenge 2022 – A Tacotron2 Based Method for Co-Spee…
☆13May 2, 2023Updated 3 years ago
neoncloud / mdctGAN
View on GitHub
Code for INTERSPEECH 2023 paper "mdctGAN: Taming transformer-based GAN for speech super-resolution with Modified DCT spectra"
☆66Jun 3, 2023Updated 3 years ago
soldni / springs
View on GitHub
A set of utilities to turn Dataclasses into useful configuration managers.
☆11Mar 27, 2024Updated 2 years ago
krafton-ai / Raon-Speech
View on GitHub
Open-source speech AI models from KRAFTON, including Raon-Speech and Raon-SpeechChat for speech understanding, generation, and real-time …
☆72Apr 7, 2026Updated 3 months ago
Vincentx15 / Equi-RC
View on GitHub
Equivariant layers for RC-complement symmetry in DNA sequence data
☆13Feb 24, 2022Updated 4 years ago
ZhecanJamesWang / GLAT_SGG
View on GitHub
Code for GLAT (Global Local Transformer), ECCV 2020 "Learning Visual Commonsense for Robust Scene Graph Generation"
☆11Dec 16, 2020Updated 5 years ago
ali-chr / Semantic-aware-Knowledge-Distillation-for-Few-ShotClass-Incremental-Learning
View on GitHub
CVPR2021
☆12Mar 29, 2021Updated 5 years ago
VisualSystemsCorp / vsc_quill_delta_to_html
View on GitHub
☆20Jan 2, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
1DreamCollector / Tooth-and-alveolar-bone-segmentation-from-CBCT-main
View on GitHub
☆25Oct 21, 2024Updated last year
lstrgar / ss-phoneme-seg
View on GitHub
Code for "Phoneme Segmentation Using Self-Supervised Speech Models", Strgar & Harwath, Proceedings of the IEEE Spoken Language Technology…
☆55Nov 4, 2022Updated 3 years ago
amirhb29 / StyleGAN2_Style-Mixing
View on GitHub
Using
☆15Dec 11, 2020Updated 5 years ago
RF5 / simple-asgan
View on GitHub
Training code and trained checkpoints for ASGAN.
☆62Dec 27, 2023Updated 2 years ago
Tsinghua-MARS-Lab / NeuralDubber
View on GitHub
The project page repo for Neural Dubber.
☆30Sep 20, 2023Updated 2 years ago
epicrispr-biotechnologies / evolutionary_monte_carlo_search
View on GitHub
Implementation of Evolutionary and Metropolis Hastings Monte Carlo for text based (e.g. nucleotide/peptide) sequences
☆13Mar 7, 2024Updated 2 years ago
amazon-science / explainable-trajectory-prediction
View on GitHub
Official code repository for the ICLR 2022 paper "You Mostly Walk Alone: Analyzing Feature Attribution in Trajectory Prediction".
☆14Jul 25, 2024Updated last year
DCBIA-OrthoLab / 3DTeethSeg22_challenge
View on GitHub
Dental model seg challenge docker repository build
☆20Sep 14, 2022Updated 3 years ago
rarora7777 / curve-on-surface-drawing-vr
View on GitHub
Source code and study data for the TOG 2021 paper: Mid-Air Drawing of Curves on 3D Surfaces in Virtual Reality.
☆23Mar 22, 2022Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
heraclex12 / Viwiki-spelling
View on GitHub
A dataset for Vietnamese Spelling Correction
☆17Sep 27, 2021Updated 4 years ago
RuslanKhalitov / ChordMixer
View on GitHub
The official implementation of the ChordMixer architecture.
☆62May 23, 2023Updated 3 years ago
Anynoumsiccv9970 / G2P-DDM
View on GitHub
☆14May 31, 2023Updated 3 years ago
ShanghaiTech-IMPACT / 3D-Structure-guided-Network-for-Tooth-Alignment-in-2D-Photograph
View on GitHub
[BMVC 2023] 3D Structure-guided Network for Tooth Alignment in 2D Photograph
☆31Mar 12, 2025Updated last year
SivanDoveh / DAC
View on GitHub
Repository for the paper: dense and aligned captions (dac) promote compositional reasoning in vl models
☆28Nov 29, 2023Updated 2 years ago
mi2rl / CheSS
View on GitHub
☆26Jun 26, 2025Updated last year
wlgjs8 / Segmentation-based-Registration
View on GitHub
Implementation of "CBCT-Dental Scan Registration via Metal-Robust CT Segmentation"
☆28Jan 5, 2024Updated 2 years ago