The official implementation of MAGVLT: Masked Generative Vision-and-Language Transformer (CVPR'23)
☆28Jan 20, 2024Updated 2 years ago
Alternatives and similar repositories for magvlt
Users that are interested in magvlt are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆63Jan 15, 2024Updated 2 years ago
- [ICLR 2024] Contextualized Diffusion Models for Text-Guided Image and Video Generation☆73May 24, 2024Updated last year
- ☆28Jan 22, 2026Updated 3 months ago
- Implementation of DCTTS with Adversarial Training☆12Dec 30, 2019Updated 6 years ago
- This is the official implementation for IVA '19 paper "Analyzing Input and Output Representations for Speech-Driven Gesture Generation".☆10Jul 12, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- RO-ViT CVPR 2023 "Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers"☆17Aug 24, 2023Updated 2 years ago
- Code for ICASSP 2019 paper☆18Oct 29, 2018Updated 7 years ago
- ☆12Dec 29, 2023Updated 2 years ago
- Template project for VR 360 Video Player series.☆15Sep 10, 2019Updated 6 years ago
- Code for the ICML 2021 paper "Sharing Less is More: Lifelong Learning in Deep Networks with Selective Layer Transfer"☆12Aug 17, 2021Updated 4 years ago
- The ReprGesture entry to the GENEA Challenge 2022 (IMCI 2022)☆16Nov 8, 2022Updated 3 years ago
- ☆18Apr 17, 2026Updated last month
- Megatts2 use HierSpeechpp's vocoder☆18Dec 2, 2024Updated last year
- Implementation of a Hierarchical Mamba as described in the paper: "Hierarchical State Space Models for Continuous Sequence-to-Sequence Mo…☆15Nov 11, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- 《Python机器学习实践指南》代码和笔记☆12Aug 26, 2020Updated 5 years ago
- This is the official repository for our publication "The IVI Lab entry to the GENEA Challenge 2022 – A Tacotron2 Based Method for Co-Spee…☆13May 2, 2023Updated 3 years ago
- Official Implementation of AQ-GT: a Temporally Aligned and Quantized GRU-Transformer for Co-Speech Gesture Synthesis with the extension (…☆21Apr 19, 2024Updated 2 years ago
- A set of utilities to turn Dataclasses into useful configuration managers.☆11Mar 27, 2024Updated 2 years ago
- Code for INTERSPEECH 2023 paper "mdctGAN: Taming transformer-based GAN for speech super-resolution with Modified DCT spectra"☆66Jun 3, 2023Updated 2 years ago
- Pytorch implementation of LearnableUpsamplingLayer (NaturalSpeech, Tan et al., 2022)☆57Mar 12, 2024Updated 2 years ago
- Code for GLAT (Global Local Transformer), ECCV 2020 "Learning Visual Commonsense for Robust Scene Graph Generation"☆11Dec 16, 2020Updated 5 years ago
- [AAAI 2024] Code for CTX-vec2wav in UniCATS☆130Jun 11, 2024Updated last year
- A framework ,for unity client,implements the network based on protobuf.☆14Feb 2, 2021Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆20Jan 2, 2025Updated last year
- AI for Frappe ERPNext☆35May 12, 2026Updated last week
- Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models☆13Mar 9, 2024Updated 2 years ago
- ☆13Sep 23, 2023Updated 2 years ago
- Official PyTorch Implementation of "Rosetta Neurons: Mining the Common Units in a Model Zoo"☆31Oct 17, 2023Updated 2 years ago
- DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code☆10Mar 8, 2022Updated 4 years ago
- AI_School / CCTV 이상행동 감지 서비스 만들기 / (200910 ~ 200923)☆11Dec 14, 2020Updated 5 years ago
- Disentangled Implicit Content and Rhythm Learning for Diverse Co-Speech Gestures Synthesis [ACMMM 2022]☆27Jun 26, 2025Updated 10 months ago
- ☆24Feb 2, 2026Updated 3 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Implementation of Evolutionary and Metropolis Hastings Monte Carlo for text based (e.g. nucleotide/peptide) sequences☆13Mar 7, 2024Updated 2 years ago
- The official implementation of the ChordMixer architecture.☆62May 23, 2023Updated 2 years ago
- Source code and study data for the TOG 2021 paper: Mid-Air Drawing of Curves on 3D Surfaces in Virtual Reality.☆23Mar 22, 2022Updated 4 years ago
- This repo is text to speech with learnable audio encoder without alignment with transcript reference☆53Sep 20, 2025Updated 8 months ago
- ☆14May 31, 2023Updated 2 years ago
- Implementation of "PaLM2-VAdapter:" from the multi-modal model paper: "PaLM2-VAdapter: Progressively Aligned Language Model Makes a Stron…☆17Nov 11, 2024Updated last year
- This is the official repo of CVPR 2024 paper "Multimodal Sense-Informed Prediction of 3D Human Motions"☆25May 31, 2024Updated last year