Open Source Implementation of Dual Modality MAGVIT2 Tokenizer
☆25Nov 26, 2024Updated last year
Alternatives and similar repositories for O2-MAGVIT2
Users that are interested in O2-MAGVIT2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An Affordable LLM Pre-training Benchmark via Accurate Loss Prediction across Scales☆16Jun 6, 2024Updated last year
- ☆18Sep 5, 2024Updated last year
- Masked Structural Growth for 2x Faster Language Model Pre-training☆25Apr 28, 2024Updated last year
- ☆33Nov 25, 2025Updated 4 months ago
- A Novel Semantic Segmentation Network using Enhanced Boundaries in Cluttered Scenes (WACV 2025)☆12Aug 11, 2025Updated 8 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Official implementation of "MedITok: A Unified Tokenizer for Medical Image Synthesis and Interpretation"☆28Apr 3, 2026Updated last week
- Joint Source-Channel Coding of Images With Feedback☆13Apr 21, 2020Updated 5 years ago
- ☆16Sep 12, 2025Updated 7 months ago
- ☆13Feb 27, 2024Updated 2 years ago
- Export examples for ONNX☆11Oct 7, 2025Updated 6 months ago
- A python wrapper for Stanford CoreNLP, simple and customizable.☆13Oct 26, 2021Updated 4 years ago
- A public repository of "Generative AI Meets 6G and Beyond: Diffusion Models for Semantic Communications", which is a collection of educat…☆34Mar 23, 2026Updated 3 weeks ago
- Companion repository to "Prompt Compression and Contrastive Conditioning for Controllability and Toxicity Reduction in Language Models"☆14May 31, 2023Updated 2 years ago
- ☆17Sep 2, 2020Updated 5 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Official Implementation of NAF: Zero-Shot Feature Upsampling via Neighborhood Attention Filtering☆74Updated this week
- ☆24Jul 17, 2025Updated 8 months ago
- 🔥 open-ss2: a third-party open-source implementation of Figure AI's Helix "System 1, System 2" VLA model for high-rate, dexterous humano…☆11Mar 18, 2025Updated last year
- [NeurIPS 2023] MoVie: Visual Model-Based Policy Adaptation for View Generalization☆11Sep 22, 2023Updated 2 years ago
- imagetokenizer is a python package, helps you encoder visuals and generate visuals token ids from codebook, supports both image and video…☆40Jun 22, 2024Updated last year
- ☆11Dec 23, 2025Updated 3 months ago
- [NeurIPS 2025] Code for BEAST Experiments on CALVIN and LIBERO.☆34Jan 8, 2026Updated 3 months ago
- ☆37Feb 16, 2021Updated 5 years ago
- [NeurIPS 2024] DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model☆86Dec 5, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Convert Standard M2 format to parallel sentences.☆22Jun 20, 2020Updated 5 years ago
- LLaVA combines with Magvit Image tokenizer, training MLLM without an Vision Encoder. Unifying image understanding and generation.☆39Jun 20, 2024Updated last year
- Unofficial baselines for ManiSkill, including RL and BC algorithms.☆18Jun 6, 2024Updated last year
- Research without Re-search: Maximal Update Parametrization Yields Accurate Loss Prediction across Scales☆32Jul 17, 2023Updated 2 years ago
- [AAAI26] LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs☆56Dec 7, 2025Updated 4 months ago
- ☆25Apr 7, 2021Updated 5 years ago
- Person ReID(Re Identification) in NVIDIA Jetson TX2 for real time algorithm☆28Oct 12, 2018Updated 7 years ago
- DeepMIMO dataset examples☆31May 4, 2025Updated 11 months ago
- A code to rotate yolo bboxes along with the image.☆27Apr 7, 2020Updated 6 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Gearbox Assembly using Galaxea R1 - Simulation Platform based on Issac Lab☆29Dec 24, 2025Updated 3 months ago
- Official repo for From Intention to Execution: Probing the Generalization Boundaries of Vision-Language-Action Models☆31Nov 2, 2025Updated 5 months ago
- auto sign cursor☆20Feb 18, 2025Updated last year
- Implementation of the paper 'Improve Discourse Dependency Parsing with Contextualized Representations', Findings of NAACL 2022☆14Jul 15, 2022Updated 3 years ago
- REALM: A Real-to-Sim Validated Benchmark for Generalization in Robotic Manipulation☆49Apr 1, 2026Updated last week
- CMIVQA☆18Jun 3, 2024Updated last year
- Language/Clicking grounded SAM + VOS for real-time video object tracking☆20Jan 25, 2025Updated last year