Open Source Implementation of Dual Modality MAGVIT2 Tokenizer
☆24Nov 26, 2024Updated last year
Alternatives and similar repositories for O2-MAGVIT2
Users that are interested in O2-MAGVIT2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆18Sep 5, 2024Updated last year
- FLM-Audio is a audio-language subversion of RoboEgo/FLM-Ego -- an omnimodal model with native full duplexity.☆64Dec 9, 2025Updated 3 months ago
- Masked Structural Growth for 2x Faster Language Model Pre-training☆25Apr 28, 2024Updated last year
- Multiview Photometric Stereo (MVPS) Studio Hardware and Software for 3D Reconstruction☆26Jun 10, 2024Updated last year
- A Novel Semantic Segmentation Network using Enhanced Boundaries in Cluttered Scenes (WACV 2025)☆12Aug 11, 2025Updated 7 months ago
- [NeurIPS 2025] Better Tokens for Better 3D: Advancing Vision-Language Modeling in 3D Medical Imaging☆32Nov 4, 2025Updated 4 months ago
- 微信小程序-答题练习☆12Feb 4, 2023Updated 3 years ago
- Repo. for RLCF.☆15Apr 1, 2024Updated last year
- This repo contains the core codes for the paper "Deep Reinforcement Learning for Cost-Effective Medical Diagnosis".☆13Apr 7, 2023Updated 2 years ago
- ☆13Feb 27, 2024Updated 2 years ago
- ☆18Aug 21, 2023Updated 2 years ago
- A link prediction algorithm tailored to flow-driven spatial networks. Paper accepted @ WACV24.☆22Jan 18, 2024Updated 2 years ago
- ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements [CVPRW 2025]☆24Jan 31, 2026Updated last month
- Companion repository to "Prompt Compression and Contrastive Conditioning for Controllability and Toxicity Reduction in Language Models"☆14May 31, 2023Updated 2 years ago
- Developing Generalist Foundation Models from a Multimodal Dataset for 3D Computed Tomography☆99Oct 15, 2024Updated last year
- 🔥 open-ss2: a third-party open-source implementation of Figure AI's Helix "System 1, System 2" VLA model for high-rate, dexterous humano…☆11Mar 18, 2025Updated last year
- imagetokenizer is a python package, helps you encoder visuals and generate visuals token ids from codebook, supports both image and video…☆41Jun 22, 2024Updated last year
- ☆11Dec 23, 2025Updated 3 months ago
- Python package for generating datasets to evaluate reasoning and retrieval of large language models☆20Feb 23, 2026Updated last month
- [NeurIPS 2024] DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model☆86Dec 5, 2024Updated last year
- Convert Standard M2 format to parallel sentences.☆22Jun 20, 2020Updated 5 years ago
- Unofficial baselines for ManiSkill, including RL and BC algorithms.☆18Jun 6, 2024Updated last year
- LLaVA combines with Magvit Image tokenizer, training MLLM without an Vision Encoder. Unifying image understanding and generation.☆39Jun 20, 2024Updated last year
- Work done on my master thesis.☆22Mar 21, 2017Updated 9 years ago
- Research without Re-search: Maximal Update Parametrization Yields Accurate Loss Prediction across Scales☆32Jul 17, 2023Updated 2 years ago
- [AAAI26] LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs☆55Dec 7, 2025Updated 3 months ago
- Machine learning models for multi-organ, multi-disease prediction in chest CT volumes. From paper Draelos et al. "Machine-Learning-Based …☆36Dec 8, 2022Updated 3 years ago
- ☆58May 7, 2025Updated 10 months ago
- CLIP (Contrastive Language-Image Pre-Training) in tensorflow☆12Aug 1, 2022Updated 3 years ago
- Gearbox Assembly using Galaxea R1 - Simulation Platform based on Issac Lab☆28Dec 24, 2025Updated 2 months ago
- [ICCV 2025] AbdomenAtlas 3.0 (9,262 CT volumes + medical reports). These “superhuman” reports are more accurate, detailed, standardized, …☆203Dec 31, 2025Updated 2 months ago
- Official repo for From Intention to Execution: Probing the Generalization Boundaries of Vision-Language-Action Models☆32Nov 2, 2025Updated 4 months ago
- 🩻 Whole-body CT segmentation made simple: 22,022 scans, 167 structures, one open solution.☆71Jan 27, 2026Updated last month
- auto sign cursor☆20Feb 18, 2025Updated last year
- ECCV 2024 & GenerateCT: Text-Conditional Generation of 3D Chest CT Volumes☆188Jul 3, 2024Updated last year
- [ICLR 2026] Official implemetation of the paper "Policy Contrastive Decoding for Robotic Foundation Models"☆26Mar 5, 2026Updated 2 weeks ago
- REALM: A Real-to-Sim Validated Benchmark for Generalization in Robotic Manipulation☆50Updated this week
- CMIVQA☆18Jun 3, 2024Updated last year
- ☆40Mar 1, 2022Updated 4 years ago