huiwon-jang/CoordTok

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/huiwon-jang/CoordTok)

huiwon-jang / CoordTok

☆38

Alternatives and similar repositories for CoordTok

Users that are interested in CoordTok are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

danijar / teleport
View on GitHub
Efficiently send large arrays across machines
☆15Jul 24, 2024Updated 2 years ago
LargeWorldModel / ElasticTok
View on GitHub
ElasticTok: Adaptive Tokenization for Image and Video
☆93Nov 4, 2024Updated last year
kingdy2002 / VCSE
View on GitHub
☆18Jun 8, 2023Updated 3 years ago
minnesotanlp / infoVerse
View on GitHub
Jaehyung Kim et al's ACL 2023 paper on "infoVerse: A Universal Framework for Dataset Characterization with Multidimensional Meta-informat…
☆16Jun 28, 2023Updated 3 years ago
eugeneteoh / greenaug
View on GitHub
GreenAug: Green Screen Augmentation Enables Scene Generalisation in Robotic Manipulation
☆13Sep 10, 2024Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
davidbrandfonbrener / imitation_pretraining
View on GitHub
☆20May 30, 2023Updated 3 years ago
sihyun-yu / RoMA
View on GitHub
[NeurIPS'21] RoMA: Robust Model Adaptation for Offline Model-based Optimization
☆15Oct 28, 2021Updated 4 years ago
minjoong507 / Consistency-of-Video-LLM
View on GitHub
[CVPR 2025] Official Repository of the paper "On the Consistency of Video Large Language Models in Temporal Comprehension"
☆16Oct 13, 2025Updated 9 months ago
frankenliu / LOAE
View on GitHub
☆10Sep 25, 2024Updated last year
hywang66 / LARP
View on GitHub
Official Pytorch implementation for LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior (ICLR 2025 Oral).
☆107Feb 11, 2025Updated last year
NVlabs / CMD
View on GitHub
[ICLR'24] Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition
☆54May 14, 2024Updated 2 years ago
mazpie / redundancy-action-spaces
View on GitHub
[RA-L 2024] Novel action spaces leveraging redundancy in 7 DoF arms enable efficient & precise learning in robotic manipulation
☆23Jun 6, 2024Updated 2 years ago
alibaba-mmai-research / HiCo
View on GitHub
CVPR2022:Learning from Untrimmed Videos: Self-Supervised Video Representation Learning with Hierarchical Consistency
☆18Aug 10, 2022Updated 3 years ago
alinlab / MetaMAE
View on GitHub
Modality-Agnostic Self-Supervised Learning with Meta-Learned Masked Auto-Encoder (NeurIPS 2023)
☆10Jun 5, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
younggyoseo / CQN
View on GitHub
Coarse-to-fine Q-Network
☆59Aug 6, 2024Updated last year
younggyoseo / RE3
View on GitHub
RE3: State Entropy Maximization with Random Encoders for Efficient Exploration
☆69Jul 29, 2021Updated 5 years ago
SCZwangxiao / video-ReTaKe
View on GitHub
Official implementation of paper ReTaKe: Reducing Temporal and Knowledge Redundancy for Long Video Understanding
☆40Mar 16, 2025Updated last year
g-luo / vlm_cross_modal_reps
View on GitHub
Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025
☆34May 1, 2025Updated last year
younggyoseo / MV-MWM
View on GitHub
☆61Apr 16, 2023Updated 3 years ago
sungsoo-ahn / learning_what_to_defer
View on GitHub
☆24Dec 4, 2020Updated 5 years ago
TIGER-AI-Lab / VISTA
View on GitHub
The code for "VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by VIdeo SpatioTemporal Augmentation" [CVPR2025]
☆20Feb 27, 2025Updated last year
NVlabs / TokenBench
View on GitHub
A Video Tokenizer Evaluation Dataset
☆158Jan 13, 2025Updated last year
happyhappy-jun / writing-driven-autoresearch
View on GitHub
Multi-agent harness + complete run record of the 1st-place entry at Ralphthon@ICML2026 — three AI agents wrote a workshop paper in 3 hour…
☆17Jul 14, 2026Updated 2 weeks ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
jihoontack / GradNCP
View on GitHub
Learning Large-scale Neural Fields via Context Pruned Meta-Learning (NeurIPS 2023)
☆28Sep 24, 2023Updated 2 years ago
NAVER-INTEL-Co-Lab / gaudi-lavcap
View on GitHub
☆15Jan 24, 2025Updated last year
techmonsterwang / iLLaMA
View on GitHub
Adapting LLaMA Decoder to Vision Transformer
☆30May 20, 2024Updated 2 years ago
Roblox / SmoothCache
View on GitHub
Implementation of SmoothCache, a project aimed at speeding-up Diffusion Transformer (DiT) based GenAI models with error-guided caching.
☆48Jul 17, 2025Updated last year
thunlp / ACDiT
View on GitHub
ACDiT: Interpolating Autoregressive Conditional Modeling and Diffusion Transformer
☆42Jan 29, 2026Updated 5 months ago
minhoooo1 / CatMAE
View on GitHub
CatMAE
☆15Dec 13, 2023Updated 2 years ago
TencentARC / Divot
View on GitHub
Diffusion Powers Video Tokenizer for Comprehension and Generation (CVPR 2025)
☆87Feb 27, 2025Updated last year
NVlabs / FRAG
View on GitHub
☆15Apr 25, 2025Updated last year
jiaangli / VILA
View on GitHub
[TACL/EMNLP'24] Do Vision and Language Models Share Concepts? A Vector Space Alignment Study
☆16Nov 22, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
wilson1yan / teco
View on GitHub
☆132Feb 22, 2025Updated last year
choi403 / ALG
View on GitHub
Improving Motion in Image-to-Video Models via Adaptive Low-Pass Guidance (CVPR 2026 Highlight)
☆59Feb 23, 2026Updated 5 months ago
songweige / content-debiased-fvd
View on GitHub
[CVPR 2024] On the Content Bias in Fréchet Video Distance
☆148Sep 28, 2024Updated last year
elicit / fave-dataset
View on GitHub
Paper dataset for "Factored Verification: Detecting and Reducing Hallucination in Summaries of Academic Papers"
☆14Oct 20, 2024Updated last year
bytedance / 1d-tokenizer
View on GitHub
This repo contains the code for 1D tokenizer and generator
☆1,168Mar 20, 2025Updated last year
facebookarchive / NACS
View on GitHub
Jump to better conclusions: SCAN both left and right
☆11Jan 24, 2019Updated 7 years ago
csmile-1006 / REDS_agent
View on GitHub
Subtask-Aware Visual Reward Learning from Segmented Demonstrations (ICLR 2025 accepted)
☆19Apr 11, 2025Updated last year