[WIP] Code for LangToMo
☆21Mar 19, 2026Updated 3 months ago
Alternatives and similar repositories for LangToMo
Users that are interested in LangToMo are comparing it to the libraries listed below.
- [Main Conference @ EACL'26] [Workshop @ NeurIPS'24] 🎞️ LVNet.☆44Feb 10, 2026Updated 4 months ago
- 🤖 [ICLR'25] Multimodal Video Understanding Framework (MVU)☆56Jan 31, 2025Updated last year
- [ECCV 2024] Official Implementation of CoPT: Unsupervised Domain Adaptive Segmentation using Domain-Agnostic Text Embeddings☆11Feb 24, 2025Updated last year
- An unofficial pytorch dataloader for Open X-Embodiment Datasets https://github.com/google-deepmind/open_x_embodiment☆25Jan 9, 2025Updated last year
- Perceptual Grouping in Contrastive Vision-Language Models (ICCV'23)☆37Jan 1, 2024Updated 2 years ago
- ☆14Jun 25, 2022Updated 4 years ago
- Code for our ACL 2025 paper "Language Repository for Long Video Understanding"☆36Jun 17, 2024Updated 2 years ago
- This is a python library. Install with "python3 -m pip install rp" then run with "python3 -m rp" or just "rp". Requires python≥3.5☆13Jun 3, 2026Updated last month
- Code for the paper Seeing the Pose in the Pixels: Learning Pose-Aware Representations in Vision Transformers☆21Aug 2, 2024Updated last year
- Code for NeurIPS 2022 paper "Learning Viewpoint-Agnostic Visual Representations by Recovering Tokens in 3D Space"☆20Apr 20, 2023Updated 3 years ago
- (CVPR 2026) Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation☆36Feb 28, 2026Updated 4 months ago
- Code for LifelongMemory: Leveraging LLMs for Answering Queries in Long-form Egocentric Videos☆31Oct 27, 2025Updated 8 months ago
- Environments for Active Vision Reinforcement Learning☆30Oct 10, 2024Updated last year
- This repository contains the implementation for our work "TopoDiffusionNet: A Topology-aware Diffusion Model", accepted to ICLR 2025.☆27Apr 17, 2025Updated last year
- AC-DiT: Adaptive Coordination Diffusion Transformer for Mobile Manipulation☆49Feb 23, 2026Updated 4 months ago
- Video + CLIP Baseline for Ego4D Long Term Action Anticipation Challenge (CVPR 2022)☆15Jul 4, 2022Updated 4 years ago
- Official repo for paper "HiMoE-VLA: Hierarchical Mixture-of-Experts for Generalist Vision-Language-Action Policies"☆33Dec 12, 2025Updated 6 months ago
- Official PyTorch implementation for NeurIPS 2024 paper: Prediction with Action.☆53Jan 4, 2025Updated last year
- ☆39Jun 2, 2026Updated last month
- Official repository for "Self-Supervised Video Transformer" (CVPR'22)☆109Jun 26, 2024Updated 2 years ago
- [ICRA'24] Crossway Diffusion: Improving Diffusion-based Visuomotor Policy via Self-supervised Learning☆71Aug 4, 2024Updated last year
- Public implementation of Video2Act: A Dual-System Video Diffusion Policy with Robotic Spatio-Motional Modeling☆31Jun 24, 2026Updated last week
- [ECCV'24] MaxFusion: Plug & Play multimodal generation in text to image diffusion models☆27Nov 2, 2024Updated last year
- HD-EPIC Python script to download the entire datasets or parts of it☆22Oct 7, 2025Updated 8 months ago
- Pi0-VLA Repository of "MotionTrans: Human VR Data Enable Motion-Level Learning for Robotic Manipulation Policies"☆27Mar 9, 2026Updated 3 months ago
- [NeurIPS 2025] Code for BEAST Experiments on CALVIN and LIBERO.☆39Jan 8, 2026Updated 5 months ago
- The official repo for "Play to the Score: Stage-Guided Dynamic Multi-Sensory Fusion for Robotic Manipulation", CoRL 2024 (ORAL)☆22Jun 25, 2025Updated last year
- Converts MimicGen dataset into LeRobot format, to train and evaluate the ACT, BC, and diffusion policies☆25Nov 19, 2024Updated last year
- The official implementation of “MonoArt: Progressive Structural Reasoning for Monocular Articulated 3D Reconstruction”☆64Mar 20, 2026Updated 3 months ago
- Official PyTorch implementation for ICML 2025 paper: UP-VLA.☆60Jan 20, 2026Updated 5 months ago
- [ICLR 2026] Official implementation of the paper "Policy Contrastive Decoding for Robotic Foundation Models"☆28Mar 5, 2026Updated 3 months ago
- [RSS'26] HoMMI: Learning Whole-Body Mobile Manipulation from Human Demonstrations☆135Jun 15, 2026Updated 2 weeks ago
- ☆24Feb 23, 2025Updated last year
- [ECCV2022] [T-PAMI] StARformer: Transformer with State-Action-Reward Representations.☆97May 21, 2023Updated 3 years ago
- CoRL25-"AimBot: A Simple Auxiliary Visual Cue to Enhance Spatial Awareness of Visuomotor Policies"☆49Aug 15, 2025Updated 10 months ago
- Code for the paper Predicting Point Tracks from Internet Videos enables Diverse Zero-Shot Manipulation☆104Jul 31, 2024Updated last year
- QuickSplat: Fast 3D Surface Reconstruction via Learned Gaussian Initialization☆23Nov 11, 2025Updated 7 months ago
- Official Code for SGRv2 and SGR.☆33May 20, 2025Updated last year
- DTact: A Vision-Based Tactile Sensor that Measures High-Resolution 3D Geometry Directly from Darkness (ICRA'23)☆21Aug 29, 2023Updated 2 years ago