[WIP] Code for LangToMo
β20Mar 19, 2026Updated 3 weeks ago
Alternatives and similar repositories for LangToMo
Users that are interested in LangToMo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [Main Conference @ EACL'26] [Workshop @ NeurIPS'24] ποΈ LVNet.β43Feb 10, 2026Updated 2 months ago
- π€ [ICLR'25] Multimodal Video Understanding Framework (MVU)β56Jan 31, 2025Updated last year
- [ECCV 2024] Official Implementation of CoPT: Unsupervised Domain Adaptive Segmentation using Domain-Agnostic Text Embeddingsβ11Feb 24, 2025Updated last year
- An unofficial pytorch dataloader for Open X-Embodiment Datasets https://github.com/google-deepmind/open_x_embodimentβ25Jan 9, 2025Updated last year
- Perceptual Grouping in Contrastive Vision-Language Models (ICCV'23)β37Jan 1, 2024Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available β’ AdRun AI, ML, and HPC workloads on powerful cloud GPUsβwithout limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- β14Jun 25, 2022Updated 3 years ago
- β18Dec 17, 2022Updated 3 years ago
- This is a python library. Install with "python3 -m pip install rp" then run with "python3 -m rp" or just "rp". Requires pythonβ₯3.5β13Mar 17, 2026Updated 3 weeks ago
- Code for NeurIPS 2022 paper "Learning Viewpoint-Agnostic Visual Representations by Recovering Tokens in 3D Space"β20Apr 20, 2023Updated 2 years ago
- This repository contains the implementation for our work "TopoDiffusionNet: A Topology-aware Diffusion Model", accepted to ICLR 2025.β22Apr 17, 2025Updated 11 months ago
- Code for LifelongMemory: Leveraging LLMs for Answering Queries in Long-form Egocentric Videosβ28Oct 27, 2025Updated 5 months ago
- Environments for Active Vision Reinforcement Learningβ29Oct 10, 2024Updated last year
- AC-DiT: Adaptive Coordination Diffusion Transformer for Mobile Manipulationβ39Feb 23, 2026Updated last month
- Official repo for paper "HiMoE-VLA: Hierarchical Mixture-of-Experts for Generalist Vision-Language-Action Policies"β28Dec 12, 2025Updated 4 months ago
- Serverless GPU API endpoints on Runpod - Bonus Credits β’ AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Official PyTorch implementation for NeurIPS 2024 paper: Prediction with Action.β50Jan 4, 2025Updated last year
- Public implementation of Video2Act: A Dual-System Video Diffusion Policy with Robotic Spatio-Motional Modelingβ29Dec 3, 2025Updated 4 months ago
- β34Dec 18, 2025Updated 3 months ago
- Official repository for "Self-Supervised Video Transformer" (CVPR'22)β108Jun 26, 2024Updated last year
- [AAAI 2026 Oral] SemanticVLA: Semantic-Aligned Sparsification and Enhancement for Efficient Robotic Manipulationβ39Apr 5, 2026Updated last week
- [ECCV'24] MaxFusion: Plug & Play multimodal generation in text to image diffusion modelsβ27Nov 2, 2024Updated last year
- HD-EPIC Python script to download the entire datasets or parts of itβ19Oct 7, 2025Updated 6 months ago
- Pi0-VLA Repository of "MotionTrans: Human VR Data Enable Motion-Level Learning for Robotic Manipulation Policies"β27Mar 9, 2026Updated last month
- Code for our CVPR 2021 paper "Coarse-Fine Networks for Temporal Activity Detection in Videos"β57Oct 10, 2021Updated 4 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits β’ AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Converts MimicGen dataset into LeRobot format, to train and evaluate the ACT, BC, and diffusion policiesβ24Nov 19, 2024Updated last year
- Code for NeurIPS 2023 paper "Active Vision Reinforcement Learning with Limited Visual Observability"β54Oct 10, 2024Updated last year
- Official PyTorch implementation for ICML 2025 paper: UP-VLA.β58Jan 20, 2026Updated 2 months ago
- [ICLR 2026] Official implemetation of the paper "Policy Contrastive Decoding for Robotic Foundation Models"β26Mar 5, 2026Updated last month
- Official Repository of "Fibottention: Inceptive Visual Representation Learning with Diverse Attention Across Heads"β17Oct 6, 2025Updated 6 months ago
- β21Feb 23, 2025Updated last year
- CoRL25-"AimBot: A Simple Auxiliary Visual Cue to Enhance Spatial Awareness of Visuomotor Policies"β45Aug 15, 2025Updated 7 months ago
- [ECCV2022] [T-PAMI] StARformer: Transformer with State-Action-Reward Representations.β96May 21, 2023Updated 2 years ago
- code for the paper Predicting Point Tracks from Internet Videos enables Diverse Zero-Shot Manipulationβ102Jul 31, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Official Code for SGRv2 and SGR.β33May 20, 2025Updated 10 months ago
- β20Mar 10, 2025Updated last year
- DTact: A Vision-Based Tactile Sensor that Measures High-Resolution 3D Geometry Directly from Darkness (ICRA'23)β20Aug 29, 2023Updated 2 years ago
- β278Mar 17, 2024Updated 2 years ago
- ROS wrapper of Nvidia Contact-graspnet model.β18Jul 3, 2023Updated 2 years ago
- [CoRL 2025] Robot Learning from Any Imagesβ34Nov 11, 2025Updated 5 months ago
- [CVPR 2025]Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulationβ181Jun 20, 2025Updated 9 months ago