kahnchana/LangToMo

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/kahnchana/LangToMo)

kahnchana / LangToMo

[WIP] Code for LangToMo

☆21

Alternatives and similar repositories for LangToMo

Users that are interested in LangToMo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

kahnchana / mvu
View on GitHub
🤖 [ICLR'25] Multimodal Video Understanding Framework (MVU)
☆58Jan 31, 2025Updated last year
cfmata / CoPT
View on GitHub
[ECCV 2024] Official Implementation of CoPT: Unsupervised Domain Adaptive Segmentation using Domain-Agnostic Text Embeddings
☆10Feb 24, 2025Updated last year
jongwoopark7978 / LVNet
View on GitHub
[Main Conference @ EACL'26] [Workshop @ NeurIPS'24] 🎞️ LVNet.
☆44Feb 10, 2026Updated 5 months ago
LostXine / open_x_pytorch_dataloader
View on GitHub
An unofficial pytorch dataloader for Open X-Embodiment Datasets https://github.com/google-deepmind/open_x_embodiment
☆25Jan 9, 2025Updated last year
kahnchana / clippy
View on GitHub
Perceptual Grouping in Contrastive Vision-Language Models (ICCV'23)
☆37Jan 1, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
SalesforceAIResearch / FOFPred
View on GitHub
☆39Jun 2, 2026Updated last month
motional / motional-prediction-devkit
View on GitHub
☆18Dec 17, 2022Updated 3 years ago
TritonPaper / TRITON
View on GitHub
☆14Jun 25, 2022Updated 4 years ago
kkahatapitiya / LangRepo
View on GitHub
Code for our ACL 2025 paper "Language Repository for Long Video Understanding"
☆36Jun 17, 2024Updated 2 years ago
RyannDaGreat / rp
View on GitHub
This is a python library. Install with "python3 -m pip install rp" then run with "python3 -m rp" or just "rp". Requires python≥3.5
☆13Jul 13, 2026Updated last week
dominickrei / PoseAwareVT
View on GitHub
Code for the paper Seeing the Pose in the Pixels: Learning Pose-Aware Representations in Vision Transformers
☆21Aug 2, 2024Updated last year
elicassion / 3DTRL
View on GitHub
Code for NeurIPS 2022 paper "Learning Viewpoint-Agnostic Visual Representations by Recovering Tokens in 3D Space"
☆20Apr 20, 2023Updated 3 years ago
agentic-learning-ai-lab / lifelong-memory
View on GitHub
Code for LifelongMemory: Leveraging LLMs for Answering Queries in Long-form Egocentric Videos
☆33Oct 27, 2025Updated 8 months ago
srijandas07 / clip_baseline_LTA_Ego4d
View on GitHub
Video + CLIP Baseline for Ego4D Long Term Action Anticipation Challenge (CVPR 2022)
☆15Jul 4, 2022Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
elicassion / active-gym
View on GitHub
Environments for Active Vision Reinforcement Learning
☆30Oct 10, 2024Updated last year
kahnchana / svt
View on GitHub
Official repository for "Self-Supervised Video Transformer" (CVPR'22)
☆109Jun 26, 2024Updated 2 years ago
LostXine / crossway_diffusion
View on GitHub
[ICRA'24] Crossway Diffusion: Improving Diffusion-based Visuomotor Policy via Self-supervised Learning
☆72Aug 4, 2024Updated last year
Charlotte-CharMLab / Fibottention
View on GitHub
Official Repository of "Fibottention: Inceptive Visual Representation Learning with Diverse Attention Across Heads"
☆17Oct 6, 2025Updated 9 months ago
jiayueru / Video2Act
View on GitHub
Public implementation of Video2Act: A Dual-System Video Diffusion Policy with Robotic Spatio-Motional Modeling
☆31Jun 24, 2026Updated last month
ShuangLI59 / unified_video_action
View on GitHub
Official PyTorch Implementation of Unified Video Action Model (RSS 2025)
☆400Jul 23, 2025Updated last year
PKU-HMI-Lab / AC-DiT
View on GitHub
AC-DiT: Adaptive Coordination Diffusion Transformer for Mobile Manipulation
☆48Feb 23, 2026Updated 5 months ago
kkahatapitiya / Coarse-Fine-Networks
View on GitHub
Code for our CVPR 2021 paper "Coarse-Fine Networks for Temporal Activity Detection in Videos"
☆57Oct 10, 2021Updated 4 years ago
teee000 / ABPolicy-code
View on GitHub
ICRA2026: ABPolicy Asynchronous B-Spline Flow Policy for Real-Time and Smooth Robotic Manipulation
☆27Apr 22, 2026Updated 3 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
Cognition2Action-Lab / VLA-TMEE
View on GitHub
Reshaping Action Error Distributions for Reliable Vision-Language-Action Models
☆17Feb 5, 2026Updated 5 months ago
wentaoyuan / RoboPoint
View on GitHub
A Vision-Language Model for Spatial Affordance Prediction in Robotics
☆227Jul 17, 2025Updated last year
zwbx / Chain-of-Action
View on GitHub
☆18Jul 8, 2025Updated last year
world-action-verifier / wav_minigrid
View on GitHub
☆24Jul 11, 2026Updated last week
intuitive-robots / beast_calvin
View on GitHub
[NeurIPS 2025] Code for BEAST Experiments on CALVIN and LIBERO.
☆40Jan 8, 2026Updated 6 months ago
MaxDu17 / DynaGuide
View on GitHub
Repository of the DynaGuide project
☆67Dec 10, 2025Updated 7 months ago
kywch / mg2hfbot
View on GitHub
Converts MimicGen dataset into LeRobot format, to train and evaluate the ACT, BC, and diffusion policies
☆25Nov 19, 2024Updated last year
yiming-j / SPLINE-Net
View on GitHub
SPLINE-Net: Sparse Photometric Stereo through Lighting Interpolation and Normal Estimation Networks
☆11Apr 13, 2023Updated 3 years ago
GeWu-Lab / MS-Bot
View on GitHub
The offical repo for "Play to the Score: Stage-Guided Dynamic Multi-Sensory Fusion for Robotic Manipulation", CoRL 2024 (ORAL)
☆22Jun 25, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
elicassion / StARformer
View on GitHub
[ECCV2022] [T-PAMI] StARformer: Transformer with State-Action-Reward Representations.
☆97May 21, 2023Updated 3 years ago
LukeLIN-web / vote
View on GitHub
Vision-Language-Action Optimization with Trajectory Ensemble Voting (ICANN2026)
☆26Feb 18, 2026Updated 5 months ago
Robert-gyj / Prediction_with_Action
View on GitHub
Official PyTorch implementation for NeurIPS 2024 paper: Prediction with Action.
☆55Jan 4, 2025Updated last year
yfujimura / WildSplatter
View on GitHub
This is the official implementation of "WildSplatter: Feed-forward 3D Gaussian Splatting with Appearance Control from Unconstrained Image…
☆24Jun 29, 2026Updated 3 weeks ago
ADL-X / LLAVIDAL
View on GitHub
This is the offical repository of LLAVIDAL
☆25Oct 4, 2025Updated 9 months ago
rai-opensource / theia
View on GitHub
Theia: Distilling Diverse Vision Foundation Models for Robot Learning
☆277Nov 6, 2025Updated 8 months ago
TongZhangTHU / sgr
View on GitHub
Official Code for SGRv2 and SGR.
☆33May 20, 2025Updated last year