TtuHamg / TextToucher
Official PyTorch implementation for "TextToucher: Fine-Grained Text-to-Touch Generation" (AAAI 2025)
☆17 · Updated 8 months ago
Alternatives and similar repositories for TextToucher
Users interested in TextToucher are comparing it to the repositories listed below.
- [CVPR 2024] Binding Touch to Everything: Learning Unified Multimodal Tactile Representations ☆73 · Updated last month
- ☆128 · Updated 2 weeks ago
- Official implementation of the paper "InSpire: Vision-Language-Action Models with Intrinsic Spatial Reasoning" ☆47 · Updated last month
- An example RLDS dataset builder for X-embodiment dataset conversion. ☆55 · Updated 10 months ago
- [ICML 2024] A Touch, Vision, and Language Dataset for Multimodal Alignment ☆91 · Updated 7 months ago
- [ICML 2025] OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction ☆112 · Updated 8 months ago
- A collection of papers/projects that train flow matching models/policies via RL. ☆342 · Updated 2 weeks ago
- [ICCV 2025] AnyBimanual: Transferring Unimanual Policy for General Bimanual Manipulation ☆93 · Updated 6 months ago
- An unofficial PyTorch dataloader for Open X-Embodiment Datasets https://github.com/google-deepmind/open_x_embodiment ☆23 · Updated last year
- Official PyTorch Implementation of Learning Affordance Grounding from Exocentric Images, CVPR 2022 ☆71 · Updated last year
- [ICLR 2025] LAPA: Latent Action Pretraining from Videos ☆434 · Updated 11 months ago
- [AAAI 2026] Official code for MoLe-VLA: Dynamic Layer-skipping Vision Language Action Model via Mixture-of-Layers for Efficient Robot Man… ☆60 · Updated 5 months ago
- [NeurIPS 24] Incremental Learning of Retrievable Skills For Efficient Continual Task Adaptation ☆20 · Updated 3 months ago
- ICCV 2025 ☆145 · Updated last month
- Official repo for AGNOSTOS, a cross-task manipulation benchmark, and X-ICM, a cross-task in-context manipulation (VLA) method ☆53 · Updated last month
- [NeurIPS 2025] VLA-Cache: Towards Efficient Vision-Language-Action Model via Adaptive Token Caching in Robotic Manipulation ☆58 · Updated 3 months ago
- Official repository of Learning to Act from Actionless Videos through Dense Correspondences. ☆245 · Updated last year
- [ICCV 2025 Oral] Latent Motion Token as the Bridging Language for Learning Robot Manipulation from Videos ☆158 · Updated 3 months ago
- [ICLR 2024] Seer: Language Instructed Video Prediction with Latent Diffusion Models ☆34 · Updated last year
- Official repo of VLABench, a large-scale benchmark designed for fairly evaluating VLAs, Embodied Agents, and VLMs. ☆363 · Updated 2 months ago
- GRAPE: Guided-Reinforced Vision-Language-Action Preference Optimization ☆154 · Updated 9 months ago
- Official repository for "iVideoGPT: Interactive VideoGPTs are Scalable World Models" (NeurIPS 2024), https://arxiv.org/abs/2405.15223 ☆162 · Updated 3 months ago
- ☆12 · Updated 8 months ago
- ☆423 · Updated 3 weeks ago
- Official implementation of Touch100k: A Large-Scale Touch-Language-Vision Dataset for Touch-Centric Multimodal Representation ☆31 · Updated last year
- Code of "MemoryVLA: Perceptual-Cognitive Memory in Vision-Language-Action Models for Robotic Manipulation" ☆114 · Updated last month
- [RA-L 2025] Motion Before Action: Diffusing Object Motion as Manipulation Condition ☆67 · Updated 2 months ago
- Official PyTorch Implementation of Unified Video Action Model (RSS 2025) ☆308 · Updated 5 months ago
- Preview code of ECCV'24 paper "Distill Gold from Massive Ores" (BiLP) ☆25 · Updated last year
- https://arxiv.org/pdf/2506.06677 ☆43 · Updated 2 months ago