☆39Feb 27, 2026Updated this week
Alternatives and similar repositories for thinkomni
Users that are interested in thinkomni are comparing it to the libraries listed below
Sorting:
- Official code repository of Shuffle-R1☆25Feb 23, 2026Updated last week
- [ICCV 23] A Simple Vision Transformer for Weakly Semi-supervised 3D Object Detection☆12Apr 12, 2024Updated last year
- [CVPR 2025] A Unified Image-Dense Annotation Generation Model for Underwater Scenes☆54Apr 9, 2025Updated 10 months ago
- Awesome GPT-4 with Applications. This is a collection of resources related to GPT-4, including news, official documents, demo and applica…☆20Mar 15, 2023Updated 2 years ago
- the official code of DriveMonkey☆43May 24, 2025Updated 9 months ago
- ☆31Jun 14, 2024Updated last year
- The Chongqing University Bituminous Pavement Disease Detection Dataset (CQU-BPDD)☆13Apr 17, 2025Updated 10 months ago
- Vision Transformer (ViT) models, with their attention mechanisms, revolutionized computer vision. By merging Class Activation Map (CAM) a…☆13Aug 14, 2023Updated 2 years ago
- LLaVA-VLA: A Simple Yet Powerful Vision-Language-Action Model [Actively Maintained🔥]☆178Oct 29, 2025Updated 4 months ago
- The code for the paper "Towards Compact 3D Representations via Point Feature Enhancement Masked Autoencoders" (AAAI'24).☆37Dec 26, 2023Updated 2 years ago
- ☆12Mar 5, 2024Updated last year
- ☆14May 20, 2025Updated 9 months ago
- [🔥ACM MM2025] EchoMask: Speech-Queried Attention-based Mask Modeling for Holistic Co-Speech Motion Generation☆23Dec 30, 2025Updated 2 months ago
- [ICLR 2025] This repo is the official implementation of "The Labyrinth of Links: Navigating the Associative Maze of Multi-modal LLMs".☆13Jan 25, 2025Updated last year
- Interpreting CLIP with Hierarchical Sparse Autoencoders (ICML 2025)☆20Jan 17, 2026Updated last month
- ☆20Nov 21, 2025Updated 3 months ago
- dMel: Speech Tokenization Made Simple☆16May 13, 2025Updated 9 months ago
- ☆13Jul 28, 2024Updated last year
- A much powerful probing method to tune your model with promising performance and linear probing training cost!☆15Jul 26, 2023Updated 2 years ago
- [ECCV 2024] Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression☆50Sep 21, 2024Updated last year
- [ICCV 2025] HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation☆234Jul 14, 2025Updated 7 months ago
- Fine-grained Figure Skating dataset (FineFS) involves RGB videos and estimated skeleton data, providing rich annotations for multiple dow…☆18Sep 15, 2024Updated last year
- Enhancing True Correspondence Discrimination through Relation Consistency for Robust Noisy Correspondence Learning (CVPR 2025, pytorch co…☆14Sep 29, 2025Updated 5 months ago
- Official pytorch implementation of "RITUAL: Random Image Transformations as a Universal Anti-hallucination Lever in Large Vision Language…☆14Dec 16, 2024Updated last year
- ☆14Oct 17, 2023Updated 2 years ago
- A Decade of Action Quality Assessment: Largest Systematic Survey of Trends, Challenges, and Future Directions☆15Jan 22, 2026Updated last month
- ☆13Oct 9, 2024Updated last year
- [AAAI 2026] This repository is the official implementation of "ReAlign: Text-to-Motion Generation via Step-Aware Reward-Guided Alignment"…☆27Feb 12, 2026Updated 2 weeks ago
- [NCA] Official implementation of the paper Motion2Language, Unsupervised learning of synchronized semantic motion segmentation☆13Sep 9, 2024Updated last year
- A modular implementation of product of experts VAEs for multimodal data☆13Nov 15, 2021Updated 4 years ago
- ☆15Jan 22, 2024Updated 2 years ago
- ☆13Nov 20, 2023Updated 2 years ago
- [ICCV 2025] LIRA