☆29Jul 25, 2025Updated 7 months ago
Alternatives and similar repositories for BIMBA
Users who are interested in BIMBA are comparing it to the repositories listed below
- ☆27Jul 18, 2025Updated 7 months ago
- ☆30Mar 2, 2023Updated 3 years ago
- awesome-semantic-segmentation - list of awesome things around semantic segmentation☆21Apr 28, 2022Updated 3 years ago
- official repo for paper "[CLS] Token Tells Everything Needed for Training-free Efficient MLLMs"☆22Apr 23, 2025Updated 10 months ago
- MR. Video: MapReduce is the Principle for Long Video Understanding☆31Apr 23, 2025Updated 10 months ago
- [CVPR 2025] Adaptive Keyframe Sampling for Long Video Understanding☆181Dec 19, 2025Updated 2 months ago
- The official PyTorch implementation of the IEEE/CVF Computer Vision and Pattern Recognition (CVPR) '24 paper PREGO: online mistake detect…☆31Jun 9, 2025Updated 8 months ago
- [CVPR 2023] Code for action prediction from videos☆25Mar 8, 2024Updated last year
- ☆28Feb 10, 2025Updated last year
- OmniZip: Audio-Guided Dynamic Token Compression for Fast Omnimodal Large Language Models☆56Feb 1, 2026Updated last month
- Planning for Success: Exploring LLM Long-term Planning Capabilities in Table Understanding☆17Jun 17, 2025Updated 8 months ago
- This is the official implementation of ReVisionLLM: Recursive Vision-Language Model for Temporal Grounding in Hour-Long Videos☆43Nov 5, 2025Updated 4 months ago
- [AAAI 2026] Global Compression Commander: Plug-and-Play Inference Acceleration for High-Resolution Large Vision-Language Models☆38Jan 27, 2026Updated last month
- A lightweight flexible Video-MLLM developed by TencentQQ Multimedia Research Team.☆74Oct 14, 2024Updated last year
- [ICCV 2025] Multi-Granular Spatio-Temporal Token Merging for Training-Free Acceleration of Video LLMs☆57Feb 2, 2026Updated last month
- ☆38Jul 24, 2023Updated 2 years ago
- Official implementation of paper ReTaKe: Reducing Temporal and Knowledge Redundancy for Long Video Understanding☆40Mar 16, 2025Updated 11 months ago
- TIPS (ICLR'25): Text-Image Pretraining with Spatial Awareness☆118Apr 9, 2025Updated 10 months ago
- ☆37Sep 16, 2024Updated last year
- [ICCV 2025 Oral] Official implementation of Learning Streaming Video Representation via Multitask Training.☆84Dec 24, 2025Updated 2 months ago
- [3DV 2020] PeeledHuman: Robust Shape Representation for Textured 3D Human Body Reconstruction☆41May 21, 2021Updated 4 years ago
- ☆13Jul 3, 2024Updated last year
- Implementation for "StyleGAN-Canvas: Augmenting StyleGAN3 for Real-Time Human-AI Co-Creation"☆12May 24, 2023Updated 2 years ago
- ☆13Jul 20, 2023Updated 2 years ago
- Communication Relay by creating a WiFi Mesh Network using ROS, and using that network for Data Telemetry, with Telemetry radios ( Ubiquit…☆11Dec 18, 2018Updated 7 years ago
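- Chinese edition of the comprehensive Python resource list, covering web frameworks, web crawlers, web content extraction, template engines, databases, data visualization, image processing, text processing, natural language processing, machine learning, logging, code analysis, and more☆11May 24, 2016Updated 9 years ago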
- Towards Understanding Sharpness-Aware Minimization [ICML 2022]☆38Jun 14, 2022Updated 3 years ago
- ☆12Feb 16, 2024Updated 2 years ago
- [CVPR 2021] FMO Deblurring Benchmark☆13Jan 12, 2022Updated 4 years ago
- Official implementation of the paper "LTrack: Generalizing Multiple Object Tracking to Unseen Domains by Introducing Natural Language Rep…☆12Jul 26, 2023Updated 2 years ago
- ECCV24 "ReMamber: Referring Image Segmentation with Mamba Twister" official repository.☆44Jul 11, 2024Updated last year
- Quick Long Video Understanding [TMLR2025]☆76Oct 27, 2025Updated 4 months ago
- Web app for makeup transfer using Stable Diffusion☆10Sep 11, 2023Updated 2 years ago
- Information about playing Matroska files☆11Apr 15, 2024Updated last year
- EgoToM is an egocentric theory-of-mind benchmark built on Ego4D videos, containing multi-choice questions that evaluate multimodal large …☆13Apr 1, 2025Updated 11 months ago
- Computational Neuroscience stuff☆13Aug 12, 2019Updated 6 years ago
- Automatically constructed lexical database for Bangla inspired from Wordnet☆11Jul 12, 2012Updated 13 years ago
- BanglaWriting: A multi-purpose offline Bangla handwriting dataset☆12Nov 18, 2020Updated 5 years ago
- Companion repository to "Prompt Compression and Contrastive Conditioning for Controllability and Toxicity Reduction in Language Models"☆14May 31, 2023Updated 2 years ago