☆24Dec 26, 2024Updated last year
Alternatives and similar repositories for SliMM
Users that are interested in SliMM are comparing it to the libraries listed below
- ☆21Jan 17, 2025Updated last year
- [AAAI-25] Official repository of "Comprehensive Multi-Modal Prototypes are Simple and Effective Classifiers for Vast-Vocabulary Object De…☆20Dec 27, 2024Updated last year
- [NeurIPS-24] This is the official implementation of the paper "DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effect…☆80Jun 17, 2024Updated last year
- code for downloading videos from HowTo100M dataset☆17May 13, 2021Updated 4 years ago
- [CVPR 2024] The official implementation of paper "synthesize, diagnose, and optimize: towards fine-grained vision-language understanding"☆52Jun 16, 2025Updated 8 months ago
- [AAAI 2025] Official Implementation of "FOCUS: Towards Universal Foreground Segmentation"☆56Jul 8, 2025Updated 7 months ago
- Pytorch implementation for the paper titled "SimpleAR: Pushing the Frontier of Autoregressive Visual Generation"☆426Jun 20, 2025Updated 8 months ago
- ☆134Dec 22, 2023Updated 2 years ago
- Visual Instruction Tuning for Qwen2 Base Model☆41Jun 29, 2024Updated last year
- a python lib for neural networks, file and image processing etc.☆10Feb 11, 2020Updated 6 years ago
- [ICCV2023 Oral] Implicit Temporal Modeling with Learnable Alignment for Video Recognition☆41Nov 29, 2023Updated 2 years ago
- [ICCV 2025] Official implementation of LLaVA-KD: A Framework of Distilling Multimodal Large Language Models☆125Oct 14, 2025Updated 4 months ago
- Autoencoder for multi-label classification using Google's Tensorflow framework and MDMR for feature selection.☆10Aug 31, 2017Updated 8 years ago
- Feature Pyramid Networks for Object Detection on caffe☆10Nov 8, 2017Updated 8 years ago
- Official pytorch implementation of "Tool-R1: Sample-Efficient Reinforcement Learning for Agentic Tool Use"☆20Sep 16, 2025Updated 5 months ago
- ☆33Jan 9, 2026Updated last month
- Action recognition based on action graph, which describes the spatio-temporal relationship between dense trajectory clusters. The program…☆11Jan 7, 2015Updated 11 years ago
- Contrastive Continual Learning with Importance Sampling and Prototype-Instance Relation Distillation☆12Jul 22, 2024Updated last year
- Blind First-Order Perspective Distortion Correction using Parallel Convolutional Neural Networks☆12Nov 19, 2021Updated 4 years ago
- PyTorch implementation of the computer vision related part of the paper "Unsupervised Data Augmentation for Consistency Training".☆10Mar 26, 2020Updated 5 years ago
- ☆10Jul 5, 2024Updated last year
- A PyTorch implementation of the "Image Inpainting for Irregular Holes Using Partial Convolutions" paper from Liu et al at NVIDIA☆10Aug 24, 2019Updated 6 years ago
- Official PyTorch implementation of the ECCV 2022 paper: Efficient Video Transformers with Spatial-Temporal Token Selection.☆51Jul 13, 2022Updated 3 years ago
- OpenCV-based detection of Visible Light Communication (VLC) transmitter LEDs in Python☆12Jan 18, 2018Updated 8 years ago
- Implement spike-drive using OR residual connection and propose SynA attention for natural pruning.☆12Mar 31, 2024Updated last year
- ☆12Jun 7, 2022Updated 3 years ago
- extending Hadoop to support video analytic applications☆10Mar 26, 2015Updated 10 years ago
- ☆20Aug 14, 2025Updated 6 months ago
- An efficient spiking variational autoencoder☆13Nov 13, 2023Updated 2 years ago
- [ICCV 2025] LIRA☆21Nov 25, 2025Updated 3 months ago
- python script to use darknet yolo library☆13Sep 24, 2018Updated 7 years ago
- [ICLR 2026] BARREL: Boundary-Aware Reasoning for Factual and Reliable LRMs☆17May 21, 2025Updated 9 months ago
- This is a Pytorch implementation for paper "High-Resolution Image Inpainting using Multi-Scale Neural Patch Synthesis"☆11Apr 22, 2020Updated 5 years ago
- [NeurIPS 2024] OmniTokenizer: one model and one weight for image-video joint tokenization.☆321Jul 9, 2024Updated last year
- Source code for abnormal detection on MIT video surveillance dataset using Nonnegative Matrix Factorization☆11May 10, 2020Updated 5 years ago
- Official repo for vidar and vidarc: video foundation model for robotics.☆37Dec 22, 2025Updated 2 months ago
- ☆14Aug 7, 2017Updated 8 years ago
- miemienet is a C++ AI deep learning inference framework. Supports PPYOLOE, PICODET.☆12Nov 4, 2022Updated 3 years ago
- ☆10Mar 15, 2022Updated 3 years ago