Easiest way of fine-tuning HuggingFace video classification models
☆148Mar 20, 2023Updated 2 years ago
Alternatives and similar repositories for video-transformers
Users that are interested in video-transformers are comparing it to the libraries listed below
Sorting:
- ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models (ICLR 2024, Official Implementation)☆16Jan 18, 2024Updated 2 years ago
- Course repository for the Spring 2023 COMP664 course "Deep Learning" at UNC☆14Apr 17, 2023Updated 2 years ago
- Effective frame sampling for ML applications.☆25Aug 30, 2025Updated 6 months ago
- ☆19Jan 30, 2023Updated 3 years ago
- ☆58Dec 2, 2025Updated 3 months ago
- Object recognition with Pepper using a deep learning model☆10Sep 16, 2021Updated 4 years ago
- ☆20Oct 3, 2022Updated 3 years ago
- Official Implementation of Web-based Visual Corpus Builder (Webvicob), ICDAR 2023☆109Oct 24, 2023Updated 2 years ago
- A guide to structured generation using constrained decoding☆14Jun 9, 2024Updated last year
- Run all the tests at the same time with modal.com☆11Mar 2, 2024Updated 2 years ago
- IBM Quantum Challenge Fall 2023☆10May 23, 2023Updated 2 years ago
- This repository includes the code for HiCo (PyTorch version).☆11Sep 24, 2022Updated 3 years ago
- Python tools☆14Oct 22, 2023Updated 2 years ago
- Official code for DAM: Dynamic Adapter Merging for Continual Video QA Learning☆14Apr 25, 2024Updated last year
- Langchain_CrewAI_Gemini - An Gemini AI powered AI Agent (Multi-Agent) Project.☆13Mar 24, 2024Updated last year
- Bert TensorRT模型加速部署☆10Apr 1, 2022Updated 3 years ago
- Summaries of machine learning papers☆12Aug 19, 2022Updated 3 years ago
- [NeurIPS 2022] DreamShard: Generalizable Embedding Table Placement for Recommender Systems☆29Mar 24, 2023Updated 2 years ago
- ☆14Jul 2, 2024Updated last year
- 📃 A curated list of all possible resources (tools, tutorials, platforms, etc) an andrew email can get you☆13Nov 15, 2024Updated last year
- A Pytorch (support batch and channel) implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech…☆12Jul 24, 2024Updated last year
- Code release for the paper "Progress-Aware Video Frame Captioning" (CVPR 2025)☆21Jul 16, 2025Updated 7 months ago
- CargoCoin is designed to be a smart contract, crypto currency platform, decentralising global trade and transport. The platform target is…☆13Aug 8, 2018Updated 7 years ago
- [ICLR 2025] Official code repository for "TULIP: Token-length Upgraded CLIP"☆33Jan 26, 2026Updated last month
- Implementation of ViViT: A Video Vision Transformer☆556Jun 21, 2021Updated 4 years ago
- [AAAI 2023 (Oral)] CrissCross: Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Synchronicity☆25Jul 11, 2023Updated 2 years ago
- Official repository of "Camera Distortion-aware 3D Human Pose Estimation in Video with Optimization-based Meta-Learning", ICCV 2021☆17Aug 4, 2023Updated 2 years ago
- A starter kit for evaluating benchmarks on the 🤗 Hub☆16Dec 29, 2023Updated 2 years ago
- Label images with LabelImg; Object detection with detectron2☆13Aug 20, 2021Updated 4 years ago
- Anatomy-aware self-supervised learning☆11Jun 22, 2024Updated last year
- Utilities for working with videos☆13Jul 5, 2025Updated 8 months ago
- SMILE: A Multimodal Dataset for Understanding Laughter☆13Jun 15, 2023Updated 2 years ago
- 视频会议换背景⚡背景调色☆15Feb 17, 2022Updated 4 years ago
- try VAE GANLoss + SSIM loss in anomaly Detection☆14Apr 12, 2020Updated 5 years ago
- The repository collects many various multi-modal transformer architectures, including image transformer, video transformer, image-languag…☆234Aug 27, 2022Updated 3 years ago
- [CVPR23 Highlight] CREPE: Can Vision-Language Foundation Models Reason Compositionally?☆35Apr 27, 2023Updated 2 years ago
- [NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training☆1,683Dec 8, 2023Updated 2 years ago
- Repo for paper: "Paxion: Patching Action Knowledge in Video-Language Foundation Models" Neurips 23 Spotlight☆37May 23, 2023Updated 2 years ago
- Lyra V2 (SoundStream) running in the browser☆19Sep 20, 2023Updated 2 years ago