sangho-vision / avbertView external linksLinks
☆31Sep 20, 2021Updated 4 years ago
Alternatives and similar repositories for avbert
Users that are interested in avbert are comparing it to the libraries listed below
Sorting:
- Code for CVPR2023 paper "Collaborative Noisy Label Cleaner: Learning Scene-aware Trailers for Multi-modal Highlight Detection in Movies"☆18Mar 21, 2023Updated 2 years ago
- ☆11Feb 18, 2022Updated 3 years ago
- ☆14Oct 7, 2021Updated 4 years ago
- [ACM MM 2022] MM_Pyramid: Multimodal Pyramid Attentional Network for Audio-Visual Event Localization and Video Parsing☆16Aug 26, 2022Updated 3 years ago
- Vision Transformers are Parameter-Efficient Audio-Visual Learners☆106Aug 11, 2023Updated 2 years ago
- ☆21Mar 22, 2023Updated 2 years ago
- This is the official code for NeurIPS 2023 paper "Learning Unseen Modality Interaction"☆18Jan 22, 2024Updated 2 years ago
- Cross-model active contrastive coding☆22Mar 17, 2021Updated 4 years ago
- This repo contains evaluation code for the paper "AV-Odyssey: Can Your Multimodal LLMs Really Understand Audio-Visual Information?"☆31Dec 23, 2024Updated last year
- Multi-Scale Attention for Audio Question Answering☆28Jul 19, 2023Updated 2 years ago
- [CVPR 2023] Egocentric Audio-Visual Object Localization☆26Jan 6, 2024Updated 2 years ago
- Official Implementation of the paper: A Complete Recipe for Diffusion Generative Models☆31Nov 1, 2024Updated last year
- ☆26Jan 12, 2022Updated 4 years ago
- ☆73Jun 3, 2022Updated 3 years ago
- Implementation of Cross-category Video Highlight Detection via Set-based Learning (ICCV 2021).☆79Aug 27, 2021Updated 4 years ago
- This is an example of how to implement ruffle in your own website.☆12Oct 15, 2023Updated 2 years ago
- Finetuning & extending DiffusionDet to video & pedestrian multi-object-tracking☆13Apr 12, 2023Updated 2 years ago
- AI DJ Mix Generator - a fully automated system that creates a mix from input of songs closely resembling real life djs work. Includes adv…☆16Jul 2, 2025Updated 7 months ago
- 新词发现/新词挖掘/自由度/凝固度/python3☆10May 28, 2019Updated 6 years ago
- Style Transfer by Rigid Alignment in Neural Net Feature Space☆11Jan 23, 2021Updated 5 years ago
- Contains source code for the Udemy lecture "Applied Yocto Project using Raspberry Pi 5"☆15Jan 19, 2025Updated last year
- ☆20Updated this week
- multimodal video-audio-text generation and retrieval between every pair of modalities on the MUGEN dataset. The repo. contains the traini…☆40Apr 1, 2023Updated 2 years ago
- Detects scene change or cuts in a video file☆11Oct 23, 2017Updated 8 years ago
- Repository implementing the lightweight split learning framework enabling edge devices to collaboratively train machine learning models w…☆10Mar 27, 2024Updated last year
- Matlab toolbox to compute the statistics, pdf, cdf, inverse cdf and random numbers of the generalized chi-square distribution.☆12Nov 16, 2025Updated 2 months ago
- The code for AAAI 2025 “Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation”☆15Jan 3, 2025Updated last year
- 豆瓣电影评论可视化☆10May 19, 2016Updated 9 years ago
- Anki add-on that adds Pinyin and Zhuyin readings above Chinese characters in any field.☆12Sep 23, 2025Updated 4 months ago
- An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"☆1,023Apr 12, 2024Updated last year
- PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.☆87Sep 13, 2021Updated 4 years ago
- Self-Supervised Learning by Cross-Modal Audio-Video Clustering (NeurIPS 2020)☆91Oct 24, 2022Updated 3 years ago
- Pytorch implementation of various token mixers; Attention Mechanisms, MLP, and etc for understanding computer vision papers and other tas…☆16Oct 7, 2024Updated last year
- 3D model converter for Anno 2070/2205/1800 with animation support (proprietary .rdm ⇄ glTF 2.0)☆14Jul 23, 2024Updated last year
- Deep learning for named entity recognition on CoNLL-2003☆10Dec 23, 2016Updated 9 years ago
- Unofficial implementation of paper : Exploring the Space of Key-Value-Query Models with Intention☆12May 24, 2023Updated 2 years ago
- PyTorch implementation for "Gated Transfer Network for Transfer Learning"☆11Jun 3, 2019Updated 6 years ago
- [IJCV 2025] The official implementation of "AnyPattern: Towards In-context Image Copy Detection"☆10Oct 24, 2025Updated 3 months ago
- Implementation of Mutan+ArticleNet on OKVQA☆10Jan 11, 2021Updated 5 years ago