☆31Sep 20, 2021Updated 4 years ago
Alternatives and similar repositories for avbert
Users that are interested in avbert are comparing it to the libraries listed below
Sorting:
- Code for CVPR2023 paper "Collaborative Noisy Label Cleaner: Learning Scene-aware Trailers for Multi-modal Highlight Detection in Movies"☆18Mar 21, 2023Updated 2 years ago
- PyTorch Implementation on Paper [CVPR2021]Distilling Audio-Visual Knowledge by Compositional Contrastive Learning☆89Jul 7, 2021Updated 4 years ago
- ☆14Oct 7, 2021Updated 4 years ago
- Vision Transformers are Parameter-Efficient Audio-Visual Learners☆106Aug 11, 2023Updated 2 years ago
- This is the official code for NeurIPS 2023 paper "Learning Unseen Modality Interaction"☆18Jan 22, 2024Updated 2 years ago
- Multi-Scale Attention for Audio Question Answering☆28Jul 19, 2023Updated 2 years ago
- Official Implementation of the paper: A Complete Recipe for Diffusion Generative Models☆31Nov 1, 2024Updated last year
- [CVPR 2023] Egocentric Audio-Visual Object Localization☆26Jan 6, 2024Updated 2 years ago
- A curated list of different papers and datasets in various areas of audio-visual processing☆766Jan 30, 2024Updated 2 years ago
- A dataset for Audio-Visual Sound Event Detection in Movies☆26Jan 23, 2023Updated 3 years ago
- ☆26Jan 12, 2022Updated 4 years ago
- 新词发现/新词挖掘/自由度/凝固度/python3☆10May 28, 2019Updated 6 years ago
- 4K Video Player for Raspberry Pi 5 for standalone installation☆14Nov 5, 2024Updated last year
- Finetuning & extending DiffusionDet to video & pedestrian multi-object-tracking☆13Apr 12, 2023Updated 2 years ago
- Style Transfer by Rigid Alignment in Neural Net Feature Space☆11Jan 23, 2021Updated 5 years ago
- This is an example of how to implement ruffle in your own website.☆14Oct 15, 2023Updated 2 years ago
- Using VideoBERT to tackle video prediction☆134May 10, 2021Updated 4 years ago
- A collection of audio autoencoders, in PyTorch.☆44Mar 7, 2023Updated 3 years ago
- one script for xls-r/xlsr/whisper fine-tuning☆42Jun 29, 2023Updated 2 years ago
- Elastic-net VARMA: hyperparameter optimisation, estimation and forecasting☆11Jan 30, 2023Updated 3 years ago
- multimodal video-audio-text generation and retrieval between every pair of modalities on the MUGEN dataset. The repo. contains the traini…☆40Apr 1, 2023Updated 2 years ago
- Skeleton Vulkan project using SDL / C++ 🔺 a "Hello Triangle" sample code demo plus examples from "Vulkan Tutorial" (iOS/macOS, Windows, …☆13Jul 11, 2024Updated last year
- FM Radio Project, RPI PICO RP2040 SDK C++☆12Feb 6, 2025Updated last year
- Raspbot V2 AI Vision Robot Car for Raspberry Pi 5☆17Sep 10, 2025Updated 5 months ago
- Repository where you can clone anything which has to do with Automation.☆10Nov 4, 2024Updated last year
- ☆20Feb 21, 2026Updated last week
- Detects scene change or cuts in a video file☆11Oct 23, 2017Updated 8 years ago
- The code for AAAI 2025 “Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation”☆15Jan 3, 2025Updated last year
- ☆12Aug 30, 2022Updated 3 years ago
- Anki add-on that adds Pinyin and Zhuyin readings above Chinese characters in any field.☆12Sep 23, 2025Updated 5 months ago
- Matlab toolbox to compute the statistics, pdf, cdf, inverse cdf and random numbers of the generalized chi-square distribution.☆12Updated this week
- 豆瓣电影评论可视化☆10May 19, 2016Updated 9 years ago
- An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"☆1,025Apr 12, 2024Updated last year
- Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval [ICCV'21]☆380May 19, 2022Updated 3 years ago
- PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.☆87Sep 13, 2021Updated 4 years ago
- Self-Supervised Learning by Cross-Modal Audio-Video Clustering (NeurIPS 2020)☆91Oct 24, 2022Updated 3 years ago
- Quiz and assignment solutions for Coursera MOOC - Aerial Robotics☆13Aug 15, 2016Updated 9 years ago
- This repository provides a small Python wrapper for the Matlab tool SNR Eval provided by Labrosa: https://labrosa.ee.columbia.edu/project…☆12Jun 22, 2022Updated 3 years ago
- finetune script for SDXL adapted from waifu-diffusion trainer☆11Aug 21, 2023Updated 2 years ago