a-nagrani / ffmpeg-commandsView external linksLinks
Collection of useful FFMPEG commands for processing audio and video files.
☆44Jan 29, 2019Updated 7 years ago
Alternatives and similar repositories for ffmpeg-commands
Users that are interested in ffmpeg-commands are comparing it to the libraries listed below
Sorting:
- Code for Learning to Learn Language from Narrated Video☆33Oct 3, 2023Updated 2 years ago
- The demo for "Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem".☆12Oct 25, 2021Updated 4 years ago
- Boiler plate code for Torch based ML projects☆10Jul 14, 2021Updated 4 years ago
- Video action classification benchmark for common CNN architectures, implemented in PyTorch☆11Jan 31, 2022Updated 4 years ago
- This repo covers the implementation for Labelling unlabelled videos from scratch with multi-modal self-supervision, which learns clusters…☆117Apr 26, 2021Updated 4 years ago
- ☆11Apr 27, 2019Updated 6 years ago
- Logging utility for ML experiments☆16Jun 18, 2022Updated 3 years ago
- Single-view 3D Prediction☆12Sep 18, 2025Updated 4 months ago
- Implementation of the VGGVox network using TensorFlow.☆16Sep 1, 2019Updated 6 years ago
- Video narrator written in Python/GTK using vlc-lib☆25Jun 22, 2022Updated 3 years ago
- Python library for building and running distributed data pipelines using Ray☆53Dec 16, 2025Updated last month
- [ACCV 2024] Official Implementation of "AutoAD-Zero: A Training-Free Framework for Zero-Shot Audio Description". Junyu Xie, Tengda Han, M…☆28Jan 28, 2025Updated last year
- EPIC-KITCHENS-55 dataset python library☆31Jun 21, 2022Updated 3 years ago
- Volume Rendering, Neural Radiance Fields, Neural Surfaces☆25Oct 21, 2025Updated 3 months ago
- 3D Gaussian Splatting and Diffusion Guided Optimization☆35Oct 21, 2025Updated 3 months ago
- [CVPR'22 Oral] Temporal Alignment Networks for Long-term Video. Tengda Han, Weidi Xie, Andrew Zisserman.☆119Oct 9, 2023Updated 2 years ago
- Minimal API for receptive field calculation in PyTorch☆68Sep 15, 2022Updated 3 years ago
- (ShaderFarc) Shader Port from DIVA to MMD.☆13Jan 13, 2025Updated last year
- The Holistic Video Understanding Dataset (ECCV 2020 Spotlight presentation)☆73Mar 11, 2021Updated 4 years ago
- BSL-1K: Scaling up co-articulated sign language recognition using mouthing cues, ECCV 2020☆86Sep 14, 2021Updated 4 years ago
- Website-based resource monitor for Slurm system☆37Apr 6, 2023Updated 2 years ago
- Unsupervised Learning of Multi-Frame Optical Flow with Occlusions☆43Nov 29, 2018Updated 7 years ago
- Diagnostic tools and additional visualizations from "What Actions are Needed for Understanding Human Actions in Videos?" ICCV 2017☆88Dec 19, 2017Updated 8 years ago
- ☆11Jul 17, 2023Updated 2 years ago
- This is an official pytorch implementation of Learning To Recognize Procedural Activities with Distant Supervision. In this repository, w…☆43Feb 21, 2023Updated 2 years ago
- I have created a dataset of Image-Text-Pairs by using the cosine similarity of the CLIP embeddings of the image & it's caption derrived f…☆15Apr 22, 2021Updated 4 years ago
- Prompt Generation Networks for Input-Space Adaptation of Frozen Vision Transformers. Jochem Loedeman, Maarten C. Stol, Tengda Han, Yuki M…☆44Sep 11, 2024Updated last year
- Code for ECCV2018 paper: "Interpretable Intuitive Physics Model"☆43Sep 26, 2018Updated 7 years ago
- A one-stop shop for YouCook2 info such as leaderboard and recent advances on (cooking) video retrieval and captioning.☆41Jun 29, 2022Updated 3 years ago
- Find the offset of an audio file within another audio file☆10Jun 2, 2022Updated 3 years ago
- An interactive semi-automatic binary segmentation model. Implemented in OpenCV 3.3.0 and Python 2.7☆11Jul 19, 2018Updated 7 years ago
- A simple virtual environment manager for Bash and Zsh☆10Jul 31, 2016Updated 9 years ago
- Code for the paper "Interpreting video features: A comparison of 3D Convolutional networks and Convolutional LSTM networks"☆11Dec 14, 2020Updated 5 years ago
- ☆10Aug 20, 2023Updated 2 years ago
- Official implementation of "In-style: Bridging Text and Uncurated Videos with Style Transfer for Cross-modal Retrieval." ICCV 2023☆11Oct 5, 2023Updated 2 years ago
- Now that's a spicy plotter library - Python 3 port☆12Apr 28, 2021Updated 4 years ago
- ☆11Sep 18, 2017Updated 8 years ago
- AAAI'23 Workshop, A-ColViT☆11Jun 16, 2023Updated 2 years ago
- An application of genetic algorithms in pathfinding.☆10Oct 28, 2019Updated 6 years ago