Extract MFCCs from videos and make bag-of-audio-words (BOAW) representations.
☆11Dec 20, 2018Updated 7 years ago
Alternatives and similar repositories for mfcc_boaw
Users that are interested in mfcc_boaw are comparing it to the libraries listed below.
- An easy-to-use tool to extract frames from video and store into database.☆32Jan 4, 2019Updated 7 years ago
- 1st Place Solution to 2019 WAIC hackathon Garbage Classification Challenge☆15Sep 10, 2019Updated 6 years ago
- ☆36Nov 4, 2022Updated 3 years ago
- Official Code for VideoLT: Large-scale Long-tailed Video Recognition (ICCV 2021)☆34Apr 9, 2022Updated 3 years ago
- 4th place solution to Google Cloud & YouTube-8M Video Understanding Challenge☆26Jun 16, 2017Updated 8 years ago
- Code and Models for paper "Reinforced Video Captioning with Entailment Rewards (EMNLP 2017)"☆44Nov 19, 2019Updated 6 years ago
- PyTorch Implementation of Consensus-based Sequence Training for Video Captioning☆60May 15, 2018Updated 7 years ago
- Extract video features from C3D pretrained on Sports-1M and Kinetics☆15Jul 2, 2019Updated 6 years ago
- Leverage 3D video and Spatial Audio to deliver an immersive experience.☆11Oct 11, 2023Updated 2 years ago
- In photographic media, faces are often obfuscated to protect the identity of those pictured. This obfuscation process is done by removing…☆12Jan 10, 2019Updated 7 years ago
- ☆17Nov 17, 2020Updated 5 years ago
- Simple real-time Sound Event Detector based on YAMNet and pyaudio.☆23Jan 16, 2020Updated 6 years ago
- Code for "Skill-based Chain-of-Thoughts for Domain-Adaptive Video Reasoning [EMNLP 2025 Finding]"☆16Aug 27, 2025Updated 7 months ago
- ☆16Dec 17, 2018Updated 7 years ago
- ☆11Mar 6, 2019Updated 7 years ago
- TensorFlow implementation of the paper "Sequence to Sequence: Video to Text"☆88Jul 31, 2018Updated 7 years ago
- tensor-train tensor completion (T3C), which is based on tt decomposition and gradient descent.☆12Jun 27, 2018Updated 7 years ago
- Ad-hoc Video Search☆28Feb 18, 2021Updated 5 years ago
- Magnetic resonance imaging (MRI) images are known to be sparse. This is an implementation using non-convex penalty function that encourag…☆20Aug 10, 2019Updated 6 years ago
- This is a PyTorch version of Non-local Neural Networks (ongoing)☆27May 18, 2019Updated 6 years ago
- ACM MM 2022 paper_AVQA: A Dataset for Audio-Visual Question Answering on Videos☆16Aug 17, 2023Updated 2 years ago
- ☆13May 21, 2024Updated last year
- The official source code of our AAAI25 paper "D&M: Enriching E-commerce Videos with Sound Effects by Key Moment Detection and SFX Matchin…☆10Feb 9, 2025Updated last year
- [CVPR 2025] Official implementation of paper "Multi-Granularity Class Prototype Topology Distillation for Class-Incremental Source-Free …☆17Aug 26, 2025Updated 7 months ago
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Jun 5, 2020Updated 5 years ago
- The implementation of Sequential VLAD in Pytorch☆20Jun 20, 2019Updated 6 years ago
- [ACL 2021] This is the Pytorch code for our paper "Semantic Relation-aware Difference Representation Learning for Change Captioning".☆13Jan 16, 2022Updated 4 years ago
- [DEPRECATED] Piano Transformer model trained on 2.6GB of MIDI piano music☆13Oct 10, 2022Updated 3 years ago
- Comparison of face recognition using PCA, LDA-improved PCA, NMF, LNMF, FNMF, and sparse-matrix-based classification (SRC)☆16May 22, 2018Updated 7 years ago
- This is the official Pytorch code for our paper "Artemis: Structured Visual Reasoning for Perception Policy Learning".☆14Dec 4, 2025Updated 3 months ago
- MRI reconstruction via non-convex total variation regularization☆18Dec 26, 2019Updated 6 years ago
- Connective Cognition Network for Directional Visual Commonsense Reasoning☆15May 6, 2021Updated 4 years ago
- ICCV2019: Controllable Video Captioning with POS Sequence Guidance Based on Gated Fusion Network☆68Nov 19, 2019Updated 6 years ago
- [ICCV2023] - CTP: Towards Vision-Language Continual Pretraining via Compatible Momentum Contrast and Topology Preservation☆38Oct 8, 2024Updated last year
- AN INTERACTIVE REMOTE SENSING CHANGE ANALYSIS MODEL BASED ON MULTIMODAL INSTRUCTION TUNING☆21Jun 16, 2025Updated 9 months ago
- Face Recognition using PCA and SVM on Yale, CMU-PIE and SMAI 2013 Student Datasets☆14Dec 16, 2013Updated 12 years ago
- HAASD: A dataset of Household Appliances Abnormal Sound Detection - paper replication data☆15Sep 16, 2019Updated 6 years ago
- TensorFlow Implementation of "Attention Clusters: Purely Attention Based Local Feature Integration for Video Classification".☆41Sep 12, 2018Updated 7 years ago
- This repository contains the code for all figures in the paper "General Pitfalls of Model-agnostic Interpretation Methods for Machine Lea…☆15Aug 17, 2021Updated 4 years ago