forwchen/mfcc_boaw

Extract MFCCs from videos and make bag-of-audio-words (BOAW) representations.
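
As a rough illustration of the bag-of-audio-words idea named above, the sketch below quantizes frame-level feature vectors against a learned codebook and counts codeword occurrences per clip. It is not taken from the repository: the function names (`nearest`, `learn_codebook`, `boaw_histogram`) are hypothetical, plain k-means is assumed for codebook learning, and toy 2-D vectors stand in for real MFCC frames, whose extraction from video is omitted here.

```python
import random


def nearest(codebook, frame):
    """Index of the codeword closest to `frame` (squared Euclidean distance)."""
    return min(range(len(codebook)),
               key=lambda k: sum((c - x) ** 2 for c, x in zip(codebook[k], frame)))


def learn_codebook(frames, k, iters=10, seed=0):
    """Plain k-means over frame-level feature vectors; returns k centroids."""
    rng = random.Random(seed)
    codebook = [list(f) for f in rng.sample(frames, k)]
    for _ in range(iters):
        buckets = [[] for _ in range(k)]
        for f in frames:
            buckets[nearest(codebook, f)].append(f)
        for i, bucket in enumerate(buckets):
            if bucket:  # keep the old centroid if no frame was assigned to it
                codebook[i] = [sum(col) / len(bucket) for col in zip(*bucket)]
    return codebook


def boaw_histogram(frames, codebook):
    """L1-normalized count of how often each codeword is the nearest one."""
    counts = [0] * len(codebook)
    for f in frames:
        counts[nearest(codebook, f)] += 1
    total = sum(counts)
    return [c / total for c in counts] if total else [0.0] * len(codebook)


# Toy stand-in for MFCC frames (assumption: real frames would be ~13-D or more).
train = [[0.0, 0.1], [0.1, 0.0], [0.0, 0.0], [5.0, 5.1], [5.1, 5.0], [5.0, 5.0]]
codebook = learn_codebook(train, k=2)
clip = [[0.05, 0.05], [5.0, 4.9], [4.9, 5.0], [5.05, 5.0]]
hist = boaw_histogram(clip, codebook)  # one fixed-length vector per clip
```

The resulting histogram has one entry per codeword regardless of clip length, which is what makes it usable as a fixed-size clip-level feature.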

☆11

Alternatives and similar repositories for mfcc_boaw

Users who are interested in mfcc_boaw are comparing it to the libraries listed below.

forwchen / vid2frame
An easy-to-use tool to extract frames from video and store them in a database.
☆32 · Jan 4, 2019 · Updated 7 years ago
17Skye17 / 2019WAIC-hackthon-Garbage-Classification
1st Place Solution to the 2019WAIC hackathon Garbage Classification Challenge
☆15 · Sep 10, 2019 · Updated 6 years ago
wengzejia1 / Semiformer
☆36 · Nov 4, 2022 · Updated 3 years ago
17Skye17 / VideoLT
Official Code for VideoLT: Large-scale Long-tailed Video Recognition (ICCV 2021)
☆34 · Apr 9, 2022 · Updated 4 years ago
ramakanth-pasunuru / video_captioning_rl
Code and Models for paper "Reinforced Video Captioning with Entailment Rewards (EMNLP 2017)"
☆43 · Nov 19, 2019 · Updated 6 years ago
forwchen / yt8m
4th place solution to Google Cloud & YouTube-8M Video Understanding Challenge
☆26 · Jun 16, 2017 · Updated 9 years ago
mynlp / cst_captioning
PyTorch Implementation of Consensus-based Sequence Training for Video Captioning
☆60 · May 15, 2018 · Updated 8 years ago
katsura-jp / extruct-video-feature
Extract video features from C3D pretrained on Sports-1M and Kinetics
☆16 · Jul 2, 2019 · Updated 7 years ago
XRealityZone / Apple_DestinationVideo
Leverage 3D video and Spatial Audio to deliver an immersive experience.
☆11 · Oct 11, 2023 · Updated 2 years ago
utkarshmalik211 / Reconstructing-Blurred-Human-Faces
In photographic media, faces are often obfuscated to protect the identity of those pictured. This obfuscation process is done by removing…
☆11 · Jan 10, 2019 · Updated 7 years ago
mipuc / hts-engine-world
☆17 · Nov 17, 2020 · Updated 5 years ago
zj15001 / MCTV_L2
MRI reconstruction via non-convex total variation regularization
☆18 · Dec 26, 2019 · Updated 6 years ago
sususushi / reconstruction-network-for-video-captioning
☆16 · Dec 17, 2018 · Updated 7 years ago
J11235 / machine-learning-pro
☆11 · Mar 6, 2019 · Updated 7 years ago
yuanlonghao / T3C_tensor_completion
Tensor-train tensor completion (T3C), based on TT decomposition and gradient descent.
☆13 · Jun 27, 2018 · Updated 8 years ago
chenxinpeng / S2VT
TensorFlow implementation of the paper: Sequence to Sequence: Video to Text
☆88 · Jul 31, 2018 · Updated 7 years ago
li-xirong / avs
Ad-hoc Video Search
☆29 · Feb 18, 2021 · Updated 5 years ago
EvanZhuang / MRI-Reconstruction-with-Sparse-Optimization
Magnetic resonance imaging (MRI) images are known to be sparse. This is an implementation using non-convex penalty function that encourag…
☆19 · Aug 10, 2019 · Updated 6 years ago
Cater5009 / Face-Recognition-With-NMF
A comparison of face recognition using PCA, LDA-improved PCA, NMF, LNMF, FNMF, and sparse-matrix-based discrimination (SRC)
☆16 · May 22, 2018 · Updated 8 years ago
SangwonSUH / realtime_YAMNET
Simple real-time Sound Event Detector based on YAMNet and pyaudio.
☆23 · Jan 16, 2020 · Updated 6 years ago
adxcreative / D-M
The official source code of our AAAI25 paper "D&M: Enriching E-commerce Videos with Sound Effects by Key Moment Detection and SFX Matchin…
☆10 · Feb 9, 2025 · Updated last year
mira-ai-lab / MUSIC-AVQA-R
☆13 · May 21, 2024 · Updated 2 years ago
cyfer0618 / kaldi-pytorch-rnnlm
Enable RNNLM lattice rescoring with PyTorch [kaldi]
☆12 · Jun 5, 2020 · Updated 6 years ago
youjiangxu / seqvlad-pytorch
The implementation of Sequential VLAD in PyTorch
☆20 · Jun 20, 2019 · Updated 7 years ago
tuyunbin / SRDRL
[ACL 2021] This is the PyTorch code for our paper "Semantic Relation-aware Difference Representation Learning for Change Captioning".
☆13 · Jan 16, 2022 · Updated 4 years ago
teddysum / korean_evaluation
☆11 · Jun 5, 2025 · Updated last year
vsislab / Controllable_XGating
ICCV2019: Controllable Video Captioning with POS Sequence Guidance Based on Gated Fusion Network
☆68 · Nov 19, 2019 · Updated 6 years ago
asigalov61 / GIGA-Piano
[DEPRECATED] Piano Transformer model trained on 2.6GB of MIDI piano music
☆13 · Oct 10, 2022 · Updated 3 years ago
AmingWu / CCN
Connective Cognition Network for Directional Visual Commonsense Reasoning
☆15 · May 6, 2021 · Updated 5 years ago
17Skye17 / Non-local-Neural-Networks-Pytorch
This is a PyTorch version of Non-local Neural Networks (ongoing)
☆27 · May 18, 2019 · Updated 7 years ago
mingyan08 / ProxL1-L2
The source code for the paper "Fast l1-l2 minimization via a proximal operator"
☆25 · Oct 3, 2020 · Updated 5 years ago
somayjain / FaceRecognition
Face Recognition using PCA and SVM on Yale, CMU-PIE and SMAI 2013 Student Datasets
☆14 · Dec 16, 2013 · Updated 12 years ago
JYongSmile / paper-2018-HAASD
HAASD: A dataset of Household Appliances Abnormal Sound Detection - paper replication data
☆16 · Sep 16, 2019 · Updated 6 years ago
monologg / dotfiles
Simple setup for personal dotfiles
☆11 · Jul 4, 2026 · Updated 3 weeks ago
slds-lmu / code_pitfalls_iml
This repository contains the code for all figures in the paper "General Pitfalls of Model-agnostic Interpretation Methods for Machine Lea…
☆15 · Aug 17, 2021 · Updated 4 years ago
kanchen-usc / VIG
Dataset for Visually Indicated Sound Generation by Perceptually Optimized Classification
☆21 · Apr 6, 2020 · Updated 6 years ago
pomonam / AttentionCluster
TensorFlow Implementation of "Attention Clusters: Purely Attention Based Local Feature Integration for Video Classification".
☆41 · Sep 12, 2018 · Updated 7 years ago
ictnlp / LNMT-CA
Code for EMNLP 2022 main conference paper "Low-resource Neural Machine Translation with Cross-modal Alignment".
☆15 · Apr 25, 2023 · Updated 3 years ago
hongwang600 / fashion-iq-metadata
This repo contains some useful metadata for the Fashion IQ challenge: https://sites.google.com/view/lingir/fashion-iq
☆15 · Jun 28, 2019 · Updated 7 years ago