Code for our paper "Acoustic Features Fusion using Attentive Multi-channel Deep Architecture" in Keras and TensorFlow
☆26Nov 23, 2018Updated 7 years ago
Alternatives and similar repositories for Acoustic-Feature-Fusion_Chime18
Users that are interested in Acoustic-Feature-Fusion_Chime18 are comparing it to the libraries listed below.
- Video classification using the UCF101 dataset for action recognition. We extract SIFT, MFCC and STIP features from the videos, we encode …☆30Dec 12, 2020Updated 5 years ago
- Tensorflow code for our paper "Lightweight Feature Fusion Network for Single Image Super-Resolution" (SPL2019)☆16Jun 1, 2019Updated 7 years ago
- Codes of ICMR 2019 short paper "Weakly Supervised Image Retrieval via Coarse-scale Feature Fusion and Multi-level Attention Blocks"☆31Oct 6, 2022Updated 3 years ago
- ☆16Mar 29, 2022Updated 4 years ago
- A 2 month Ego-vision Dataset with Autographer Wearable Camera and 2 users☆11Apr 28, 2020Updated 6 years ago
- The code of "Inductive Unsupervised Domain Adaptation for Few-Shot Classification via Clustering", ECML-PKDD 2020.☆20Dec 8, 2022Updated 3 years ago
- A PyQt GUI for ESRGAN☆14May 25, 2022Updated 4 years ago
- ☆13Mar 8, 2022Updated 4 years ago
- Multimodal speech recognition using lipreading (with CNNs) and audio (using LSTMs). Sensor fusion is done with an attention network.☆69Nov 19, 2022Updated 3 years ago
- Video enhancement system leveraging ESRGAN and OpenCV to upscale frames and upscale videos with better visual quality.☆11Aug 4, 2023Updated 2 years ago
- A CNN audio classifier via spectrogram images.☆10Jul 21, 2017Updated 8 years ago
- Generating sound spectrograms using short-time Fourier transform that can be used for purposes such as sound classification by machine le…☆37May 9, 2021Updated 5 years ago
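- Simple video editing; the exe file runs as soon as it is opened. Includes video clipping, video height cropping (to remove subtitles), video concatenation, and audio separation.☆15Aug 10, 2018Updated 7 years ago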
- Zafar's Audio Functions in Python for audio signal analysis: STFT, inverse STFT, mel filterbank, mel spectrogram, MFCC, CQT kernel, CQT s…☆59Aug 8, 2025Updated 10 months ago
- Implementation of ESRGAN (ECCVW'18) for the subjective quality enhancement of compressed images.☆12Jan 22, 2022Updated 4 years ago
- Python scripts to help ACs with OpenReview☆11Feb 7, 2026Updated 4 months ago
- PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"☆10Dec 15, 2022Updated 3 years ago
- (Unofficial) Code for the paper "Certifying Some Distributional Robustness with Principled Adversarial Training"☆13May 31, 2018Updated 8 years ago
- Automatic Speech Recognition using Tensorflow☆46Aug 9, 2017Updated 8 years ago
- Google Summer of Code 2017 Project: Development of Speech Recognition Module for Red Hen Lab☆44Aug 29, 2017Updated 8 years ago
- The project aims to improve the accuracy of target recognition through multi-feature fusion. Including manual feature extraction, deep lea…☆11Feb 18, 2020Updated 6 years ago
- ☆17Jul 17, 2017Updated 8 years ago
- Save jpeg images in h5py☆13May 1, 2019Updated 7 years ago
- Classification of environmental sounds using first order statistics and GLCM (Gray-Level Co-Occurrence Matrix ) features of a spectrogram…☆25Jul 14, 2020Updated 5 years ago
- QVAC Fabric: cross-platform LLM inference and fine-tuning, optimized for edge devices and heterogeneous GPUs☆107Updated this week
- 1st place solution to the DCASE 2020 - Task 5 - Urban Sound Tagging with Spatiotemporal Context☆16Dec 8, 2022Updated 3 years ago
- Adaptive Sparse ViT☆16Aug 1, 2023Updated 2 years ago
- BiLSTM+CRF☆10Jan 15, 2019Updated 7 years ago
- A new comprehensive and diverse few-shot acoustic classification benchmark.☆65Sep 22, 2024Updated last year
- Code for EmoAudioNet, a deep neural network for speech classification (published in ICPR 2020)☆14Jul 13, 2020Updated 5 years ago
- ☆14Oct 2, 2017Updated 8 years ago
- Codes, datasets, and features for Dynamic Collaborative Filtering with Aesthetic Feature (DCFA)☆10Nov 21, 2018Updated 7 years ago
- FreiPose: A Deep Learning Framework for Precise Animal Motion Capture in 3D Spaces☆18Apr 11, 2022Updated 4 years ago
- Calculate MFCC/Fbank feature for wav files☆15Nov 21, 2017Updated 8 years ago
- ☆12Feb 23, 2021Updated 5 years ago
- Source code for paper Multi-Task Learning for Depression Detection in Dialogs (SIGDial 2022)☆12Jan 18, 2025Updated last year
- Breast tumor malignancy classification using machine learning☆14Feb 24, 2018Updated 8 years ago
- Implementation of ESRGAN using TensorFlow☆18Jan 15, 2020Updated 6 years ago
- Torch implementation of ViT based classifier for Audio classification☆12May 22, 2022Updated 4 years ago