zouxinghao/MRCG

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zouxinghao/MRCG)

zouxinghao / MRCG

a optional way to extract audio feature

☆14

Alternatives and similar repositories for MRCG

Users that are interested in MRCG are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

MoongMoong / MRCG_python
View on GitHub
☆10Mar 21, 2018Updated 8 years ago
jtkim-kaist / end-point-detection
View on GitHub
☆10Sep 19, 2018Updated 7 years ago
nuaalixu / pyResults
View on GitHub
A tool for calculating WER (Word Error Rate) in python.
☆14Sep 18, 2024Updated last year
IMLHF / SE_DCUNet
View on GitHub
Deep Complex UNet for speech enhancement, init from "https://github.com/chanil1218/DCUnet.pytorch"
☆13Feb 21, 2020Updated 6 years ago
jymsuper / VAD_tutorial
View on GitHub
Simple DNN based Voice Activity Detection (VAD) using Pytorch
☆43Feb 8, 2020Updated 6 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
HawkAaron / mxnet-transducer
View on GitHub
Fast parallel RNN-Transducer.
☆10Nov 1, 2019Updated 6 years ago
Cocoxili / VAD
View on GitHub
Voice Activity Detection
☆29Nov 13, 2017Updated 8 years ago
roadfoodr / 6.419x_report_template
View on GitHub
Auto-generation of templates for written reports for 6.419x Data Analysis: Statistical Modeling and Computation in Applications
☆11Jun 6, 2022Updated 4 years ago
taishan1994 / pytorch_bert_coreference_resolution
View on GitHub
基于pytorch+bert的指代消解
☆14Sep 16, 2021Updated 4 years ago
skakouros / s3prl_attentive_correlation
View on GitHub
Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit
☆13Nov 18, 2022Updated 3 years ago
chanil1218 / Attention-SE.pytorch
View on GitHub
An Attention-based Neural Network Approach for Single Channel Speech Enhancement
☆25Dec 1, 2019Updated 6 years ago
tuanio / nextformer
View on GitHub
PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"
☆10Dec 15, 2022Updated 3 years ago
gmishne / DiffusionNet
View on GitHub
Diffusion Net TensorFlow implementation
☆10Nov 10, 2017Updated 8 years ago
jtkim-kaist / VAD
View on GitHub
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
☆869Jun 9, 2021Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ysbsb / awesome-quantization
View on GitHub
Awesome Quantization Paper lists with Codes
☆10Feb 24, 2021Updated 5 years ago
michellecohn / praat-scripts
View on GitHub
These are various scripts to manipulate and/or measure the acoustic properties of speech sounds
☆15Oct 18, 2024Updated last year
funcwj / pydecoder
View on GitHub
A python wrapper for kaldi-online-decoder using Cython
☆12Sep 1, 2017Updated 8 years ago
GeWanying / shap-anti-spoofing
View on GitHub
This repository includes the code to reproduce our paper [Explainable deepfake and spoofing detection: an attack analysis using SHapley A…
☆12Jan 24, 2024Updated 2 years ago
Yifei-ZHAO96 / STAM-pytorch
View on GitHub
Pytorch implementation of "spectro-temporal attention-based voice activity detection"
☆13Jun 4, 2024Updated 2 years ago
erasedwalt / CTC-ASR
View on GitHub
An implementation of Jasper, QuartzNet, Citrinet and pipeline for training CTC-based ASR models
☆12Nov 13, 2021Updated 4 years ago
SJTMusicTeam / MusicGeneration
View on GitHub
☆10May 15, 2021Updated 5 years ago
zhitao-wang / Hierarchical-Diffusion-Attention-Network
View on GitHub
Code for "Hierarchical Diffusion Attention Network" (IJCAI 2019)
☆14Apr 23, 2020Updated 6 years ago
EdwinYam / J-Net
View on GitHub
J-Net is aimed for audio separation with randomly weighted encoder.
☆12Oct 23, 2019Updated 6 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
kamya-ai / Realtime-speech-detection
View on GitHub
Welcome to the Real-Time Voice Activity Detection (VAD) program, powered by Silero-VAD model! 🚀 This program allows you to perform live …
☆12Jul 9, 2023Updated 3 years ago
noiseux1523 / NIST-SRE-2019
View on GitHub
Score Normalization for NIST 2019 Speaker Recognition Evaluation
☆10Nov 8, 2019Updated 6 years ago
david-gimeno / tailored-avsr
View on GitHub
Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers"
☆15Feb 24, 2025Updated last year
ksergiou / Time-Series-Forecasting
View on GitHub
Exploratory notebook . Techniques used: FFT, ARIMA, GARCH, Monte Carlo Simulations, fbprophet, LSTM, WaveNet.
☆12Jul 11, 2022Updated 4 years ago
yoyolicoris / kazane
View on GitHub
Simple sinc interpolation in PyTorch.
☆15Jul 8, 2023Updated 3 years ago
fakufaku / separake
View on GitHub
Echo aware source separation
☆13May 29, 2018Updated 8 years ago
Erutan-pku / DNN_TA_PKU
View on GitHub
北京大学深度学习的技术与应用课程Projects
☆13May 3, 2017Updated 9 years ago
opheadacheh / Multi-view-neural-acoustic-words-embeddings
View on GitHub
☆27Apr 21, 2017Updated 9 years ago
Sreyan88 / RECAP
View on GitHub
Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning
☆16Jun 23, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
andabi / voice-disciminator
View on GitHub
A neural network for filtering target speaker's voice from audio written in tensorflow
☆21Jun 21, 2018Updated 8 years ago
RicherMans / SpokenLanguageClassifiers
View on GitHub
Pretrained spoken language classifiers from audio.
☆10Jan 21, 2021Updated 5 years ago
arief25ramadhan / sound-source-localization
View on GitHub
Four neural network architectures to classify sound source direction
☆11Oct 3, 2020Updated 5 years ago
zekarias-tilahun / graph-surgeon
View on GitHub
A PyTorch implementation of "Self-Supervised GNN that Jointly Learns to Augment" or "Jointly Learnable Data Augmentations for Self-Superv…
☆13Dec 13, 2021Updated 4 years ago
keonlee9420 / Deep-Learning-TTS-Template
View on GitHub
This is a template for the Non-autoregressive Deep Learning-Based TTS model (in PyTorch).
☆14Jun 15, 2021Updated 5 years ago
yongxuUSTC / DNN-for-speech-enhancement
View on GitHub
DNN-for-speech-enhancement
☆176Feb 23, 2023Updated 3 years ago
Vaibhavs10 / dcase-2023-workshop
View on GitHub
☆14Sep 20, 2023Updated 2 years ago