For our Smart Media Player (detecting time period(s) inside audio/video during which specific person(s) is/are speaking) project
☆18Feb 25, 2020Updated 6 years ago
Alternatives and similar repositories for Smart-Media-Player
Users that are interested in Smart-Media-Player are comparing it to the libraries listed below
- Using speaker embedding for diarization in PyTorch☆17Aug 29, 2020Updated 5 years ago
- For our speech emotion recognition project☆28Mar 1, 2021Updated 5 years ago
- Speaker Diarization using GRU in PyTorch☆11Aug 29, 2020Updated 5 years ago
- Predictive modeling of users' interpersonal characteristics by the sound of their voices and manner of speaking.☆12Jun 11, 2018Updated 7 years ago
- ☆15Jan 24, 2019Updated 7 years ago
- A repository I am building as I study, after watching "Deep Learning That Reads Books Aloud" (책 읽어주는 딥러닝) made me want to make one myself.☆10Dec 8, 2022Updated 3 years ago
- ☆45Apr 5, 2019Updated 6 years ago
- Code related to my Bachelor's Thesis Project☆13Jun 17, 2016Updated 9 years ago
- PyTorch implementation of FAIR's paper "End-to-End Memory Network", NIPS 2015☆12Oct 19, 2017Updated 8 years ago
- Tutorial session material of Pytest in PyCon KR 2019☆10Apr 11, 2020Updated 5 years ago
- PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and…☆11Jun 28, 2021Updated 4 years ago
- Example python scripts to evaluate various ASR methods☆11Dec 22, 2021Updated 4 years ago
- ☆14Feb 4, 2026Updated last month
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆46Oct 3, 2023Updated 2 years ago
- PyTorch implementation of Continuous Speech Separation☆12Oct 5, 2022Updated 3 years ago
- ☆11Mar 12, 2019Updated 6 years ago
- Echo aware source separation☆13May 29, 2018Updated 7 years ago
- ☆15Jun 12, 2024Updated last year
- FEERCI: A Package for Fast non-parametric confidence intervals for Equal Error Rates☆12Mar 13, 2024Updated last year
- Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2020☆43Jul 17, 2020Updated 5 years ago
- An Online Algorithm for Constrained Face Clustering in Videos☆12Oct 7, 2018Updated 7 years ago
- Attention Backend for Automatic Speaker Verification with Multiple Enrollment Utterances☆50Oct 27, 2022Updated 3 years ago
- A deep neural network for finding text-independent speaker embedding written in tensorflow and tensorpack☆10Feb 19, 2018Updated 8 years ago
- This repository contains all resources (code, notebooks,etc) used for my Medium blog page.☆15Jan 7, 2025Updated last year
- Joe's Data Structures Library (JDL)☆13Oct 30, 2024Updated last year
- ☆13Aug 12, 2019Updated 6 years ago
- Clean python implementation of the paper "Computational Model for Linguistic Humor in Puns"☆16Feb 9, 2019Updated 7 years ago
- ☆12Feb 14, 2019Updated 7 years ago
- A small streamlit app to visualize the output of sentence clustering☆14Dec 15, 2020Updated 5 years ago
- 2019 PyCon kr tutorial: "Getting started with implementing NLP papers using Naver movie rating data"☆13Aug 21, 2019Updated 6 years ago
- [CVPR 2018] Feedback-prop: Convolutional Neural Network Inference under Partial Evidence☆13Jun 12, 2018Updated 7 years ago
- An implementation of frequency-invariant beamformer☆15Sep 3, 2021Updated 4 years ago
- [ICASSP'23] Online speaker clustering☆17Feb 22, 2026Updated 2 weeks ago
- PyTorch based speaker embedding model☆16Apr 13, 2024Updated last year
- ☆16Nov 30, 2017Updated 8 years ago
- Audio Visualizations driven by Deep Learning☆17Dec 8, 2022Updated 3 years ago
- BiLSTM-CRF model for NER☆15Jun 14, 2019Updated 6 years ago
- Model for processing text sequences with coreference annotations☆14Nov 29, 2018Updated 7 years ago
- Anonymous ICLR Submission☆14Sep 25, 2019Updated 6 years ago