For our Smart Media Player project, which detects the time periods in audio/video during which specific people are speaking
☆18Feb 25, 2020Updated 6 years ago
Alternatives and similar repositories for Smart-Media-Player
Users that are interested in Smart-Media-Player are comparing it to the libraries listed below.
- For our speech emotion recognition project☆28Mar 1, 2021Updated 5 years ago
- Using speaker embedding for diarization in PyTorch☆17Aug 29, 2020Updated 5 years ago
- Here we utilize the OpenCV libraries and apply the Histograms of Oriented Gradients (HOG) algorithm to create a computer vision applicati…☆18Jan 3, 2023Updated 3 years ago
- PyTorch implementation of FAIR's paper "End-to-End Memory Network", NIPS 2015☆12Oct 19, 2017Updated 8 years ago
- Predictive modeling of users' interpersonal characteristics by the sound of their voices and manner of speaking.☆12Jun 11, 2018Updated 7 years ago
- This repository created for the NHN ASR hackathon competition.☆11Sep 20, 2023Updated 2 years ago
- This repo contains training material, a dlib implementation, a TensorFlow implementation and a self-made complete system implementation with…☆16Mar 25, 2023Updated 3 years ago
- ☆45Apr 5, 2019Updated 6 years ago
- Tutorial session material of Pytest in PyCon KR 2019☆10Apr 11, 2020Updated 5 years ago
- ☆15Jan 24, 2019Updated 7 years ago
- A repository I am building while studying, after seeing "Deep Learning That Reads Books Aloud" (책 읽어주는 딥러닝) made me want to make one myself.☆10Dec 8, 2022Updated 3 years ago
- Example python scripts to evaluate various ASR methods☆11Dec 22, 2021Updated 4 years ago
- A deep neural network for finding text-independent speaker embedding written in tensorflow and tensorpack☆10Feb 19, 2018Updated 8 years ago
- ☆11Mar 12, 2019Updated 7 years ago
- Detection of emotion in Speech Using Convolution Neural Network☆20Mar 14, 2020Updated 6 years ago
- A Stress Annotated Dataset for Recognizing Everyday Stressors in SMS-like Conversational Systems☆14Apr 22, 2021Updated 4 years ago
- FEERCI: A Package for Fast non-parametric confidence intervals for Equal Error Rates☆12Mar 13, 2024Updated 2 years ago
- Sentiment Analysis using logistic regression☆16Apr 19, 2014Updated 11 years ago
- tinyml library for Arduino☆16Aug 17, 2021Updated 4 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆46Oct 3, 2023Updated 2 years ago
- PyTorch based speaker embedding model☆16Apr 13, 2024Updated last year
- C++ AI library☆19Sep 13, 2022Updated 3 years ago
- [ICASSP'23] Online speaker clustering☆17Feb 22, 2026Updated last month
- Naver sentiment movie corpus classification☆17Oct 12, 2021Updated 4 years ago
- Efficient Speech Processing Toolkit for Automatic Speaker Recognition☆17Feb 8, 2023Updated 3 years ago
- Audio Visualizations driven by Deep Learning☆17Dec 8, 2022Updated 3 years ago
- Open Source OpenVINO Edge development and deployment on Google Colab using OpenDevNotebooks☆14Feb 12, 2020Updated 6 years ago
- Python code to estimate depth using stereo vision.☆15Jan 12, 2022Updated 4 years ago
- Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2020☆43Jul 17, 2020Updated 5 years ago
- ☆16Nov 30, 2017Updated 8 years ago
- MSP430G2xx3 flashing tool supporting Linux, Windows, and Mac OS.☆20Mar 29, 2021Updated 5 years ago
- Gpu miner in development☆13Jun 7, 2018Updated 7 years ago
- Attention Backend for Automatic Speaker Verification with Multiple Enrollment Utterances☆50Oct 27, 2022Updated 3 years ago
- End-to-end speech-to-speech translation pipeline with voice cloning (RVC) and automatic lip-sync (Wav2Lip).☆26Updated this week
- PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and…☆11Jun 28, 2021Updated 4 years ago
- This is a jupyter notebook with 8 different solutions for common problems of digital image processing, including object recognition and b…☆17May 29, 2017Updated 8 years ago
- Code for my masters thesis☆18Jul 6, 2023Updated 2 years ago
- The History of Speech Recognition to the Year 2030☆13Aug 14, 2021Updated 4 years ago
- Code for AccentDB.☆23May 28, 2021Updated 4 years ago