Aketirani / audio-mnistView external linksLinks
Gender Recognition By Voice Analysis
☆12May 24, 2025Updated 8 months ago
Alternatives and similar repositories for audio-mnist
Users that are interested in audio-mnist are comparing it to the libraries listed below
Sorting:
- Active Noise Cancellation Using Filtered Adaptive Algorithms☆44Apr 30, 2025Updated 9 months ago
- Virtual news production using Tacotron2 and Wav2Lip☆11Nov 14, 2023Updated 2 years ago
- Automatically exported from code.google.com/p/java-card-desfire-emulation☆16Jan 11, 2016Updated 10 years ago
- BioVoice: a multipurpose tool for voice analysis☆11Nov 13, 2020Updated 5 years ago
- Mainly on text documents. Implemented a Mini Search Engine using different algorithms and then summaried documents using lexrank.☆11Jan 19, 2018Updated 8 years ago
- ☆16Jun 14, 2024Updated last year
- It is an algorithm analysed the acoustic features of a voice and creates an acoustic classifier - USEFUL for auto-speech-rater☆11Mar 8, 2019Updated 6 years ago
- Here is the repo for public scripts.☆11Jul 16, 2022Updated 3 years ago
- wav2lip-api☆11Mar 16, 2023Updated 2 years ago
- ☆10Mar 23, 2019Updated 6 years ago
- Basic 3d gltf avatar controller playground using three.js, react-three-fiber☆11May 21, 2023Updated 2 years ago
- Pepper Robot Enhanced Human Interaction☆14Dec 8, 2022Updated 3 years ago
- Project takes and FLV stream coming in from a "raw" source, and could be at any point in the "live" stream. It then saves the FLV to dis…☆13Jun 24, 2017Updated 8 years ago
- GroqSharp is a C# client library that makes it easy to interact with GroqCloud. It's designed to provide a simple and flexible interface,…☆15May 7, 2024Updated last year
- Keyphrase Generation for Scientific Document Retrieval☆11Oct 2, 2020Updated 5 years ago
- ☆15Aug 4, 2020Updated 5 years ago
- The voices of different people are tested for 20 properties. These properties include mean-frequency, standard deviation, kurtosis, skew,…☆11May 21, 2018Updated 7 years ago
- Official implementation of MICCAI2023【Knowledge Boosting: Rethinking Medical Contrastive Vision-Langauge Pre-training】☆16Mar 19, 2024Updated last year
- ☆15Dec 11, 2023Updated 2 years ago
- Implementation of the paper LIMITR: Leveraging Local Information for Medical Image-Text Representation☆17Feb 8, 2024Updated 2 years ago
- [AAAI 2024] MESED: A Multi-modal Entity Set Expansion Dataset with Fine-grained Semantic Classes and Hard Negative Entities☆16Apr 26, 2024Updated last year
- ☆24Jan 12, 2026Updated last month
- [ACL-WS] 4th place solution to gendered pronoun resolution challenge on Kaggle☆12Jan 18, 2021Updated 5 years ago
- Tools for understanding natural language robot commands☆12Feb 21, 2021Updated 4 years ago
- Improving Medical Vision-Language Contrastive Pretraining with Semantics-aware Triage☆11Jun 25, 2023Updated 2 years ago
- ☆14Mar 11, 2023Updated 2 years ago
- 华为(鸿蒙)安装谷歌gms☆18May 23, 2025Updated 8 months ago
- Matlab tools for pathological voice analysis☆13May 12, 2023Updated 2 years ago
- Source code of our EMNLP 2022 paper: Co-guiding Net: Achieving Mutual Guidances between Multiple Intent Detection and Slot Filling via He…☆12Nov 14, 2022Updated 3 years ago
- [MICCAI2022] Estimating Model Performance under Domain Shifts with Class-Specific Confidence Scores.☆12Jun 7, 2024Updated last year
- Speech-to-text application created with React Native, Flask and Whsiper from OpenAI.☆16Nov 10, 2022Updated 3 years ago
- WhisperAPI is a fast and reliable API that transcribes video and audio files into text with support for all models and languages. It offe…☆18Oct 20, 2024Updated last year
- Semantic File Inspector ‒ RDF-based metadata extraction and semantic search☆19Mar 19, 2025Updated 10 months ago
- Natural Language Understanding (maps text into intentions and arguments)☆13Jan 30, 2019Updated 7 years ago
- An audio visualizer written using soundcard, scipy and pyqtgraph. Supports internal and microphone audio. Live waveform, buffered, hannin…☆15Jul 11, 2022Updated 3 years ago
- 🎙️ Fast, installable, in-browser audio spectrum visualizer. Support for both realtime and audio files!☆19Mar 9, 2025Updated 11 months ago
- Zippy Talking Avatar uses Azure Cognitive Services and OpenAI API to generate text and speech. It is built with Next.js and Tailwind CSS.…☆16Feb 9, 2024Updated 2 years ago
- Knowledge graph based information retrieval☆13Dec 26, 2018Updated 7 years ago
- ☆18Nov 13, 2021Updated 4 years ago