muskanvk / Speech-to-Text
Speech Recognition in python
☆10Updated 6 years ago
Alternatives and similar repositories for Speech-to-Text:
Users that are interested in Speech-to-Text are comparing it to the libraries listed below
- A set of examples for basic audio data handling☆13Updated 4 years ago
- Facial Landmark Detection using OpenCV and Mediapipe☆11Updated 2 years ago
- Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python☆18Updated last year
- end-to-end voicebot that answers open domain questions.☆10Updated 3 years ago
- Final project with multiple modules☆16Updated 4 years ago
- Handwritten Number Recognition using CNN and Character Segmentation☆18Updated 6 years ago
- OSINT tool for Instagram☆14Updated 6 years ago
- Replication materials for "Identifying the Development and Application of Artificial Intelligence in Scientific Text"☆12Updated 5 years ago
- SpeechYOLO Interspeech 2019☆43Updated 2 years ago
- ☆49Updated 2 years ago
- Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".☆27Updated 3 years ago
- Collect the Best Papers from the Top Conferences, also including statistics and visualization keywords of accepted papers from Top Confer…☆16Updated 4 years ago
- A proof-of-concept for using WebSockets to send real-time webcam data to a client. Runs at ~0.1s latency.☆15Updated 6 years ago
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆30Updated last year
- Keras(Tensorflow) implementations of Automatic Speech Recognition☆23Updated 3 years ago
- Avalinguo Audio Dataset: Dataset for Speaker Fluency Level Classification☆11Updated 6 years ago
- 📊 Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes).☆29Updated 8 months ago
- A repo listing known open source voice tools, ordered by where they sit in the voice stack☆26Updated 2 years ago
- Model for Monocular Depth Estimation and Image Segmentation☆13Updated 3 years ago
- This project shows how to train a language-recognizer from scratch that is able to distinguish between German and English, robustly.☆12Updated 4 years ago
- 🦁 Nala is an agile open-source voice assistant framework (20+ actions).☆35Updated last year
- Media Forensics one-shot protocol, service wrapper, and basic client tools.☆13Updated 2 years ago
- CNN multi-label image classifier 🖼️.☆21Updated 4 years ago
- Deep Learning Compression and Acceleration SDK -- deep model compression for Edge and IoT embedded systems, and deep model acceleration f…☆20Updated 6 years ago
- Real time monitoring of highways☆19Updated 8 years ago
- Bob is a free signal-processing and machine learning toolbox originally developed by the Biometrics group at Idiap Research Institute, in…☆47Updated last year
- Real-time Speech Separation, Noise Suppression & Speaker Recognition☆18Updated 5 years ago
- Audio command recognition by DTW and classification☆7Updated 4 years ago
- 👤 Human Face and 🎥 Object Detection using OpenCV☆13Updated last year
- Get an OpenCV video capture from an YouTube video URL☆24Updated 6 months ago