gillesdemey / google-speech-v2
Reverse Engineering Google's Speech To Text API (v2)
☆469Updated 8 years ago
Alternatives and similar repositories for google-speech-v2:
Users interested in google-speech-v2 are comparing it to the libraries listed below
- JavaScript API for the Google Text-to-Speech engine☆319Updated 2 years ago
- Automatic video summaries☆264Updated 6 years ago
- Ruby Periscope API client☆139Updated 9 years ago
- ☆525Updated 2 years ago
- Microphone in the browser using WebRTC and WebSockets☆138Updated 4 years ago
- Speech recognition in JavaScript and WebAssembly☆1,502Updated 5 years ago
- Python module installed with setup.py☆337Updated 2 years ago
- A Python implementation of Amazon's Alexa Voice Service.☆45Updated 7 years ago
- Cloudy Vision is an open source tool to test the image labeling capabilities of different computer vision API vendors.☆230Updated 6 years ago
- Speech Recognition with the Caffe deep learning framework, migrating to☆325Updated 6 years ago
- A proof of concept using IBM's Speech-to-Text API to do quick-and-dirty transcriptions☆312Updated 8 years ago
- These are modifications of existing recording scripts that allow recording through websockets, not just downloading Blobs☆52Updated 10 years ago
- Python library that allows you to upload photos to instagram☆145Updated 7 years ago
- A Speaker Recognition System☆675Updated 5 years ago
- ESV Text/Audio Aligner to programmatically obtain the timings for each word in the corresponding audio☆93Updated 12 years ago
- Music Identification Program based on Shazam's methods☆110Updated 11 years ago
- Search and filter videos based on objects that appear in them using convolutional neural networks☆358Updated 8 years ago
- Read text using Google Translate TTS API☆162Updated last year
- Client code for Jasper voice computing platform☆4,541Updated last year
- Speaker recognition/identification system in Python☆75Updated 6 years ago
- Stream PCM from the browser's microphone through WebSockets to a Node server and save it to a WAV file.☆172Updated 5 years ago
- Ultrasonic Networking with the Web Audio API☆863Updated 7 years ago
- Acoustic model trainer for CMU Sphinx☆184Updated 4 months ago
- Face search engine☆199Updated 8 years ago
- Spying using Smartwatch and Deep Learning☆189Updated 7 years ago
- Tools for working with the CMU Pronunciation Dictionary☆56Updated 7 years ago
- Python client that interacts with the IBM Watson Speech To Text service through its WebSockets interface☆86Updated 5 years ago
- eSpeak NG is an open source speech synthesizer that supports 101 languages and accents.☆385Updated 5 years ago
- Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framework.☆1,079Updated 10 months ago
- Exploration of using image processing algorithms in other domains☆72Updated 11 years ago