speechace / speechace-api-samples
API samples for SpeechAce
☆34Updated last year
Alternatives and similar repositories for speechace-api-samples:
Users that are interested in speechace-api-samples are comparing it to the libraries listed below
- Fetch youtube user submitted or fallback to auto-generated captions☆273Updated 11 months ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆145Updated 8 months ago
- Python script which pulls audio from mp4 video and transcribes audio using google speech and cloud storage APIs, returning an srt formatt…☆85Updated 2 years ago
- Multilingual syllable annotation pipeline component for spacy☆39Updated last year
- This repository contains a web application for multi-lingual transcription using OpenAI's Whisper Automatic Speech Recognition (ASR) mode…☆21Updated last year
- A simple streamlit based webapp to process text and correct punctuation built using "fullstop-punctuation-multilang-large" Model from Hug…☆11Updated last year
- NLP system for predicting the reading difficulty level of a text in terms of its CEFR level.☆45Updated last month
- A best practice for streaming audio from a browser microphone to Dialogflow or Google Cloud STT by using websockets.☆143Updated last month
- Javascript Text to speech library☆217Updated last year
- A python package for deep multilingual punctuation prediction.☆113Updated 5 months ago
- A rough and ready Python utility which splits audio files based on silence and desired min/max chunk duration.☆15Updated 2 years ago
- Web Speech API☆149Updated this week
- Displays text in sync with audio being played. Works with VTT files.☆42Updated 6 years ago
- ☆35Updated 2 years ago
- ☆10Updated 3 years ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆91Updated 8 months ago
- Analyzes the given text and determine what's the vocabulary level based on CEFR levels☆44Updated 2 years ago
- The CMU Pronouncing Dictionary converted to IPA☆79Updated 5 years ago
- Convert epub file to txt☆29Updated last year
- Zero-shot Audio Classification using Whisper☆77Updated 2 years ago
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆203Updated 3 months ago
- Sample app to display live captioning to a WebRTC video session with the Deepgram API.☆36Updated 3 years ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆110Updated last year
- Spoken Language assessment☆42Updated 4 years ago
- Real time web based Speech-to-Text app with Streamlit☆237Updated last year
- A collection of experiments building towards a browser powered assistant☆76Updated 4 years ago
- Experimental project to punctuate text using a embedding layer, single convolutional layer and output softmax layer written in Keras.☆83Updated 4 years ago
- A high-quality, varied ~30hr voice dataset suitable for training a TTS model☆58Updated 2 years ago
- máobĭ (毛笔) is an Anki add-on to create cards with writing quizzes for Hanzi (Chinese characters)☆52Updated 3 months ago
- OPUS-CAT is a collection of software which make it possible to OPUS-MT neural machine translation models in professional translation. OPU…☆75Updated this week