lukereichold / SpeechTimestamper

Generate an accurate, timestamped transcript given an audio file and its text using Google Cloud's Speech-to-Text API via gRPC.

☆21

Alternatives and similar repositories for SpeechTimestamper

Users who are interested in SpeechTimestamper are comparing it to the repositories listed below.

imdatceleste / katip
An SFSpeechRecognizer-based voice recordings transcriber for macOS
☆24 Updated 2 years ago
KaddaOK / KaddaOKTools
A desktop app to speed up, streamline and simplify the process of creating custom karaoke videos.
☆14 Updated 10 months ago
dtinth / transcribe
CLI tool for macOS that transcribes speech to text from PCM audio data (sent via stdin) using Apple’s speech recognition API, SFSpeechRec…
☆52 Updated last year
reuelk / pipeline
A Swift framework for working with Final Cut Pro X FCPXML files easily.
☆58 Updated 3 years ago
tutsplus / use-the-speech-recognition-api-in-ios-10
The companion project for the Tuts+ tutorial "Using the Speech Recognition API in iOS 10".
☆11 Updated 8 years ago
johnafish / whisperer
Generate granular word-level captions in SRT format
☆57 Updated 2 years ago
py-lidbox / lidbox
End-to-end spoken language identification out of the box.
☆48 Updated 4 years ago
ApayRus / frazy
Educational player with phrasal playback and parallel multi-language subtitles. Online subtitles/captions editor.
☆19 Updated last year
williamleuschner / RNNoise-For-Mac
Xiph.org’s RNNoise neural-network-based noise removal library as an Audio Unit for macOS.
☆39 Updated 4 years ago
Zeta611 / Video-Converter
A simple video converter for Mac
☆20 Updated 3 years ago
1Conan / nodejs-mobile
Full-fledged Node.js on iOS
☆23 Updated last year
hansemannn / SpeechRecognitionExample
An example of how to use the new iOS 10 API "SFSpeechRecognizer" in Swift
☆7 Updated 8 years ago
wallisch / ChromaSwift
Swift wrapper for Chromaprint, the audio fingerprint library of the AcoustID project
☆17 Updated 8 months ago
jumon / whisper-punctuator
Zero-shot multimodal punctuation insertion and truecasing using Whisper
☆114 Updated 2 years ago
aheze / QuickOCR
Drag-and-drop to find text. A work in progress.
☆13 Updated 2 years ago
albinoz / ffmpeg-static-OSX
macOS Build Last Static ffmpeg
☆52 Updated last year
rupakvignesh / Lyrics-to-Audio-Alignment
Aligns text (lyrics) with monophonic singing voice (audio). The algorithm uses structural segmentation to segment the audio into structur…
☆92 Updated 7 years ago
ookamitai / Vocalist
An OREMO replacement for macOS, completely written in Swift
☆15 Updated last year
LumingYin / QuickCaption
Transcribe and generate caption files (SRT and FCPXML) without manually entering time codes.
☆77 Updated 6 years ago
oatsu-gh / SimpleEnunu
Another ENUNU for enthusiasts and developers, easy to catch up with NNSVS
☆13 Updated last month
UtaUtaUtau / straycat
Yet another WORLD-based UTAU resampler.
☆21 Updated 10 months ago
mtynior / NikeClockIcon
Custom macOS Dock Icon with clock inspired by Nike Watch Face
☆33 Updated 3 years ago
mozilla / DSAlign
DeepSpeech based forced alignment tool
☆237 Updated 4 years ago
starzia / ClapIR
iOS app for measuring room acoustics with hand claps
☆45 Updated last year
nahuelproietto / phoneme-recognition-ios
Phoneme recognition using MFCC feature extraction and DTW analysis
☆15 Updated 5 years ago
viktorkalyniuk / LiTranslate-iOS
Open Source iOS Translator.
☆30 Updated last year
EtienneAb3d / WhisperTimeSync
Synchronize Whisper's timestamps over an existing accurate transcription
☆152 Updated last year
nonstrict-hq / ScreenCaptureKit-Recording-example
Example of how to use ScreenCaptureKit to record to a file
☆52 Updated last year
glut23 / webvtt-py
Read, write, convert and segment WebVTT caption files in Python.
☆214 Updated 11 months ago
pock / pockkit
Core framework for building Pock widgets
☆26 Updated 3 years ago