lukereichold / SpeechTimestamper
Generate an accurate, timestamped transcript given an audio file and its text using Google Cloud's Speech-to-Text API via gRPC.
☆21 Updated 5 years ago
Alternatives and similar repositories for SpeechTimestamper
Users who are interested in SpeechTimestamper are comparing it to the libraries listed below.
- 🔮 Word-by-word audio subtitle synchronisation tool and API. Developed under GSoC 2017 with CCExtractor. ☆172 Updated 5 years ago
- CLI tool for macOS that transcribes speech to text from PCM audio data (sent via stdin) using Apple’s speech recognition API, SFSpeechRec… ☆53 Updated last year
- An SFSpeechRecognizer-based voice recordings transcriber for macOS ☆24 Updated 2 years ago
- 📈 A forced aligner intended for synchronization of narrated text ☆98 Updated 3 weeks ago
- The 134,000+ words and their pronunciations in the CMU pronouncing dictionary ☆79 Updated 4 years ago
- Extract Markers from Final Cut Pro FCPXML ☆43 Updated 2 weeks ago
- A desktop app to speed up, streamline and simplify the process of creating custom karaoke videos. ☆14 Updated 2 months ago
- British English pronunciation dictionary ☆96 Updated 7 years ago
- Xiph.org’s RNNoise neural-network-based noise removal library as an Audio Unit for macOS. ☆39 Updated 4 years ago
- Tool to create Enhanced LRC files. ☆22 Updated 12 years ago
- Karaokey is a vocal remover that automatically separates the vocals and instruments. A deep learning model based on LSTMs has been traine… ☆40 Updated 2 years ago
- Real-time background replacement on a macOS-driven webcam using the DeepLabV3 neural network for image segmentation and the native CoreM… ☆39 Updated 3 years ago
- Generate granular word-level captions in SRT format ☆57 Updated 2 years ago
- iOS application for finding formants in spoken sounds ☆61 Updated 3 months ago
- A Swift framework for working with Final Cut Pro X FCPXML files easily. ☆58 Updated 3 years ago
- Size-efficient alternative to macOS universal binaries ☆71 Updated 2 months ago
- Synchronize Whisper's timestamps over an existing accurate transcription ☆155 Updated last year
- Swift wrapper for Chromaprint, the audio fingerprint library of the AcoustID project ☆19 Updated 11 months ago
- A simple demo app for speech recognition using Objective-C ☆20 Updated 8 years ago
- Open source UTAU editing environment. ☆11 Updated last week
- Timething is a library for aligning text transcripts with their audio recordings. ☆122 Updated 9 months ago
- OpenAI's Whisper ported to CoreML ☆148 Updated 2 years ago
- An open-source CoreML model trained on the ESC10 dataset ☆26 Updated 4 years ago
- Transcribe and generate caption files (SRT and FCPXML) without manually entering time codes. ☆78 Updated 6 years ago
- 🎤 Quick library to extract pause lengths from audio files. ☆31 Updated 6 years ago
- Yet another WORLD-based UTAU resampler. ☆21 Updated last year
- Comparison between Google's MLKit text recognition and Apple's Vision text recognition ☆12 Updated 4 years ago
- Example of how to use ScreenCaptureKit to record to a file ☆59 Updated last year
- Read, write, convert and segment WebVTT caption files in Python. ☆221 Updated last year