wa3dbk / ScribeSalad
A collection of YouTube videos transcripts : Podcasts (Joe Rogan Experience, Tim Ferris, Jocko podcast, ..), lectures (YaleCourses, MIT lectures, Jordan B. Peterson talks, ..). A big transcripts salad spanning history, geography, science, politics, film making and more.
☆79Updated 3 months ago
Alternatives and similar repositories for ScribeSalad:
Users that are interested in ScribeSalad are comparing it to the libraries listed below
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated last year
- A free & open tool for transcribing audio interviews with offline ASR support☆24Updated last year
- A complete Python text analytics package that allows users to search for a Wikipedia article, scrape it, conduct basic text analytics and…☆19Updated 2 years ago
- Karaoke Player / Editor with automatic clip creation from any song file using vocals and lyrics extraction (Speech-to-Text)☆67Updated last year
- Gentle and praatio scripts for easy forced alignment☆18Updated 2 years ago
- TTS Client for Coqui TTS server☆13Updated 2 years ago
- pronunciation dictionaries for multiple languages☆86Updated 7 years ago
- A corpus of speech from the Joe Rogan Experience podcast, consisting of 8.43 million words. It includes aligned TextGrids with phonetic a…☆17Updated 5 years ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆135Updated last year
- The human speaks a language with an accent. A particular accent necessarily reflects a person's linguistic background. The model defines …☆59Updated 3 years ago
- Convert Arpabet to IPA. Arpabet is the set of phonemes used by the CMU Pronouncing Dictionary. IPA is the International Phonetic Alphabet…☆43Updated 4 years ago
- web based editor for subtitles and transcripts☆122Updated 6 months ago
- Gecko - A Tool for Effective Annotation of Human Conversations☆280Updated last year
- Language data store and linguistic query API☆39Updated this week
- Script to split video files into chunks based on .srt timecodes☆31Updated 7 years ago
- 🙊 software for creating speech recognition models.☆158Updated 9 months ago
- A lightweight transcript editor for editing and correcting STT generated timed transcripts☆42Updated this week
- Convert native orthographies to the International Phonetic Alphabet☆14Updated 2 years ago
- A curated list of awesome OpenAI's Whisper☆99Updated last year
- My-Voice Analysis is a Python library for the analysis of voice (simultaneous speech, high entropy) without the need of a transcription. …☆309Updated 3 years ago
- 🐸TTS recipes for different datasets☆85Updated 2 years ago
- Zero-shot Audio Classification using Whisper☆80Updated 2 years ago
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text☆36Updated 4 years ago
- Praaline is an open-source system to manage, annotate, visualise and analyse spoken language corpora☆28Updated 2 years ago
- Zoom Audio Transcription offline☆32Updated 4 years ago
- 🎤 quick library to extract pause lengths from audio files.☆31Updated 5 years ago
- 24-hour Automatic Speech Recognition☆27Updated 3 years ago
- A Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to ones of native speech.☆244Updated 2 years ago
- ☆68Updated last year
- An even smaller speech recognizer / force aligner☆32Updated 2 months ago