wa3dbk / ScribeSalad
A collection of YouTube video transcripts: podcasts (Joe Rogan Experience, Tim Ferriss, Jocko Podcast, ...), lectures (YaleCourses, MIT lectures, Jordan B. Peterson talks, ...). A big transcript salad spanning history, geography, science, politics, filmmaking, and more.
☆78, updated last month
Alternatives and similar repositories for ScribeSalad:
Users who are interested in ScribeSalad are comparing it to the repositories listed below.
- A forced aligner intended for synchronization of narrated text (☆87, updated 2 years ago)
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo (☆25, updated last year)
- Gecko - A Tool for Effective Annotation of Human Conversations (☆279, updated last year)
- A free & open tool for transcribing audio interviews with offline ASR support (☆24, updated last year)
- (☆13, updated last year)
- Extracts per-sentence subtitles + audio from a subtitle file + video file. (☆11, updated 5 years ago)
- Audiobook scraper (☆27, updated 9 months ago)
- TTS Client for Coqui TTS server (☆13, updated 2 years ago)
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper. (☆135, updated last year)
- A crash course for training speech recognition models using DeepSpeech. (☆24, updated 3 years ago)
- Whisper2Summarize is an application that uses Whisper for audio processing and GPT for summarization. It generates summaries of audio tra… (☆51, updated last year)
- A corpus of speech from the Joe Rogan Experience podcast, consisting of 8.43 million words. It includes aligned TextGrids with phonetic a… (☆17, updated 4 years ago)
- Language data store and linguistic query API (☆39, updated this week)
- Quick library to extract pause lengths from audio files. (☆31, updated 5 years ago)
- The main repo for Stage Whisper, a free, secure, and easy-to-use transcription app for journalists, powered by OpenAI's Whisper automati… (☆254, updated last year)
- Just an .exe that can be used for those unable to build whisper.cpp in Windows. (☆39, updated 2 years ago)
- Data for the International Phonetic Alphabet (IPA) (☆27, updated 2 years ago)
- Karaoke Player / Editor with automatic clip creation from any song file using vocals and lyrics extraction (Speech-to-Text) (☆66, updated last year)
- SpeCT - Speech Corpus Toolkit for Praat. Documentation: https://lennes.github.io/spect/ (☆57, updated last year)
- A curated list of awesome OpenAI's Whisper (☆96, updated last year)
- Hyperaudio Lite - a Super-lightweight Interactive Transcript Player (☆132, updated last month)
- Python 3.6+ port of aeneas (☆14, updated 3 years ago)
- Zoom Audio Transcription offline (☆32, updated 4 years ago)
- Gentle and praatio scripts for easy forced alignment (☆18, updated 2 years ago)
- Audiobook alignment for Indigenous languages (☆38, updated last month)
- A complete Python text analytics package that allows users to search for a Wikipedia article, scrape it, conduct basic text analytics and… (☆19, updated 2 years ago)
- Convert native orthographies to the International Phonetic Alphabet (☆13, updated 2 years ago)
- Scripts for building a geo-located web corpus using Common Crawl data (☆11, updated 2 months ago)
- Software for creating speech recognition models. (☆154, updated 7 months ago)
- Script to split video files into chunks based on .srt timecodes (☆31, updated 7 years ago)