wa3dbk / ScribeSalad
A collection of YouTube videos transcripts : Podcasts (Joe Rogan Experience, Tim Ferris, Jocko podcast, ..), lectures (YaleCourses, MIT lectures, ..). A big transcripts salad spanning history, geography, science, politics, film making and more.
☆80Updated last month
Alternatives and similar repositories for ScribeSalad
Users that are interested in ScribeSalad are comparing it to the libraries listed below
Sorting:
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated 2 years ago
- A free & open tool for transcribing audio interviews with offline ASR support☆24Updated last year
- TTS Client for Coqui TTS server☆13Updated 2 years ago
- Zoom Audio Transcription offline☆32Updated 4 years ago
- A crash course for training speech recognition models using DeepSpeech.☆25Updated 4 years ago
- Matrix-based News Aggregation to Explore Media Bias☆20Updated 6 years ago
- Audio Book scrapper☆26Updated last year
- A curated list of awesome OpenAI's Whisper☆101Updated last year
- Extending conceptual thinking with semantic embeddings.☆36Updated 3 years ago
- Python script which pulls audio from mp4 video and transcribes audio using google speech and cloud storage APIs, returning an srt formatt…☆86Updated 2 years ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated last year
- A complete Python text analytics package that allows users to search for a Wikipedia article, scrape it, conduct basic text analytics and…☆19Updated 2 years ago
- Gentle and praatio scripts for easy forced alignment☆18Updated 2 years ago
- Quantified Self: A Personal Data Aggregator and Dashboard for Self-Trackers and Quantified Self Enthusiasts☆18Updated last year
- generate granular word-level captions in srt format☆57Updated 2 years ago
- Just an .exe that can be used for those unable to build whisper.cpp in Windows.☆42Updated 2 years ago
- ☆14Updated 2 years ago
- Turn a doc into plaintext which you can listen to using TTS☆19Updated 2 years ago
- Gecko - A Tool for Effective Annotation of Human Conversations☆284Updated 2 years ago
- pronunciation dictionaries for multiple languages☆87Updated 7 years ago
- Google Cloud Function that takes a url, converts the article at that url to audio using Cloud Text-To-Speech, then stores it in a Cloud S…☆23Updated 7 years ago
- 🐸TTS recipes for different datasets☆87Updated 2 years ago
- Extracts per-sentence subtitles + audio from a subtitle file + video file.☆11Updated 5 years ago
- A collection of useful tools for handling speech recognition data☆30Updated 2 years ago
- Find rss, atom, xml, and rdf feeds on webpages☆30Updated 7 months ago
- ☆17Updated 3 months ago
- An even smaller speech recognizer / force aligner☆32Updated 5 months ago
- Language data store and linguistic query API☆39Updated this week
- 📈 A forced aligner intended for synchronization of narrated text☆93Updated 2 years ago
- Full transcripts for the Joe Rogan Experience podcast utilized in a VuePress site.☆44Updated 5 years ago