wa3dbk / ScribeSaladLinks
A collection of YouTube videos transcripts : Podcasts (Joe Rogan Experience, Tim Ferris, Jocko podcast, ..), lectures (YaleCourses, MIT lectures, ..). A big transcripts salad spanning history, geography, science, politics, film making and more.
☆80Updated 2 months ago
Alternatives and similar repositories for ScribeSalad
Users that are interested in ScribeSalad are comparing it to the libraries listed below
Sorting:
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆26Updated 2 years ago
- Zoom Audio Transcription offline☆32Updated 4 years ago
- A free & open tool for transcribing audio interviews with offline ASR support☆24Updated last year
- Language data store and linguistic query API☆44Updated this week
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text☆36Updated 4 years ago
- TTS Client for Coqui TTS server☆13Updated 2 years ago
- 🎹 pyannote + 🗒 notebook = pyannotebook☆26Updated 2 years ago
- pronunciation dictionaries for multiple languages☆88Updated 7 years ago
- Gecko - A Tool for Effective Annotation of Human Conversations☆289Updated 2 years ago
- Python library for downloading closed captions(subtitles) from Youtube☆61Updated 2 years ago
- Generate captions for videos using the power of OpenAI's Whisper API☆45Updated 3 months ago
- Gentle and praatio scripts for easy forced alignment☆18Updated 2 years ago
- Download subreddit comments☆93Updated 3 years ago
- A curated list of awesome OpenAI's Whisper☆101Updated last year
- OCTRA is a web-application for the orthographic transcription of audio files.☆39Updated last week
- Interface for using TTS and vocoder models in the form of a text editor☆19Updated 2 years ago
- Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python☆18Updated 2 years ago
- Extracts per-sentence subtitles + audio from a subtitle file + video file.☆11Updated 5 years ago
- The EveryVoice TTS Toolkit - Text To Speech for your language☆37Updated this week
- A corpus of speech from the Joe Rogan Experience podcast, consisting of 8.43 million words. It includes aligned TextGrids with phonetic a…☆18Updated 5 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆115Updated 2 years ago
- Fastlaw's purpose is to replace generic word embeddings for work on supervised machine learning NLP-tasks with legal texts.☆38Updated 6 years ago
- A crash course for training speech recognition models using DeepSpeech.☆25Updated 4 years ago
- Google Cloud Function that takes a url, converts the article at that url to audio using Cloud Text-To-Speech, then stores it in a Cloud S…☆23Updated 7 years ago
- Open Source AI Benchmarking toolkit for benchmarking speech to text services☆55Updated last year
- Matrix-based News Aggregation to Explore Media Bias☆20Updated 7 years ago
- Convert Arpabet to IPA. Arpabet is the set of phonemes used by the CMU Pronouncing Dictionary. IPA is the International Phonetic Alphabet…☆44Updated 4 years ago
- On-device noise suppression powered by deep learning☆73Updated last week
- A complete Python text analytics package that allows users to search for a Wikipedia article, scrape it, conduct basic text analytics and…☆19Updated 2 years ago
- Labeled segmentation for the document structure of printed books☆13Updated 7 years ago