ageitgey / all-podcasts-dataset
A free dataset of (almost) all publicly available podcasts.
☆132 Updated 10 years ago
Related projects
Alternatives and complementary repositories for all-podcasts-dataset
- High-coverage and high-precision lexica of terms annotated with emotion scores for English and Italian. ☆152 Updated 3 weeks ago
- Varied English texts for modern NLP testing ☆73 Updated 2 years ago
- A Corpus of Quotes ☆68 Updated 5 years ago
- Quill's library of open source NLP algorithms and data sets. ☆51 Updated 7 months ago
- A visualisation tool for spaCy using Hierplane. ☆65 Updated last year
- Lightning Fast Language Prediction 🚀 ☆165 Updated 5 years ago
- Meta-repository for the open-source version of the SUMMA Platform ☆16 Updated 7 months ago
- A Raspberry Pi 64-bit image with spaCy and neuralcoref pre-installed ☆21 Updated 5 years ago
- NLTK Contrib ☆166 Updated 8 months ago
- A project that keeps history of trending topics on Twitter. ☆34 Updated 7 years ago
- 🤹♀️ Query spaCy's linguistic annotations using GraphQL ☆87 Updated 6 years ago
- Character CNN model for DSL 2016 ☆16 Updated 7 years ago
- Keyword extraction system using Brown clustering (this version is trained to extract keywords from job listings) ☆18 Updated 10 years ago
- Relatively simple text classification powered by spaCy ☆42 Updated 9 years ago
- This repository contains research we conduct at Vocapouch that we want to share with the world. ☆22 Updated 7 years ago
- Twitter visualization experiment using various Python-based technologies. ☆60 Updated 8 years ago
- Python tool for text normalization and canonicalization (DISCONTINUED) ☆41 Updated 11 years ago
- Full transcripts for the Joe Rogan Experience podcast utilized in a VuePress site. ☆39 Updated 5 years ago
- All TED talks narratives extracted and cleaned. ☆97 Updated 6 years ago
- Barista is an open-source framework for concurrent speech processing. ☆36 Updated 10 years ago
- A spell-checker extending Peter Norvig's with multi-typo correction, Hamming distance weighting, and more. ☆97 Updated 4 years ago
- Node.js application to extract the knowledge represented in Google infoboxes (aka Google Knowledge Graph Panel) ☆26 Updated 7 years ago
- Similarity search on Wikipedia using gensim in Python. ☆61 Updated 5 years ago
- Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion. ☆128 Updated 3 years ago
- Python package + CLI to generate wordclouds of Twitter tweets. ☆76 Updated 4 years ago
- Textpipe: clean and extract metadata from text ☆299 Updated 3 years ago
- This is the code for "A Guide to CoreML for iOS" by Siraj Raval on YouTube ☆51 Updated 7 years ago