kohjiaxuan / Wikipedia-Article-Scraper
A complete Python text analytics package that allows users to search for a Wikipedia article, scrape it, conduct basic text analytics and integrate it to a data pipeline without writing excessive code.
☆19Updated 2 years ago
Alternatives and similar repositories for Wikipedia-Article-Scraper:
Users that are interested in Wikipedia-Article-Scraper are comparing it to the libraries listed below
- A simple streamlit based webapp to process text and correct punctuation built using "fullstop-punctuation-multilang-large" Model from Hug…☆11Updated last year
- Lyric Generation using AI☆12Updated 5 years ago
- Matrix-based News Aggregation to Explore Media Bias☆20Updated 6 years ago
- TTS Client for Coqui TTS server☆13Updated 2 years ago
- Tag news stories based on models trained on the NYT corpus.☆42Updated last year
- A classifier that distinguishes political from non-political news articles.☆29Updated last year
- Scripts for building a geo-located web corpus using Common Crawl data☆11Updated 2 months ago
- Leverage the power of the Google Natural Language API NLP to retrieve entity relationships from Wikipedia URLs or topics! Get interactive…☆14Updated 3 years ago
- SentimentArcs: a large ensemble of dozens of sentiment analysis models to analyze emotion in text over time☆39Updated last year
- Extracts per-sentence subtitles + audio from a subtitle file + video file.☆11Updated 5 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated last year
- Python Script for Copywriters to Gather Data from Competing Content and Find Keyword Overlap☆12Updated 2 years ago
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text☆36Updated 4 years ago
- Translation demonstrator☆29Updated 4 years ago
- A collection of preprocessed datasets and pretrained models for generating paraphrases.☆29Updated 3 years ago
- Gentle and praatio scripts for easy forced alignment☆18Updated 2 years ago
- Discourse Analysis Tool Suite☆18Updated this week
- Scrape and parse Google search results in Python☆32Updated last year
- CodeSwitch is a NLP tool, can use for language identification, pos tagging, name entity recognition, sentiment analysis of code mixed dat…☆33Updated 4 years ago
- This Python module can be used to obtain antonyms, synonyms, hypernyms, hyponyms, homophones and definitions.☆121Updated 8 months ago
- an experimental implementation of Burrow's delta in Python 3☆20Updated 3 years ago
- A project about learning how to synchronize subtitles in movies using machine learning.☆9Updated last year
- ☆15Updated last year
- Download subreddit comments☆93Updated 2 years ago
- python package for calculating famous measures in computational linguistics☆13Updated 2 months ago
- An online NLP tool/application which will correct grammar mistakes (like Grammarly) and will also rewrite the sentences in a different fo…☆10Updated 5 years ago
- Python wrapper library for the Datamuse API☆77Updated last year
- Complete Web Scraping of TED.com for Metadata, Transcript, Audio, Video, Images using Parallel Programming☆11Updated 4 years ago
- A collection of YouTube videos transcripts : Podcasts (Joe Rogan Experience, Tim Ferris, Jocko podcast, ..), lectures (YaleCourses, MIT l…☆78Updated 2 months ago
- Generate product descriptions, blogs, ads and more using GPT architecture with a single request to TextCortex API a.k.a Hemingwai☆40Updated 2 years ago