kohjiaxuan / Wikipedia-Article-Scraper
A complete Python text analytics package that allows users to search for a Wikipedia article, scrape it, conduct basic text analytics and integrate it to a data pipeline without writing excessive code.
☆19Updated 2 years ago
Alternatives and similar repositories for Wikipedia-Article-Scraper:
Users that are interested in Wikipedia-Article-Scraper are comparing it to the libraries listed below
- Asent is a python library for performing efficient and transparent sentiment analysis using spaCy.☆118Updated 10 months ago
- ☆64Updated last year
- clustering news, extract trending news stories☆12Updated 3 years ago
- Automatic Text Summarization and Title Generation.☆25Updated 3 years ago
- This Python module can be used to obtain antonyms, synonyms, hypernyms, hyponyms, homophones and definitions.☆123Updated 9 months ago
- A project about learning how to synchronize subtitles in movies using machine learning.☆9Updated 2 years ago
- A TextTiling-based algorithm for text segmentation (aka topic segmentation) that uses neural sentence encoders, as well as extractive sum…☆44Updated 2 years ago
- Cleans Reddit Text Data☆81Updated 4 years ago
- A Google Trends Analytics Package☆13Updated 9 months ago
- The objective of this project is to scrape a corpus of news articles from a set of web pages, pre-process the corpus, and then to apply u…☆50Updated 7 years ago
- Summarize your video to any duration.☆33Updated 2 years ago
- A multithread Pushshift.io API Wrapper for reddit.com comment and submission searches.☆216Updated last year
- A simple machine learning package to cluster keywords in higher-level groups.☆16Updated 2 years ago
- Ethical, legal, and effortless extraction of Reddit data in your database☆64Updated 5 months ago
- Download subreddit comments☆93Updated 3 years ago
- ☆18Updated 7 months ago
- semantically distinct key phrase extraction using hilbert hashes.☆48Updated 3 years ago
- Text2Text Language Modeling Toolkit☆299Updated last month
- An ongoing series of notebooks aimed at helping fellow NLP enthusiasts think about applying new tools and techniques to practical tasks.☆18Updated 4 years ago
- ☆55Updated 2 years ago
- This script will dump youtube video comments to a CSV from youtube video links. Video links can be placed inside a variable or list or CS…☆41Updated 3 years ago
- A module to compute textual lexical richness (aka lexical diversity).☆101Updated last year
- A synchronous and asynchronous API wrapper for the UberDuck text-to-speech service (https://uberduck.ai) with 100% coverage and top-notch…☆23Updated last year
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆76Updated last year
- 📖 Using deep learning and scraping to analyze/summarize articles! Just drop in any URL!☆19Updated 2 years ago
- Scrape and parse Google search results in Python☆31Updated last year
- Python Multilingual Ucrel Semantic Analysis System☆31Updated 6 months ago
- simple rule based named entity recognition☆43Updated 3 years ago
- HDBSCAN Tuning for BERTopic Models☆44Updated last year
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end.☆71Updated 2 years ago