kohjiaxuan / Wikipedia-Article-Scraper
A complete Python text analytics package that lets users search for a Wikipedia article, scrape it, run basic text analytics, and integrate it into a data pipeline without writing excessive code.
☆19 Updated 2 years ago
Alternatives and similar repositories for Wikipedia-Article-Scraper
Users that are interested in Wikipedia-Article-Scraper are comparing it to the libraries listed below
- A simple Streamlit-based web app to process text and correct punctuation, built using the "fullstop-punctuation-multilang-large" model from Hug… ☆11 Updated last year
- Data sourcing and pre-processing for raplyrics.eu - A rap music lyrics generation project ☆65 Updated 11 months ago
- Python package for calculating famous measures in computational linguistics ☆14 Updated 7 months ago
- A Python library designed for scraping data from the SCP wiki. ☆15 Updated 4 years ago
- Reproducing the "Writing with Transformer" demo, using aitextgen/FastAPI in the backend and Quill/React in the frontend ☆28 Updated 4 years ago
- HDBSCAN Tuning for BERTopic Models ☆47 Updated 2 years ago
- An experimental implementation of Burrows' Delta in Python 3 ☆21 Updated 3 years ago
- CaseText Court Case analysis with fine-tuned BERT Transformer ☆15 Updated 4 years ago
- ☆22 Updated 3 years ago
- Use GPT-3 to brainstorm ideas for long-form fiction (novels, screenplays, etc.) ☆75 Updated 2 years ago
- Downloads and parses a subtitle dataset from opensubtitles.org ☆16 Updated last year
- Lexicons for the Multilingual UCREL Semantic Analysis System ☆42 Updated last year
- Python Multilingual Ucrel Semantic Analysis System ☆31 Updated 9 months ago
- A tool that visualizes emotional arcs of movie scripts ☆18 Updated 2 years ago
- Analysis of the Gutenberg dataset ☆44 Updated 6 years ago
- Flat files containing available context annotation entities. ☆35 Updated 2 years ago
- ☆18 Updated 10 months ago
- Searching in-memory corpus with Corpus Query Language (CQL) ☆19 Updated 6 months ago
- An NLP pipeline for Hebrew ☆38 Updated this week
- An NLP processing pipeline for characters in fanfiction. Developed by students at Carnegie Mellon University from 2019-2021. ☆33 Updated 8 months ago
- Tools for scraping YouTube video metadata (mostly for training AI on video titles) ☆41 Updated 4 years ago
- ☆56 Updated 2 years ago
- A Google Trends Analytics Package ☆13 Updated last year
- A Streamlit app to extract keywords using KeyBERT ☆37 Updated 4 years ago
- A classifier that distinguishes political from non-political news articles. ☆30 Updated last year
- This Python module can be used to obtain antonyms, synonyms, hypernyms, hyponyms, homophones and definitions. ☆124 Updated last year
- A TextTiling-based algorithm for text segmentation (aka topic segmentation) that uses neural sentence encoders, as well as extractive sum… ☆47 Updated 2 years ago
- Resources on AI applications in the music domain ☆18 Updated 3 months ago
- A collection of YouTube video transcripts: podcasts (Joe Rogan Experience, Tim Ferriss, Jocko podcast, ..), lectures (YaleCourses, MIT l… ☆80 Updated last month
- 📖 Using deep learning and scraping to analyze/summarize articles! Just drop in any URL! ☆19 Updated 2 years ago