kohjiaxuan / Wikipedia-Article-Scraper
A complete Python text analytics package that lets users search for a Wikipedia article, scrape it, run basic text analytics, and integrate it into a data pipeline without writing excessive code.
☆19Updated 2 years ago
Alternatives and similar repositories for Wikipedia-Article-Scraper:
Users who are interested in Wikipedia-Article-Scraper are comparing it to the libraries listed below.
- This Python module can be used to obtain antonyms, synonyms, hypernyms, hyponyms, homophones and definitions.☆123Updated 10 months ago
- Lyric Generation using AI☆12Updated 5 years ago
- A simple Streamlit-based web app to process text and correct punctuation, built using the "fullstop-punctuation-multilang-large" model from Hug…☆11Updated last year
- TTS Client for Coqui TTS server☆13Updated 2 years ago
- Hum2Song: Multi-track Polyphonic Music Generation from Voice Melody Transcription with Neural Networks☆126Updated 2 years ago
- A Google Trends Analytics Package☆13Updated 10 months ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated 2 years ago
- Complete Web Scraping of TED.com for Metadata, Transcript, Audio, Video, Images using Parallel Programming☆11Updated 4 years ago
- Reproducing "Writing with Transformer" demo, using aitextgen/FastAPI in backend, Quill/React in frontend☆28Updated 4 years ago
- Collects a multimodal dataset of Wikipedia articles and their images☆15Updated 2 years ago
- Predicting Billboard's Year-End Hot 100 Songs using audio features from Spotify and lyrics from Musixmatch☆16Updated 9 months ago
- Conditional lyrics generator -> pre-trained GPT2 model fine-tuned on lyrics with features dataset.☆40Updated 5 years ago
- HDBSCAN Tuning for BERTopic Models☆45Updated last year
- Semantically distinct key phrase extraction using Hilbert hashes.☆48Updated 3 years ago
- ☆55Updated 2 years ago
- Download subreddit comments☆94Updated 3 years ago
- Storyfinder - A Browser Plugin and Server Backend for Personalized Knowledge- and Information Management☆16Updated last year
- Many natural language processing tasks rely on sentence boundary detection (SBD). Although amazing libraries like spaCy provide state of …☆61Updated 4 years ago
- A Python library designed for scraping data from the SCP wiki.☆15Updated 4 years ago
- A collection of preprocessed datasets and pretrained models for generating paraphrases.☆29Updated 3 years ago
- Automated paraphrase generation☆36Updated 2 years ago
- Data sourcing and pre-processing for raplyrics.eu - A rap music lyrics generation project☆63Updated 9 months ago
- Measure the readability of a given text using surface characteristics☆78Updated 2 months ago
- Cleans Reddit Text Data☆81Updated 5 years ago
- Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al.☆96Updated last year
- Clusters news and extracts trending news stories☆12Updated 3 years ago
- Automatic Text Summarization and Title Generation.☆25Updated 3 years ago
- An ongoing series of notebooks aimed at helping fellow NLP enthusiasts think about applying new tools and techniques to practical tasks.☆18Updated 4 years ago
- This repository is designed for students in DIGI405 at the University of Canterbury to do topic modeling through their browser using Goog…☆18Updated 3 years ago
- GPT2Explorer brings an OpenAI GPT-2 language model playground that runs locally on standard Windows computers.☆29Updated 2 years ago