yashwordlife / SportsDataAnalysis
a Hadoop Map Reduce application that retrieves data/articles related to sports from sources like NY Times, Commoncrawl, and Twitter and creates a word cloud of most frequently occurring words. Python scripts are developed for gathering data and processing on a Hadoop MR infrastructure. Angular with D3.js is used to create an interactive web app …
☆12Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for SportsDataAnalysis
- Run streamlit web application, test and deploy to a cloud service (GCP, AWS, Heroku)☆14Updated 2 years ago
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆42Updated 5 years ago
- ☆16Updated 3 years ago
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod…☆15Updated last year
- ☆11Updated 4 years ago
- DrFAQ is a plug-and-play question answering NLP chatbot that can be generally applied to any organisation's text corpora.☆29Updated 2 years ago
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models…☆37Updated 2 years ago
- Streamlit application to keep GPT3 Experimentation sane☆23Updated 3 years ago
- AYLIEN's officially supported Python client library for accessing News API☆18Updated 2 years ago
- Expose a Top2Vec model with a REST API.☆88Updated last year
- News API - fetch news from CommonCrawl, parse with NewsPlease, enrich with pre-trained machine-learning models, to structured searchable …☆28Updated 2 years ago
- A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by the The Incorporated…☆25Updated 2 years ago
- Topic Inference with Zeroshot models☆61Updated last year
- Social Media Mining Toolkit (SMMT) main repository☆133Updated 2 years ago
- NLP: An Approach to Automatic Trending Tweet Summarization. Summaries will greatly help the user in understanding “why the topic is trend…☆15Updated 8 years ago
- ☆16Updated last year
- ☆28Updated 4 years ago
- Exploration of Health-Related Tweets through Topic Modeling & Sentiment Analysis☆20Updated 7 months ago
- Package that returns a company embedding given a company name☆42Updated 4 years ago
- The NewSHead dataset is a multi-doc headline dataset used in NHNet for training a headline summarization model.☆36Updated 2 years ago
- 🔬 Sharing your data science notebooks with the community has never been this easy.☆38Updated 2 years ago
- code and supplementary materials for a series of Medium articles about the BERT model☆77Updated last year
- ☆13Updated last year
- Streamlit-based Web App for Ai Text Generation based on GPT-2 Models from HuggingFace Model Hub using Python library aitextgen☆26Updated 3 years ago
- Companion Repo for the book The Applied ML Field Manual, Prithiviraj Damodaran☆12Updated 2 years ago
- Production Machine Learning Pipeline for Text Classification with fastText☆32Updated 3 years ago
- Dataset for Intagram Fake and Automated Account Detection☆50Updated 5 years ago
- semantically distinct key phrase extraction using hilbert hashes.☆48Updated 2 years ago
- Document Search Engine Tool☆71Updated last year