yashwordlife / SportsDataAnalysis
A Hadoop MapReduce application that retrieves sports-related data and articles from sources such as the NY Times, Common Crawl, and Twitter and creates a word cloud of the most frequently occurring words. Python scripts gather the data and process it on a Hadoop MR infrastructure. Angular with D3.js is used to create an interactive web app …
☆13Updated 6 years ago
Alternatives and similar repositories for SportsDataAnalysis
Users who are interested in SportsDataAnalysis are comparing it to the repositories listed below
- Expose a Top2Vec model with a REST API.☆92Updated 3 years ago
- Vector AI — A platform for building vector based applications. Encode, query and analyse data using vectors.☆318Updated last year
- Scraping Amazon website using Proxies for extracting Mobile details☆13Updated 6 years ago
- Semantic search through a vectorized Wikipedia (SentenceBERT) with the Weaviate vector search engine☆244Updated 2 years ago
- ☆127Updated 5 months ago
- Streamlit application to keep GPT3 Experimentation sane☆23Updated 4 years ago
- Dataset and pre-trained model for Skill2vec☆84Updated last year
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models…☆37Updated 3 years ago
- A comprehensive tool for linguistic analysis of communities☆49Updated 4 years ago
- Various Jupyter notebooks about Common Crawl data☆61Updated last month
- A Python Package which helps to scrape all news details from any news websites☆220Updated 7 months ago
- Build a small, 3 domain internet using Github pages and Wikipedia and construct a crawler to crawl, render, and index.☆74Updated 2 years ago
- Topic modeling helpers using managed language models from Cohere. Name text clusters using large GPT models.☆222Updated 3 years ago
- Semantically distinct key phrase extraction using Hilbert hashes.☆50Updated 3 years ago
- Social Media Mining Toolkit (SMMT) main repository☆136Updated 3 years ago
- Transforming textual descriptions into process models using deep learning☆15Updated 6 years ago
- A minimal proof-of-concept Python script to tweet human-curated Tweets on a schedule.☆27Updated 5 years ago
- Dash app for classifying tweets in real-time☆68Updated 2 years ago
- Voice of the Customer (VoC) to enhance customer experience with serverless architecture and sentiment analysis, using Amazon Kinesis, Ama…☆25Updated last year
- ☆57Updated 3 years ago
- Generate True or False questions from any content with OpenAI GPT2 text generation, Sentence-BERT semantic search and Berkeley constituenc…☆34Updated 5 years ago
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod…☆15Updated 2 years ago
- Dataset for Instagram Fake and Automated Account Detection☆61Updated 6 years ago
- The Selenium scraper that collected a million stories from Medium.com☆82Updated 7 years ago
- Text summarization algorithm for the Capstone Project at Springboard code bootcamp☆54Updated 2 years ago
- Quote extraction for modular journalism (JournalismAI collab 2021)☆228Updated 3 years ago
- ☆28Updated 5 years ago
- Companion Repo for the book The Applied ML Field Manual, Prithiviraj Damodaran☆12Updated 3 years ago
- Repository for Project Insight: NLP as a Service☆308Updated 2 years ago
- Public runnable examples of using John Snow Labs' OCR for Apache Spark.☆92Updated 3 weeks ago