yashwordlife / SportsDataAnalysis
A Hadoop MapReduce application that retrieves sports-related data and articles from sources such as the NY Times, Common Crawl, and Twitter, and creates a word cloud of the most frequently occurring words. Python scripts gather the data and process it on a Hadoop MR infrastructure. Angular with D3.js is used to create an interactive web app …
☆13 Updated 5 years ago
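The word-frequency step described above can be illustrated with a minimal pure-Python sketch of the map and reduce phases. This is not the repository's code: the function names `mapper`, `reducer`, and `top_words`, the tokenizing regex, and the `STOPWORDS` set are all assumptions made for illustration, and the sketch runs in a single process rather than on a Hadoop cluster.

```python
import re
from itertools import groupby
from operator import itemgetter

# Hypothetical stopword list; a real pipeline would likely use a larger one.
STOPWORDS = {"the", "a", "an", "and", "of", "to", "in", "is"}


def mapper(line):
    """Map phase: emit (word, 1) for every non-stopword token in a line."""
    for word in re.findall(r"[a-z']+", line.lower()):
        if word not in STOPWORDS:
            yield word, 1


def reducer(pairs):
    """Reduce phase: sort pairs by word (the 'shuffle'), then sum counts per word."""
    ordered = sorted(pairs, key=itemgetter(0))
    for word, group in groupby(ordered, key=itemgetter(0)):
        yield word, sum(count for _, count in group)


def top_words(lines, n=10):
    """Run map then reduce in-process and return the n most frequent words."""
    pairs = [pair for line in lines for pair in mapper(line)]
    counts = dict(reducer(pairs))
    # Sort by descending count, breaking ties alphabetically.
    return sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))[:n]


if __name__ == "__main__":
    sample = [
        "The Lakers beat the Celtics",
        "Celtics and Lakers play again",
        "Lakers win",
    ]
    print(top_words(sample, 2))
```

The resulting (word, count) pairs are the kind of input a word-cloud renderer would consume, with the count driving the font size of each word.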
Alternatives and similar repositories for SportsDataAnalysis
Users who are interested in SportsDataAnalysis compare it to the repositories listed below.
- Semantic search through a vectorized Wikipedia (SentenceBERT) with the Weaviate vector search engine ☆242 Updated 2 years ago
- Streamlit application to keep GPT3 Experimentation sane ☆23 Updated 4 years ago
- Build a small, 3 domain internet using Github pages and Wikipedia and construct a crawler to crawl, render, and index. ☆73 Updated 2 years ago
- Expose a Top2Vec model with a REST API. ☆92 Updated 2 years ago
- Topic modeling helpers using managed language models from Cohere. Name text clusters using large GPT models. ☆223 Updated 2 years ago
- Vector AI: A platform for building vector based applications. Encode, query and analyse data using vectors. ☆316 Updated last year
- Various Jupyter notebooks about Common Crawl data ☆57 Updated 5 months ago
- 📊 Semantic search for headlines and story text ☆360 Updated last year
- Process Common Crawl data with Python and Spark ☆440 Updated last week
- Conversational AI tooling & personas built on Cohere's LLMs ☆173 Updated 2 years ago
- GPU-Powered Topic Modelling ☆70 Updated 2 years ago
- Google Cloud Storage connector, pre-processor and model for predicting user search intent based on keywords ☆25 Updated 5 years ago
- ☆10 Updated 2 years ago
- Generate True or False questions from any content with OpenAI GPT2 text generation, Sentence-BERT semantic search and Berkeley constituenc… ☆34 Updated 5 years ago
- Dataset for Instagram Fake and Automated Account Detection ☆58 Updated 5 years ago
- ☆168 Updated last week
- Dash app for classifying tweets in real-time ☆67 Updated 2 years ago
- new skills taxonomy using TextKernel data ☆35 Updated 2 years ago
- ☆122 Updated last month
- Scraping Amazon website using Proxies for extracting Mobile details ☆13 Updated 6 years ago
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models… ☆37 Updated 3 years ago
- Social Media Mining Toolkit (SMMT) main repository ☆137 Updated 2 years ago
- A minimal proof-of-concept Python script to tweet human-curated Tweets on a schedule. ☆27 Updated 5 years ago
- ☆19 Updated 11 months ago
- Build tensorflow keras model pipelines in a single line of code. Now with mlflow tracking. Created by Ram Seshadri. Collaborators welcome… ☆121 Updated last year
- NeuralQA: A Usable Library for Question Answering on Large Datasets with BERT ☆233 Updated 2 years ago
- Explainable Zero-Shot Topic Extraction ☆63 Updated last year
- Unreliable News Index (for Columbia Journalism Review) ☆56 Updated 3 years ago
- ☆33 Updated 2 years ago
- Matrix-based News Aggregation to Explore Media Bias ☆20 Updated 7 years ago