ThibaudLamothe / scraping-corner
Scraping projects - Mostly using Scrapy, and a bit of selenium π€
β8Updated 3 years ago
Related projects β
Alternatives and complementary repositories for scraping-corner
- Problem Statement The objective of this task is to detect hate speech in tweets. For the sake of simplicity, we say a tweet contains hateβ¦β13Updated 5 years ago
- Course included such topics, as Data Preprocessing, Exploratory Data Analysis (EDA), Statistical Data Analysis (SDA), Data Collection anβ¦β11Updated 2 years ago
- SQLβ13Updated 7 years ago
- Digged into negative reviews, conducted NLP techniques such as sentiment analysis, text processing, n-gram modeling and then created a reβ¦β12Updated 7 years ago
- Content related to Mastering Postgresql along with videos.β14Updated 3 years ago
- Learn AWS Automation with boto3, Python, and Lambda Functions, by Packt Publishingβ13Updated last year
- Building a end-to-end lead scoring machine learning example with Jupyter, Sagemaker, MLflow, and Booklet.ai.β21Updated last year
- Example project for consuming AWS Kinesis streamming and save data on Amazon Redshift using Apache Sparkβ11Updated 6 years ago
- β15Updated 2 years ago
- Tools for extracting metadata from Tableau Desktop workbook files.β11Updated 2 years ago
- Singapore Condo Rental Prices - From Data Acquisition to Predictionβ13Updated 3 years ago
- Create Interactive Dashboards With Streamlit in Pythonβ15Updated 4 years ago
- Scraping of LinkedIn Profiles: Creates an Excel file containing the personal data and the last job position of all the provided LinkedIn β¦β119Updated last year
- Improving the development of Spark applications deployed as jobs on AWS services like Glue and EMRβ12Updated last year
- β10Updated 3 years ago
- β10Updated 9 months ago
- β9Updated 2 years ago
- Predicting customer churn using scikit-learnβ9Updated 6 years ago
- Python Automation To Arrange Files In One Clickβ17Updated last year
- An Airflow pipeline for the collection of historical Twitter dataβ10Updated 5 years ago
- β14Updated last year
- This is a solution that demonstrates how to train and deploy a pre-trained Huggingface model on AWS SageMaker and publish an AWS QuickSigβ¦β11Updated 2 years ago
- β13Updated 2 years ago
- A Telegram Bot that will give you a sentiment score of a certain keyword.β15Updated 4 years ago
- classify crime into different categories using PySparkβ21Updated 5 years ago
- β16Updated 4 years ago
- ETL (Extract, Transform and Load) with the Spark Python API (PySpark) and Hadoop Distributed File System (HDFS)β14Updated 5 years ago
- This Guidance helps customers set up an ecommerce website on WordPress.β10Updated last month
- Python script for creating Mobile Phones Dataset on GSMArena website.β59Updated last year
- Laptop Prices Predictor is an end-to-end data science project that accurately predicts laptop prices using machine learning algorithms. Tβ¦β14Updated 3 months ago