henchc / web-scrapers
various web scrapers as examples
☆17Updated 4 years ago
Alternatives and similar repositories for web-scrapers:
Users that are interested in web-scrapers are comparing it to the libraries listed below
- Collection of Jupyter notebooks for downloading Twitter data☆24Updated 8 years ago
- This project is created to promote and advocate the use of FOSS machine learning.☆44Updated this week
- A repo for talk materials☆25Updated 4 years ago
- ☆59Updated 3 years ago
- python package for performing deduplication using flexible text matching and cleaning in pandas dataframe☆25Updated 4 years ago
- Production Machine Learning Pipeline for Text Classification with fastText☆32Updated 3 years ago
- web scrapping in python: multiple libraries -requests, beautifulsoup, mechanize, selenium☆60Updated 8 years ago
- A spell-checker extending Peter Norvig's with multi-typo correction, hamming distance weighting, and more.☆98Updated 4 years ago
- Group thousands of similar spreadsheet or database text entries in seconds☆156Updated last year
- An overview of the Folium library to visualize Geospatial data☆19Updated 6 years ago
- Custom Named Entity Recognition annotated using NER Annotated by tecoholic and Spacy for training the model☆16Updated 4 years ago
- How to do data science with Optimus, Spark and Python.☆19Updated 5 years ago
- Simple duckduckgo results scraping☆67Updated 7 years ago
- Word2Vec encodings based search engine for Stackoverflow questions☆26Updated last year
- Text summarization using spacy☆22Updated 2 years ago
- "BI Glue" Business Intelligence middleware library for aggregation of metrics/KPI from any source and custom reporting for humans or othe…☆10Updated 10 years ago
- Python and pandas tools to perform various analyses on different types of word lists☆16Updated 10 years ago
- Streamlit-based Web App for Ai Text Generation based on GPT-2 Models from HuggingFace Model Hub using Python library aitextgen☆27Updated 4 years ago
- Python, Tor, Stem, Privoxy: with this tools, allow requests new connections via Tor for obtain new IP addresses.☆24Updated 6 years ago
- Predict age and gender from a first name☆60Updated 6 years ago
- Python library for finding phone numbers in random user input text.☆9Updated 7 years ago
- (Deprecated - please use https://github.com/gmarmstrong/python-datamuse) Python wrapper for the Datamuse API☆15Updated 7 years ago
- Python library to infer date format from examples☆42Updated 3 years ago
- A Notebook based on NLP Spacy course☆57Updated last year
- TensorFlow implementations of several deep learning models (e.g. variational autoencoder, RNN, ...)☆37Updated 6 years ago
- ☆31Updated last year
- Python module for Named Entity Recognition (NER) using natural language processing.☆13Updated 3 years ago
- How to build an end to end search engine using elasticsearch and angularjs☆26Updated 6 years ago
- Detect whether a social media comment is insulting or derogatory☆23Updated 2 years ago
- A selection of business datasets☆17Updated 5 years ago