henchc / web-scrapersLinks
various web scrapers as examples
☆17Updated 5 years ago
Alternatives and similar repositories for web-scrapers
Users that are interested in web-scrapers are comparing it to the libraries listed below
Sorting:
- A fully customisable language detection pipeline for spaCy☆93Updated 6 years ago
- Simple duckduckgo results scraping☆68Updated 8 years ago
- ☆31Updated 2 years ago
- Python Scraper For Instagram's API☆51Updated 6 years ago
- Open source Emoticons and Emoji detection library: emot☆196Updated 2 years ago
- Here are the notebooks used during the spacy youtube series.☆103Updated 4 years ago
- Social Analysis based on Whatsapp data☆149Updated 2 years ago
- Word2Vec encodings based search engine for Stackoverflow questions☆26Updated 3 months ago
- A spell-checker extending Peter Norvig's with multi-typo correction, hamming distance weighting, and more.☆98Updated 5 years ago
- Find strings/words in text; convenience and C speed☆126Updated 3 years ago
- A text analysis application for performing common NLP tasks through a web dashboard interface and an API☆125Updated 7 years ago
- Streamlit-based Web App for Ai Text Generation based on GPT-2 Models from HuggingFace Model Hub using Python library aitextgen☆27Updated 5 years ago
- Package that returns a company embedding given a company name☆49Updated 5 years ago
- 🧬 A JupyterLab extension for annotating data with Prodigy☆189Updated 2 years ago
- An open-source package for python to clean raw text data☆75Updated 2 years ago
- An example program that scrapes data from AllRecipes.com and store in Elasticsearch☆99Updated 7 years ago
- Topic modelling with SpaCy, Gensim and Textacy☆19Updated 7 years ago
- How to build an end to end search engine using elasticsearch and angularjs☆26Updated 7 years ago
- Dataframe Integration with spaCy.☆103Updated 4 years ago
- A Stylometry Library for Python☆147Updated 2 years ago
- Scraping Tweet data for Russian Troll Twitter accounts into Neo4j☆57Updated 8 years ago
- Text summarization algorithm for the Capstone Project at Springboard code bootcamp☆54Updated 3 years ago
- 📂 Additional lookup tables and data resources for spaCy☆113Updated 8 months ago
- Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health.☆92Updated 4 years ago
- The tastiest machine learning project. Can we predict who is speaking for how long during an episode of the syntax.fm podcast?☆36Updated 7 years ago
- Let's perform Twitter sentiment analysis using Python, Docker, Elasticsearch, and Kibana!☆138Updated 5 years ago
- Relatively simple text classification powered by spaCy☆41Updated 10 years ago
- 16 Text Preprocessing Techniques in Python for Twitter Sentiment Analysis.☆229Updated 6 years ago
- Python3 interface to the LinkedIn API☆84Updated 5 years ago
- An NLP pipeline for COVID-19 surveillance used in the Department of Veterans Affairs Biosurveillance.☆16Updated 3 years ago