vaibkumr / DatasetScraperLinks
Tool to create image datasets for machine learning problems by scraping search engines like Google, Bing and Baidu.
☆16Updated 6 years ago
Alternatives and similar repositories for DatasetScraper
Users that are interested in DatasetScraper are comparing it to the libraries listed below
Sorting:
- This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around t…☆33Updated 2 years ago
- An easy-to-use python client for Google News feeds.☆50Updated 3 years ago
- A Python package to get useful information from documents using TopicRank Algorithm.☆16Updated 2 years ago
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod…☆15Updated 2 years ago
- The Selenium scraper that collected a million stories from Medium.com☆80Updated 6 years ago
- Live stream tweets based on keywords to database using SQLAlchemy. Tweets are assigned a sentiment score and data is presented via stream…☆43Updated 4 years ago
- Pair: image-based product collection recommender☆19Updated 5 years ago
- Using the adjacency matrix and random forest get the Name, Address, Items, Prices, Grand total from all kind of invoices.☆18Updated 5 years ago
- A Python Package which helps to scrape all news details from any news websites☆211Updated last month
- Instagram-like filters with deep learning☆56Updated last year
- Spectrum is an AI that uses machine learning to generate Rap song lyrics☆46Updated 4 years ago
- News API - fetch news from CommonCrawl, parse with NewsPlease, enrich with pre-trained machine-learning models, to structured searchable …☆29Updated 2 years ago
- A Python scraper for the Facebook Ad Library, using the official Facebook Ad Library API.☆119Updated 5 years ago
- Automatically transcribes YouTube videos☆91Updated 5 years ago
- A set of NLP tools created during my medium NLP Explanation series.☆31Updated last year
- COLLABORATE in building a collection of google COLAB notebooks☆72Updated 2 years ago
- Get the estimated value of a property from Redfin and Zillow☆23Updated last week
- Reproducing "Writing with Transformer" demo, using aitextgen/FastAPI in backend, Quill/React in frontend☆28Updated 4 years ago
- An Alexa skill providing a conversational interface to any public figure (as mimicked by GPT3). The legacy GUI is no longer maintained.☆21Updated last year
- NSS Capstone project to use natural language modeling, classification, and information extraction to get the exact employee count values …☆15Updated 6 years ago
- Run streamlit web application, test and deploy to a cloud service (GCP, AWS, Heroku)☆14Updated 2 years ago
- Text classification automl☆21Updated 4 years ago
- Scrapers from a project in 2018. Yelp, Spyfu, Similarweb, Morningstar, Linkedin, Instagram, Inside, Glassdoor, Facebook, Eat24, Doordash,…☆97Updated 6 years ago
- GraphiPy: Universal Social Data Extractor☆84Updated 2 years ago
- A US equities trading & settlement calendar command-line tool☆11Updated 3 years ago
- Investigate how mutual funds leverage credit derivatives by studying their routine filings to the SEC using NLP techniques 📈🤑☆51Updated 6 months ago
- Reddit title generator API based on GPT-2☆19Updated 5 years ago
- Collecting news articles for all the companies in the R1000, for a pre-defined set of news outlets, using Diffbot's Knowledge Graph☆12Updated 2 years ago
- Generate and debug Python code- with some help from AI☆74Updated 4 years ago
- Document Search Engine project with TF-IDF abd Google universal sentence encoder model☆54Updated 2 years ago