vaibkumr / DatasetScraperLinks
Tool to create image datasets for machine learning problems by scraping search engines like Google, Bing and Baidu.
☆16Updated 6 years ago
Alternatives and similar repositories for DatasetScraper
Users that are interested in DatasetScraper are comparing it to the libraries listed below
Sorting:
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod…☆15Updated 2 years ago
- Text classification automl☆21Updated 3 years ago
- Run streamlit web application, test and deploy to a cloud service (GCP, AWS, Heroku)☆14Updated 2 years ago
- This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around t…☆33Updated 2 years ago
- Tools for scraping YouTube video metadata (mostly for training AI on video titles)☆42Updated 4 years ago
- Embedding Visualizer (EmbedViz) data app made with Streamlit library☆22Updated 5 years ago
- Dolores is a Python library designed to improve the developer experience when working with pretrained language models. Dolores provides p…☆34Updated 4 years ago
- COLLABORATE in building a collection of google COLAB notebooks☆72Updated 2 years ago
- A project for predicting personality of individuals using their facebook statistics. In this project we propose a neural network approach…☆14Updated 11 years ago
- NSS Capstone project to use natural language modeling, classification, and information extraction to get the exact employee count values …☆15Updated 6 years ago
- Detecting gibberish as a type of sentiment analysis with GPT2☆24Updated 4 years ago
- This is a document concerning Data Readiness in the context of machine learning and Natural Language Processing.☆11Updated 3 years ago
- Loan Risk Prediction Neural Network and API☆17Updated 4 years ago
- Streamlit application to keep GPT3 Experimentation sane☆23Updated 3 years ago
- The elegant integration of huggingface/nlp and fastai2 and handy transforms using pure huggingface/nlp☆19Updated 4 years ago
- News API - fetch news from CommonCrawl, parse with NewsPlease, enrich with pre-trained machine-learning models, to structured searchable …☆28Updated 2 years ago
- An easy-to-use python client for Google News feeds.☆50Updated 3 years ago
- An Alexa skill providing a conversational interface to any public figure (as mimicked by GPT3). The legacy GUI is no longer maintained.☆21Updated last year
- App store search example, using Jina as backend and Streamlit as frontend☆21Updated 3 years ago
- Generate machine learning models fully automatically to clasiffiy any images using SERP data☆12Updated 2 years ago
- The NewSHead dataset is a multi-doc headline dataset used in NHNet for training a headline summarization model.☆37Updated 3 years ago
- Analyse Big Five personality traits from strings.☆17Updated 7 years ago
- A set of NLP tools created during my medium NLP Explanation series.☆31Updated last year
- A minimal proof-of-concept Python script to tweet human-curated Tweets on a schedule.☆27Updated 4 years ago
- A News Article Collection Library☆22Updated 2 years ago
- text-data pre-processing utility☆13Updated 2 years ago
- ☆10Updated 4 years ago
- Collecting news articles for all the companies in the R1000, for a pre-defined set of news outlets, using Diffbot's Knowledge Graph☆11Updated 2 years ago
- A large-scale infographics dataset from Visual.ly with metadata and additional crowdsourced annotations☆14Updated 6 years ago
- Cross-lingual Fact-to-Text Alignment and Generation for Low-Resource Languages☆9Updated 2 years ago