dylanjcastillo / google_books_crawler
Python crawler for getting books' metadata from the Google Books API using asyncio and aiohttp
☆22Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for google_books_crawler
- Fastlaw's purpose is to replace generic word embeddings for work on supervised machine learning NLP-tasks with legal texts.☆37Updated 5 years ago
- doccano auto labeling pipeline helps doccano to annotate a document automatically.☆40Updated last year
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆42Updated 5 years ago
- A Python package to get useful information from documents using TopicRank Algorithm.☆16Updated last year
- ☆20Updated 2 years ago
- Streamlit dashboard of StarTrek character interactions☆10Updated last year
- A tidy and complete archive of metadata for papers on arxiv.org, 1993-2019☆28Updated 4 years ago
- Repository of the HBCP project.☆18Updated 4 months ago
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod…☆15Updated last year
- This repository contains my work that supports my article on Towards Data Science: "Exploring the Most Popular Machine Learning and Deep …☆20Updated 2 years ago
- An ongoing series of notebooks aimed at helping fellow NLP enthusiasts think about applying new tools and techniques to practical tasks.☆18Updated 3 years ago
- Matplotlib style configurator, built with Streamlit☆29Updated 4 years ago
- Dump of generated texts from GPT-2 trained on /r/legaladvice subreddit titles☆23Updated 5 years ago
- NERtwork is a collection of scripts to help you create a network graph of co-occurring named entities using open source tools. This is do…☆49Updated 7 months ago
- This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around t…☆32Updated last year
- Simple and clean Python implementation of TextRank as per seminal paper by Rada Mihalcea and Paul Tarau. This implementation performs bot…☆11Updated 3 years ago
- A sentiment analysis project performed on data collected from Twitter mentioning the two primary contestants in the 2020 US Elections.☆11Updated 4 years ago
- ☆22Updated 2 years ago
- Package to parse and analyze trademark data from the United States Patent and Trademark Office☆12Updated 7 years ago
- Scraping Assisted by Learning☆35Updated this week
- Dataset: BuzzFeed News “Trending” Strip, 2018–2023☆19Updated last year
- Learning and buiding API using Fast API☆12Updated 3 years ago
- help kids learn python☆27Updated this week
- Stylometric framework in Python☆13Updated 9 years ago
- ☆22Updated 3 years ago
- ☆15Updated 3 years ago
- Writing Primer for Data Scientists☆18Updated 4 years ago
- Create and customize your own Periodic Table. With the help of Streamlit and Bokeh.☆19Updated 2 years ago
- ETL of newspaper article keywords using Apache Airflow, Newspaper3k, Quilt T4 and AWS S3☆15Updated 2 weeks ago
- A Datasette plugin providing an MLOps platform to train, eval and predict machine learning models☆16Updated this week