will3216 / newspaper3k_lambda_template
Pre-built template for using newspaper3k on aws lambda
☆16 · Updated last year
Related projects
Alternatives and complementary repositories for newspaper3k_lambda_template
- ETL of newspaper article keywords using Apache Airflow, Newspaper3k, Quilt T4 and AWS S3 · ☆15 · Updated last week
- The Summarlight Chrome Extension highlights the most important parts of posts/stories/articles. · ☆26 · Updated 5 years ago
- Techcrunch Incremental Scrapy Spider With MongoDB · ☆16 · Updated 5 years ago
- CLI to extract article contents in bulk using Newspaper3k and multithreading. · ☆13 · Updated 6 years ago
- Comparison of Airflow on Celery vs Celery · ☆21 · Updated 6 years ago
- Add website scraping abilities to Datasette · ☆61 · Updated last year
- Tag news stories based on models trained on the NYT corpus. · ☆40 · Updated last year
- A library to extract a publication date from a web page, along with a measure of the accuracy. · ☆42 · Updated 5 years ago
- Resize image on the fly using flask, zappa, pillow, opencv-python · ☆18 · Updated 7 years ago
- Save an RSS or ATOM feed to a SQLite database · ☆47 · Updated 2 years ago
- Run Datasette on AWS serverless. · ☆17 · Updated 4 years ago
- Blackstone is a spaCy model and library for processing long-form, unstructured legal text. Here, we wrap Blackstone with a performant API… · ☆58 · Updated 3 years ago
- A Datasette plugin providing an MLOps platform to train, eval and predict machine learning models · ☆16 · Updated last week
- Inspect a URL and estimate if it contains a news story · ☆39 · Updated last month
- 100k+ topic labeled news articles published from thousands of news websites · ☆18 · Updated 4 years ago
- Simple dashboard for getting currently trending hashtags and topics on Twitter · ☆25 · Updated last year
- Text summarization using spacy · ☆22 · Updated last year
- ☆10 · Updated 3 years ago
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution. · ☆52 · Updated 3 weeks ago
- Write Datasette canned queries as plain SQL files · ☆13 · Updated 2 years ago
- A search engine for Open Data · ☆53 · Updated last year
- Search sites for RSS, Atom, and JSON feeds. · ☆18 · Updated last year
- Convert JSON to a set of tidy CSV files · ☆23 · Updated last month
- A small Python module containing quick utility functions for standard ETL processes. · ☆33 · Updated last week
- Scripts to consume and analyze the GDELT project's data · ☆26 · Updated 7 years ago
- 🔮 Programmatic time-based job scheduler · ☆20 · Updated last year
- Library for scraping websites or apis at any scale · ☆54 · Updated 9 months ago
- how hard is it to get a list of all local news sites in the United States (LOL) · ☆8 · Updated 4 years ago