A scrapy project to extract the text and metadata of articles from news websites
☆73 · Oct 7, 2021 · Updated 4 years ago
Alternatives and similar repositories for RISJbot
Users who are interested in RISJbot are comparing it to the libraries listed below.
- A scrapy extension to sync `.scrapy` folder to an S3 bucket ☆18 · Mar 28, 2022 · Updated 3 years ago
- Developing a news app with different recommender systems ☆13 · May 22, 2023 · Updated 2 years ago
- A toy project with Scrapy + Django + Celery to run on Heroku ☆13 · Sep 8, 2015 · Updated 10 years ago
- Research compendium for reproducible research ☆12 · Sep 7, 2020 · Updated 5 years ago
- Scrapy schema validation pipeline and Item builder using JSON Schema ☆45 · Mar 26, 2021 · Updated 4 years ago
- Sample codes and datasets for COMM7780/ JOUR7280 @ HKBU ☆13 · Aug 11, 2019 · Updated 6 years ago
- A powerful and automatic awesome list generator leveraging GPT models. Enter your keyword and its description to get a comprehensive list… ☆39 · Feb 5, 2026 · Updated last month
- Example code that launches a docker container on AWS Fargate from AWS Lambda ☆18 · Dec 24, 2017 · Updated 8 years ago
- Scrapy pipeline to store chunked items into Amazon S3 or Google Cloud Storage bucket. ☆76 · Mar 18, 2022 · Updated 4 years ago
- Cordova Android TV Plugin ☆19 · Jul 30, 2021 · Updated 4 years ago
- Docker container running scrapyd with HTTP authentication ☆41 · May 14, 2024 · Updated last year
- The algorithms for multilevel evaluation of balance in signed directed networks ☆10 · Jul 4, 2024 · Updated last year
- Code from my series of articles on Flask ☆19 · Oct 18, 2021 · Updated 4 years ago
- Working with newspaper data from 'LexisNexis' ☆112 · Apr 17, 2024 · Updated last year
- ☆14 · Feb 28, 2017 · Updated 9 years ago
- ☆12 · Apr 12, 2023 · Updated 2 years ago
- Code and data for "Detecting Stance in Media on Global Warming". ☆15 · Dec 8, 2022 · Updated 3 years ago
- Automatically generate Doom-metal album covers ☆10 · May 29, 2015 · Updated 10 years ago
- ☆14 · May 13, 2022 · Updated 3 years ago
- A Cordova plugin to handle the HTML5 gamepad API for iOS and Android ☆21 · Oct 18, 2014 · Updated 11 years ago
- Sick of Hacker News, Reddit too mainstream and Digg too lame? Meet Zebra an open source and stripped back news submission website like Ha… ☆41 · May 3, 2013 · Updated 12 years ago
- ☆10 · Jan 31, 2021 · Updated 5 years ago
- ☆13 · May 13, 2022 · Updated 3 years ago
- A lightweight Pyramid blog application suitable for introducing Python web development. ☆11 · Oct 17, 2016 · Updated 9 years ago
- Yet another IPython notebook to LaTeX converter - this one exports clean code easily absorbed in other reports. ☆16 · Jun 1, 2023 · Updated 2 years ago
- ☆25 · Apr 6, 2015 · Updated 10 years ago
- Scrapy entrypoint for Scrapinghub job runner ☆25 · Feb 26, 2026 · Updated 3 weeks ago
- The Erasmian Language Model ☆14 · Jul 3, 2024 · Updated last year
- project trying to replicate http://arxiv.org/pdf/1412.5567v2.pdf ☆12 · Mar 22, 2015 · Updated 10 years ago
- Configurable Chart Collection - New renamed library is available at drarmstr/chartcollection: ☆17 · Oct 25, 2016 · Updated 9 years ago
- A scrapy extension to store requests and responses information in storage service ☆27 · Mar 11, 2022 · Updated 4 years ago
- Scrapy middleware to add extra fields to items, like timestamp, response fields, spider attributes etc. ☆57 · Mar 16, 2022 · Updated 4 years ago
- ☆12 · Feb 2, 2021 · Updated 5 years ago
- first draft for solidata_frontend : vue+nuxt+vuetify+i18n+axios ☆17 · Dec 22, 2022 · Updated 3 years ago
- Lab for exercising SPARQL ☆12 · Jan 16, 2022 · Updated 4 years ago
- A distributed system for mining common crawl using SQS, AWS-EC2 and S3 ☆22 · Jun 24, 2014 · Updated 11 years ago
- [OBSOLETE] - needs updating - An example client to the Stremio add-ons protocol, similar to Stremio's Discover ☆10 · Jul 31, 2018 · Updated 7 years ago
- ☆19 · May 1, 2023 · Updated 2 years ago
- A curated list of awesome packages, articles, and other cool resources from the Scrapy community. ☆558 · Dec 28, 2022 · Updated 3 years ago