A framework for quick web archiving; canonical repository: https://gitea.arpa.li/JustAnotherArchivist/qwarc
☆30 · Jan 17, 2026 · Updated last month
Alternatives and similar repositories for qwarc
Users who are interested in qwarc are comparing it to the libraries listed below.
- wpull fork with fixes and faster parsing using html5-parser; used by grab-site; should go away when wpull is similarly improved ☆30 · Sep 20, 2025 · Updated 5 months ago
- Sources for urls-grab. ☆13 · Feb 24, 2026 · Updated last week
- A GitHub action that checks for Python syntax errors using pyflakes ☆12 · Feb 12, 2019 · Updated 7 years ago
- Decentralized web archiving ☆20 · Aug 7, 2018 · Updated 7 years ago
- Tower configuration server ☆10 · Jul 14, 2025 · Updated 7 months ago
- README for AbantOS (Elive/Debian Custom Distribution) Pick a Section, fork, edit, commit, pull, merge ☆12 · Dec 14, 2017 · Updated 8 years ago
- CDXJ Indexing of WARC/ARCs ☆33 · Dec 10, 2024 · Updated last year
- Experimental proxy and wrapper for safely embedding Web Archives (warc, warc.gz, wacz) into web pages. ☆41 · Nov 24, 2025 · Updated 3 months ago
- ☆13 · Nov 21, 2023 · Updated 2 years ago
- A configurable, reusable tracker with dashboard ☆36 · Dec 15, 2023 · Updated 2 years ago
- Architecture for a Twint scraper that allows downloading tweets on many instances without API restrictions ☆10 · Nov 30, 2020 · Updated 5 years ago
- A Python Reddit scraper with dual-mode architecture: simple requests for small jobs, async + proxy rotation for large-scale scraping. Fea… ☆16 · Oct 30, 2025 · Updated 4 months ago
- The most flexible modern open source authentication server for your cloud. ☆10 · Mar 7, 2023 · Updated 2 years ago
- ☆11 · Jan 28, 2023 · Updated 3 years ago
- Web app which displays the daily and hourly sentiments for a stock (user to enter ticker as input). Stock sentiments are determined from… ☆10 · Sep 26, 2022 · Updated 3 years ago
- ☆12 · Nov 11, 2025 · Updated 3 months ago
- A Python package for accessing the OpenCorporates API ☆11 · Feb 12, 2019 · Updated 7 years ago
- OpenPGP in Python using Sequoia PGP ☆18 · Feb 25, 2026 · Updated last week
- [Course Project, CS 251( 2018-1 ) - IIT Bombay] A secure Personal Cloud storage for files - Web Application( Django) ☆10 · Mar 2, 2020 · Updated 6 years ago
- Dump elasticsearch instance ☆15 · Jan 7, 2026 · Updated last month
- A python client for the DPLA API ☆43 · Oct 3, 2022 · Updated 3 years ago
- Semantic search web application with graph visualization in Django ☆12 · Aug 2, 2017 · Updated 8 years ago
- Individually compiled based on Quin's addon list. ☆11 · Oct 16, 2019 · Updated 6 years ago
- Mobile UI automation test scripts using the Appium + Cucumber testing approach, written in Ruby. https://www.jianshu.com/p/c3db8e5dc306 ☆11 · Jun 15, 2018 · Updated 7 years ago
- Twitter-based sentiment analysis using Java and Hadoop. In this project we perform sentiment analysis on Twitter data to analyse wh… ☆10 · Apr 22, 2018 · Updated 7 years ago
- Application which supports the UNC Libraries' Digital Collections Repository ☆12 · Updated this week
- Examples for using the Pipl SEARCH API ☆11 · Dec 19, 2023 · Updated 2 years ago
- PAP/API Lite, or PAPILITE as it is abbreviated, is an independent and open REST API with all postal codes and postal towns for Sweden, Da… ☆10 · Jul 10, 2022 · Updated 3 years ago
- This repository defines a python class that can be used to load data for the tf.keras.model.fit_generator function by using a torch.utils… ☆11 · Oct 26, 2024 · Updated last year
- Vue.js + S3 => CMS ☆12 · Nov 23, 2021 · Updated 4 years ago
- Islandora Solr Search module ☆24 · Jul 28, 2025 · Updated 7 months ago
- Windows Dev Home Application ☆17 · Jan 29, 2024 · Updated 2 years ago
- WarcMiddleware lets users seamlessly download a mirror copy of a website when running a web crawl with the Python web crawler Scrapy. ☆48 · Mar 19, 2018 · Updated 7 years ago
- Serving content from a WARC ☆62 · Jan 5, 2013 · Updated 13 years ago
- Models, vocabularies and behaviours for Hyrax applications. ☆11 · Sep 21, 2023 · Updated 2 years ago
- DEPRECATED: (See GeoBlacklight repo) A metadata schema for GIS resource discovery used by GeoBlacklight ☆15 · Jan 10, 2018 · Updated 8 years ago
- The easiest way to run shell commands with Python. A python command line object mapper. ☆26 · Jan 30, 2015 · Updated 11 years ago
- Build wordlists from the common-crawl index ☆12 · Oct 9, 2022 · Updated 3 years ago
- Bearsql allows you to query pandas dataframe with sql syntax. It uses duckdb as the internal processing engine ☆15 · Sep 20, 2023 · Updated 2 years ago