ept / warc-hadoop
WARC (Web Archive) Input and Output Formats for Hadoop
☆37 · Dec 7, 2014 · Updated 11 years ago
Alternatives and similar repositories for warc-hadoop
Users who are interested in warc-hadoop are comparing it to the libraries listed below.
- Collection of resources, papers, blog posts, and other documentation around working on and with Archivematica. ☆21 · Jan 4, 2024 · Updated 2 years ago
- Web catalog generator for Nintendo 3DS CIA files. ☆12 · Nov 24, 2018 · Updated 7 years ago
- The shared memory version of the Alternating Directions Implicit Solver for Isogeometric Analysis ☆10 · Jan 26, 2019 · Updated 7 years ago
- Common web archive utility code. ☆61 · Feb 6, 2026 · Updated last week
- Tool for looking at files within the Nintendo DS games Hotel Dusk and Last Window. ☆16 · May 2, 2024 · Updated last year
- Warcbase is an open-source platform for managing and analyzing web archives ☆162 · Dec 8, 2017 · Updated 8 years ago
- A reinforcement learning package implemented in Torch ☆11 · Jan 24, 2016 · Updated 10 years ago
- IPython Notebook for Sentiment Classification ☆10 · Nov 12, 2014 · Updated 11 years ago
- ☆13 · Aug 11, 2025 · Updated 6 months ago
- A Python library to simplify batch requests to AWS Services ☆12 · Apr 25, 2020 · Updated 5 years ago
- Spring Cloud Data Flow Streaming Example ☆10 · Mar 17, 2018 · Updated 7 years ago
- Java library for object oriented exception handling ☆17 · Jun 7, 2018 · Updated 7 years ago
- A TensorFlow 2.0 .whl file compiled with an old processor/computer ☆11 · Dec 12, 2020 · Updated 5 years ago
- Cartesian genetic programming (CGP) in pure Python. ☆36 · Mar 29, 2024 · Updated last year
- Matcher for JSON and JSON templates. Can help you with testing of REST APIs, databases, 3rd-party systems, etc. ☆15 · Apr 24, 2021 · Updated 4 years ago
- AngularJS Data Driven Directives - Using AngularJS to model data and visualize it with D3 AngularJS directives ☆13 · Jul 10, 2014 · Updated 11 years ago
- TREC Core track ☆11 · Jul 5, 2017 · Updated 8 years ago
- Pure Ruby sparklines. ☆42 · Jun 10, 2021 · Updated 4 years ago
- This repository contains sample agentic applications in python that can talk to LLM models and perform complex tasks based on user querie… ☆16 · Mar 3, 2025 · Updated 11 months ago
- pymur is a Python interface to The Lemur Toolkit. ☆19 · Sep 17, 2018 · Updated 7 years ago
- w3act is an annotation and curation tool for building web archive collections ☆21 · Jan 30, 2024 · Updated 2 years ago
- Application simulating external APIs for the Practical Rx Workshop ☆10 · May 16, 2015 · Updated 10 years ago
- Learn Kyo with simple exercises! ☆13 · Aug 25, 2025 · Updated 5 months ago
- MySQL UDF executing Lua code with storage engine API ☆19 · May 18, 2017 · Updated 8 years ago
- Chapel Data Object ☆10 · Jun 9, 2021 · Updated 4 years ago
- Google Cloud Platform support for Upspin ☆13 · Apr 20, 2024 · Updated last year
- Upload SQLite database files to Datasette ☆14 · Nov 10, 2025 · Updated 3 months ago
- Encoding of images into audio using the SSTV standard ☆10 · Sep 15, 2018 · Updated 7 years ago
- My talks ☆10 · Dec 11, 2024 · Updated last year
- Personal expense tracking application ☆10 · Nov 10, 2018 · Updated 7 years ago
- KSQL query linter and composer with dependency resolution ☆11 · Jan 10, 2023 · Updated 3 years ago
- A passkeys demo using Spring Boot and Auth0 as IdP ☆14 · Jan 29, 2024 · Updated 2 years ago
- Attacking the Nintendo 3DS Boot ROMs ☆13 · Feb 2, 2018 · Updated 8 years ago
- Deep Learning (PyTorch) Models Deployment using SQL databases ☆10 · Jul 25, 2021 · Updated 4 years ago
- Decompilation of the Eft library from NintendoWare for Cafe (Wii U) ☆12 · Jan 15, 2022 · Updated 4 years ago
- Tutorial about Artificial Intelligence, Machine Learning and Deep Learning ☆12 · Aug 9, 2019 · Updated 6 years ago
- Uses a genetic algorithm to "evolve" brainfuck programs with desirable behaviours ☆11 · Feb 8, 2025 · Updated last year
- Exploration of Hypercore's breakthrough designs and capabilities, uncovering its gems that may be scattered elsewhere, and learning to th… ☆14 · Sep 29, 2020 · Updated 5 years ago
- A fast, lightweight HTML to Markdown converter optimized for LLM consumption. Uses proven parsing libraries to deliver clean, well-struct… ☆29 · Feb 2, 2026 · Updated last week