☆50Feb 22, 2017Updated 9 years ago
Alternatives and similar repositories for example-warc-java
Users that are interested in example-warc-java are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Hadoop jobs for WikiReverse project. Parses Common Crawl data for links to Wikipedia articles.☆38Aug 12, 2018Updated 7 years ago
- Demonstration of using Python to process the Common Crawl dataset with the mrjob framework☆168Jan 27, 2026Updated 2 months ago
- An easy-to-use and highly customizable crawler that enables you to create your own little Web archives (WARC/CDX)☆25Oct 9, 2017Updated 8 years ago
- A S3 hybrid storage interface for dat and hyperdrive☆13Jul 31, 2018Updated 7 years ago
- React ECharts for ClojureScript☆13Oct 27, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system.☆47Dec 4, 2017Updated 8 years ago
- ETL Utilities for Clojure☆30Apr 6, 2025Updated last year
- Library to ease use of the MASON ABM library with Clojure☆14Feb 8, 2023Updated 3 years ago
- Logging configuration with timbre☆18Oct 1, 2014Updated 11 years ago
- A Clojure library to easily write tests with files.☆15Feb 1, 2023Updated 3 years ago
- Lehigh University Benchmark (LUBM).☆10Apr 22, 2020Updated 5 years ago
- Clojurified Apache Curator☆24Mar 8, 2020Updated 6 years ago
- the web non-framework☆21Jan 22, 2018Updated 8 years ago
- Clojure library and command line application for converting CSV to RDF. An implementation of the W3C CSVW specifications☆29Apr 1, 2026Updated last week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Semantic File Inspector ‒ RDF-based metadata extraction and semantic search☆19Mar 19, 2025Updated last year
- Additional validators for Prismatic's Schema.☆32Sep 6, 2021Updated 4 years ago
- Clojure wrapper for the `jackson-jq `. Embed `jq` scripts into your app. Compatible with GraalVM native-image.☆21Sep 29, 2023Updated 2 years ago
- A sweet Clojure API for Atomix☆17Feb 17, 2017Updated 9 years ago
- Clone of iris-reasoner (http://iris-reasoner.org) from sourceforge☆11Mar 18, 2016Updated 10 years ago
- Clojure-idiomatic GDAL wrapper☆17Jul 18, 2016Updated 9 years ago
- A simple library to convert an Object/Document/etc into Clojure EDN data☆23Aug 25, 2018Updated 7 years ago
- Like Awk, but Clojure.☆51Jul 15, 2015Updated 10 years ago
- Flexible progress module for Clojure.☆30Sep 11, 2018Updated 7 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- An offshoot of the Awesome-Public-Datasets repo I'm cultivating☆15Dec 3, 2019Updated 6 years ago
- See through the darkness of a running program.☆51Nov 13, 2013Updated 12 years ago
- 基于人工神经网络的中文语义相似度计算研究☆11Apr 1, 2013Updated 13 years ago
- 宝贝鱼(CshBBrainAIO) 是一个来自中国 的简单的轻量级的高性能的WebSocket服务器。支持服务器集群,能满足大并发量高容量的分布式系统开发。如果你需要开发带有集群功能的WebSocket服务器,宝贝鱼(CshBBrainAIO) 也许是非常适合你的选择。在宝贝…☆16Dec 6, 2012Updated 13 years ago
- Packer build scripts for NixOS base images☆32Sep 11, 2015Updated 10 years ago
- 时序的金融领域知识图谱构建及问答 以年报为数据 jena为框架☆11Aug 16, 2018Updated 7 years ago
- A Clojure-like Datomic API for ClojureScript☆57Nov 26, 2013Updated 12 years ago
- Having fun with core.async☆103Oct 5, 2013Updated 12 years ago
- CRFs based Chinese word segmentor☆21Oct 8, 2014Updated 11 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Crux Database Client☆19Jan 24, 2021Updated 5 years ago
- Clojure concurrent pipleline on core.async☆47Jan 25, 2019Updated 7 years ago
- WARC (Web Archive) Input and Output Formats for Hadoop☆37Dec 7, 2014Updated 11 years ago
- site crawler for knowledge graph☆14Jul 4, 2018Updated 7 years ago
- Library to operate on a content-addressed graph of nodes with directed merkle-hash links☆24Nov 23, 2021Updated 4 years ago
- A library for serializing Prismatic Schema definitions with Transit.☆31Sep 21, 2020Updated 5 years ago
- A thin Clojure wrapper for the Java API for FoundationDB.☆27Dec 19, 2022Updated 3 years ago