mikekestemont / ruzicka
☆12Updated last year
Alternatives and similar repositories for ruzicka:
Users that are interested in ruzicka are comparing it to the libraries listed below
- An authorship attribution project with particular emphasis on Twitter analysis☆16Updated 3 years ago
- A set of utilities for processing MediaWiki XML dump data.☆50Updated 6 months ago
- an experimental implementation of Burrow's delta in Python 3☆20Updated 3 years ago
- ☆59Updated 3 weeks ago
- Measure the similarity of text corpora for 74 languages☆13Updated last year
- Collection of data about URL filtering in various countries☆41Updated 8 years ago
- A Memento Aggregator CLI and Server in Go☆61Updated 8 months ago
- KeyTerms centralized terminology management tool☆13Updated 5 years ago
- A listing of world wide web archives, for humans and machines using Web Archive Manifest (WAM) yaml format☆45Updated 2 years ago
- ☆27Updated 2 years ago
- R package for stylometric analyses☆180Updated 3 weeks ago
- A PDF classifier ensemble with REST API service☆23Updated 3 years ago
- Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system.