sweble / sweble-wikitext
The Sweble Wikitext Components module provides a parser for MediaWiki's wikitext and an engine that tries to emulate MediaWiki's behavior.
☆72 · Updated last year
Alternatives and similar repositories for sweble-wikitext
Users who are interested in sweble-wikitext compare it to the libraries listed below.
- Various utilities regarding Levenshtein transducers. (Java) ☆59 · Updated 4 years ago
- DKPro JWPL (DKPro Java Wikipedia Library) is a free, Java-based application programming interface that facilitates access to all informat… ☆88 · Updated this week
- A text tagger based on Lucene / Solr, using FST technology ☆177 · Updated 2 years ago
- NLP framework for JVM languages. ☆154 · Updated 4 years ago
- extJWNL (Extended Java WordNet Library) is a Java API for creating, reading and updating dictionaries in WordNet format. ☆131 · Updated last year
- A set of reusable Java components that implement functionality common to any web crawler ☆252 · Updated last week
- Mirror of Apache Stanbol (incubating) ☆116 · Updated last year
- SKOS Support for Apache Lucene and Solr ☆56 · Updated 4 years ago
- Solr Dictionary Annotator (Microservice for Spark) ☆71 · Updated 6 years ago
- Apache OpenNLP Sandbox ☆46 · Updated this week
- TinkerPop3 Graph Structure Implementation for OrientDB ☆94 · Updated 2 weeks ago
- ☆185 · Updated 7 years ago
- Collection of software components for natural language processing (NLP) based on the Apache UIMA framework. ☆201 · Updated last month
- Common web archive utility code. ☆61 · Updated this week
- WARC (Web Archive) Input and Output Formats for Hadoop ☆37 · Updated 11 years ago
- Java text categorization system ☆57 · Updated 8 years ago
- A fast and comprehensive Java library capable of performing automaton and non-automaton based Levenshtein distance determination and neig… ☆45 · Updated 12 years ago
- Java library for reading and writing WARC files with a typed API ☆54 · Updated 2 weeks ago
- Behemoth is an open source platform for large scale document analysis based on Apache Hadoop. ☆284 · Updated 7 years ago
- Apache Commons RDF ☆53 · Updated last week
- Write parsers for arbitrary text inputs, entirely in Java, with no preprocessing phase ☆66 · Updated 9 years ago
- An efficient and flexible token-based regular expression language and engine. ☆75 · Updated 11 years ago
- Apache UIMA Java SDK ☆66 · Updated 3 months ago
- Browser-driven explorer for lucene indexes ☆74 · Updated 4 years ago
- Software and resources for natural language processing. ☆132 · Updated 9 years ago
- Java library to interact with Wikibase ☆404 · Updated last week
- RDF store on a cloud-based architecture (previously on https://code.google.com/p/cumulusrdf) ☆31 · Updated 9 years ago
- Apache Joshua ☆111 · Updated 5 years ago
- An RDF plugin for Solr ☆114 · Updated last year
- Machine learning components for Apache UIMA ☆132 · Updated 2 years ago