sweble / sweble-wikitext
The Sweble Wikitext Components module provides a parser for MediaWiki's wikitext and an engine that tries to emulate MediaWiki's behavior.
☆73 Updated last year
Alternatives and similar repositories for sweble-wikitext
Users that are interested in sweble-wikitext are comparing it to the libraries listed below
- DKPro JWPL (DKPro Java Wikipedia Library) is a free, Java-based application programming interface that facilitates access to all informat… ☆86 Updated this week
- Various utilities regarding Levenshtein transducers. (Java) ☆57 Updated 3 years ago
- Solr Dictionary Annotator (Microservice for Spark) ☆71 Updated 5 years ago
- A text tagger based on Lucene / Solr, using FST technology ☆176 Updated last year
- SKOS Support for Apache Lucene and Solr ☆56 Updated 4 years ago
- Mirror of Apache Stanbol (incubating) ☆112 Updated last year
- NLP framework for JVM languages. ☆149 Updated 4 years ago
- Java text categorization system ☆56 Updated 8 years ago
- An efficient and flexible token-based regular expression language and engine. ☆75 Updated 11 years ago
- Java library for reading and writing WARC files with a typed API ☆49 Updated 3 weeks ago
- Common web archive utility code. ☆55 Updated 2 weeks ago
- A set of reusable Java components that implement functionality common to any web crawler ☆244 Updated last week
- WARC (Web Archive) Input and Output Formats for Hadoop ☆36 Updated 10 years ago
- Apache Commons RDF ☆48 Updated last week
- RDF store on a cloud-based architecture (previously on https://code.google.com/p/cumulusrdf) ☆31 Updated 9 years ago
- ☆184 Updated 6 years ago
- TinkerPop3 Graph Structure Implementation for OrientDB ☆93 Updated last month
- extJWNL (Extended Java WordNet Library) is a Java API for creating, reading and updating dictionaries in WordNet format. ☆129 Updated last year
- Java parsers for different RDF serialisations + API + tools + JAX-RS integration ☆20 Updated 4 years ago
- Apache OpenNLP Sandbox ☆43 Updated this week
- Browser-driven explorer for Lucene indexes ☆74 Updated 3 years ago
- NLP tools developed by Emory University. ☆60 Updated 9 years ago
- Apache Anything To Triples (Any23) is a library, a web service and a command line tool that extracts structured data in RDF format from a… ☆97 Updated 2 years ago
- ☆71 Updated 7 years ago
- Word2Vec Java Port ☆188 Updated 7 years ago
- Fast in-memory graph structure, powering Gephi ☆75 Updated last month
- An RDF plugin for Solr ☆115 Updated 6 months ago
- Apache Joshua ☆108 Updated 4 years ago
- Tools for finite state automata construction and dictionary-based morphological dictionaries. Includes Polish stemming dictionary. ☆194 Updated 2 years ago
- Json Wikipedia, contains code to convert the Wikipedia xml dump into a json dump. Questions? https://gitter.im/idio-opensource/Lobby ☆17 Updated 3 years ago