sweble / sweble-wikitext
The Sweble Wikitext Components module provides a parser for MediaWiki's wikitext and an engine that tries to emulate the behavior of MediaWiki.
☆71 Updated last year
Alternatives and similar repositories for sweble-wikitext
Users who are interested in sweble-wikitext are comparing it to the libraries listed below.
- DKPro JWPL (DKPro Java Wikipedia Library) is a free, Java-based application programming interface that facilitates access to all informat… ☆85 Updated 2 weeks ago
- Various utilities regarding Levenshtein transducers. (Java) ☆57 Updated 3 years ago
- A text tagger based on Lucene / Solr, using FST technology ☆176 Updated last year
- NLP tools developed by Emory University. ☆60 Updated 8 years ago
- Java text categorization system ☆56 Updated 8 years ago
- Solr Dictionary Annotator (Microservice for Spark) ☆71 Updated 5 years ago
- Apache Joshua ☆106 Updated 4 years ago
- Apache OpenNLP Sandbox ☆42 Updated this week
- Software and resources for natural language processing. ☆131 Updated 8 years ago
- A Utility Library for Wikipedia dumps ☆33 Updated 8 years ago
- Mirror of Apache Stanbol (incubating) ☆112 Updated last year
- SKOS Support for Apache Lucene and Solr ☆56 Updated 4 years ago
- extJWNL (Extended Java WordNet Library) is a Java API for creating, reading and updating dictionaries in WordNet format. ☆128 Updated last year
- NLP framework for JVM languages. ☆148 Updated 4 years ago
- A bunch of fancy soft string matching routines, with some accompanying datasets ☆56 Updated 7 years ago
- The WikiBrain Java library enables researchers and developers to incorporate state-of-the-art Wikipedia-based algorithms and technologies… ☆93 Updated 6 years ago
- Write parsers for arbitrary text inputs, entirely in Java, with no preprocessing phase ☆65 Updated 9 years ago
- Word2Vec Java Port ☆186 Updated 6 years ago
- Collection of software components for natural language processing (NLP) based on the Apache UIMA framework. ☆199 Updated 5 months ago
- Java library for reading and writing WARC files with a typed API ☆48 Updated 4 months ago
- An efficient and flexible token-based regular expression language and engine. ☆75 Updated 11 years ago
- Common web archive utility code. ☆55 Updated 2 months ago
- An RDF plugin for Solr ☆114 Updated 3 months ago
- RDF store on a cloud-based architecture (previously on https://code.google.com/p/cumulusrdf) ☆31 Updated 9 years ago
- Java port of SymSpell: 1 million times faster through Symmetric Delete spelling correction algorithm ☆67 Updated 4 years ago
- Machine learning components for Apache UIMA ☆129 Updated last year
- Apache Commons RDF ☆47 Updated this week
- WARC (Web Archive) Input and Output Formats for Hadoop ☆35 Updated 10 years ago
- Json Wikipedia, contains code to convert the Wikipedia xml dump into a json/avro dump ☆253 Updated last year
- Search a single field with different query time analyzers in Solr ☆25 Updated 5 years ago