delip / wikixmlj
WikiXMLJ provides easy access to Wikipedia XML dumps.
☆21Updated 7 years ago
Related projects: ⓘ
- Ready-to-use examples of dkpro-core components and pipelines.☆34Updated 9 months ago
- An efficient and flexible token-based regular expression language and engine.☆74Updated 10 years ago
- ☆42Updated this week
- DKPro JWPL (DKPro Java Wikipedia Library) is a free, Java-based application programming interface that facilitates access to all informat…☆81Updated 6 months ago
- NLP framework for JVM languages.☆148Updated 3 years ago
- A Utility Library for Wikipedia dumps☆33Updated 7 years ago
- Word and text similarity measures☆53Updated 2 years ago
- A Java Wikipedia markup to plain text converter☆37Updated 2 years ago
- Various utilities regarding Levenshtein transducers. (Java)☆56Updated 2 years ago
- A bunch of fancy soft string matching routines, with some accompanying datasets☆54Updated 7 years ago
- Apache OpenNLP Sandbox☆42Updated this week
- A fast and comprehensive Java library capable of performing automaton and non-automaton based Levenshtein distance determination and neig…☆41Updated 11 years ago
- extJWNL (Extended Java WordNet Library) is a Java API for creating, reading and updating dictionaries in WordNet format.☆124Updated 6 months ago
- NLP tools developed by Emory University.☆60Updated 8 years ago
- An open relation extraction system☆46Updated 2 years ago
- Solr Dictionary Annotator (Microservice for Spark)☆70Updated 4 years ago
- A Java implementation of the Rapid Automatic Keyword Extraction Framework ( RAKE )☆28Updated 6 years ago
- A dependency tree visualizer for the Stanford Typed-Dependency Parser☆68Updated 7 years ago
- Analytic UIMA pipelines using Spark☆23Updated 8 years ago
- A convenience Java wrapper around GloVe word vectors and converter to more space efficient binary files.☆24Updated 3 years ago
- KEA - Keyphrase Extraction Algorithm☆21Updated 8 years ago
- Word2Vec Java Port☆186Updated 6 years ago
- The S-Space repsitory, from the AIrhead-Research group☆203Updated 3 years ago
- ☆11Updated this week
- DKPro Lab offers a workflow engine for parameter sweeping experiments.☆9Updated 9 months ago
- Apache Joshua☆104Updated 4 years ago
- Java text categorization system☆54Updated 7 years ago
- Using latent Dirichlet allocation (LDA) in Apache Lucene☆58Updated 11 years ago
- Machine learning components for Apache UIMA☆129Updated last year
- A Text Classification API in Java originally developed by DigitalPebble Ltd. The API is independent from the ML implementations used and …☆48Updated 2 years ago