diegoceccarelli / json-wikipedia
Json Wikipedia, contains code to convert the Wikipedia xml dump into a json/avro dump
☆253Updated last year
Alternatives and similar repositories for json-wikipedia:
Users that are interested in json-wikipedia are comparing it to the libraries listed below
- Software and resources for natural language processing.☆131Updated 8 years ago
- NLP tools developed by Emory University.☆60Updated 8 years ago
- Stanford Pattern-based Information Extraction and Diagnostics -- Visualization☆93Updated 10 years ago
- The Berkeley Entity Resolution System jointly solves the problems of named entity recognition, coreference resolution, and entity linking…☆185Updated 5 years ago
- Model Training tool for MITIE☆79Updated 9 years ago
- A Utility Library for Wikipedia dumps☆33Updated 8 years ago
- A toolkit that wraps various natural language processing implementations behind a common interface.☆101Updated 7 years ago
- Excitement Open Platform for Recognizing Textual Entailments☆89Updated 7 years ago
- Solr Dictionary Annotator (Microservice for Spark)☆71Updated 5 years ago
- ☆184Updated 6 years ago
- displaCy-ent.js: An open-source named entity visualiser for the modern web☆199Updated 7 years ago
- Joshua Statistical Machine Translation Toolkit☆122Updated 8 years ago
- Dexter is a framework that implements some popular algorithms and provides all the tools needed to develop any entity linking technique.☆206Updated 8 years ago
- NLP framework for JVM languages.☆148Updated 3 years ago
- The WikiBrain Java library enables researchers and developers to incorporate state-of-the-art Wikipedia-based algorithms and technologies…☆92Updated 6 years ago
- An open source toolkit for mining Wikipedia☆129Updated 6 years ago
- Automatically exported from code.google.com/p/deepsyntacticparsing☆23Updated 10 years ago
- SemanticVectors creates semantic WordSpace models from free natural language text.☆218Updated 2 years ago
- DBpedia.org RDF to CSV for import into Neo4j☆52Updated 10 years ago
- Hadoop jobs for WikiReverse project. Parses Common Crawl data for links to Wikipedia articles.☆38Updated 6 years ago
- A bunch of fancy soft string matching routines, with some accompanying datasets☆56Updated 7 years ago
- Extension of the mate-tools NLP pipeline☆67Updated 9 years ago
- The S-Space repsitory, from the AIrhead-Research group☆205Updated 4 years ago
- UIMA-based text classification framework built on top of DKPro Core and DKPro Lab.☆34Updated 2 years ago
- ESA implementation using Wikiprep output☆56Updated 11 years ago
- A large-scale statistical machine translation system written in Java.☆209Updated 3 years ago
- Fast and robust NLP components implemented in Java.☆52Updated 4 years ago
- Ukb: graph-based WSD and similarity☆106Updated 11 months ago
- Yara K-Beam Arc-Eager Dependency Parser☆56Updated 9 years ago
- Quality information extraction at web scale. Edit☆329Updated 8 years ago