diegoceccarelli / json-wikipedia
Json Wikipedia, contains code to convert the Wikipedia xml dump into a json/avro dump
☆253Updated last year
Alternatives and similar repositories for json-wikipedia:
Users that are interested in json-wikipedia are comparing it to the libraries listed below
- NLP tools developed by Emory University.☆60Updated 8 years ago
- Software and resources for natural language processing.☆131Updated 8 years ago
- ☆184Updated 6 years ago
- Stanford Pattern-based Information Extraction and Diagnostics -- Visualization☆93Updated 10 years ago
- Solr Dictionary Annotator (Microservice for Spark)☆71Updated 5 years ago
- Dexter is a framework that implements some popular algorithms and provides all the tools needed to develop any entity linking technique.☆206Updated 7 years ago
- The WikiBrain Java library enables researchers and developers to incorporate state-of-the-art Wikipedia-based algorithms and technologies…☆91Updated 6 years ago
- DBpedia.org RDF to CSV for import into Neo4j☆51Updated 9 years ago
- A toolkit that wraps various natural language processing implementations behind a common interface.☆101Updated 7 years ago
- A Utility Library for Wikipedia dumps☆33Updated 7 years ago
- Entity disambiguation evaluation and error analysis tool☆115Updated last year
- The Berkeley Entity Resolution System jointly solves the problems of named entity recognition, coreference resolution, and entity linking…☆185Updated 5 years ago
- Mirror of Apache Stanbol (incubating)☆112Updated 11 months ago
- Json Wikipedia, contains code to convert the Wikipedia xml dump into a json dump. Questions? https://gitter.im/idio-opensource/Lobby☆17Updated 2 years ago
- An open source toolkit for mining Wikipedia☆130Updated 6 years ago
- Keeps a mirror of DBpedia live in sync☆26Updated 3 years ago
- A bunch of fancy soft string matching routines, with some accompanying datasets☆56Updated 7 years ago
- A Stanford CoreNLP server, with example clients, using Apache Thrift.☆47Updated 6 years ago
- NLP framework for JVM languages.☆148Updated 3 years ago
- SemanticVectors creates semantic WordSpace models from free natural language text.☆218Updated 2 years ago
- Linking Entities in CommonCrawl Dataset onto Wikipedia Concepts☆59Updated 12 years ago
- RDF-Centric Map/Reduce Framework and Freebase data conversion tool☆148Updated 3 years ago
- Lucene for Information Retrieval☆50Updated 2 years ago
- ☆21Updated 6 years ago
- Excitement Open Platform for Recognizing Textual Entailments☆86Updated 7 years ago
- This tool extracts word vectors from Lucene index.☆134Updated 7 years ago
- Hadoop jobs for WikiReverse project. Parses Common Crawl data for links to Wikipedia articles.☆38Updated 6 years ago
- A text tagger based on Lucene / Solr, using FST technology☆176Updated last year
- Disambiguation of Semantic Resources - Full framework☆30Updated 8 years ago
- Filter and format a newline-delimited JSON stream of Wikibase entities☆97Updated 4 months ago