vinhqdang / wikipedia_analysisLinks
Different techniques to measure the quality of Wikipedia
☆11Updated 8 years ago
Alternatives and similar repositories for wikipedia_analysis
Users that are interested in wikipedia_analysis are comparing it to the libraries listed below
Sorting:
- A tool for calculation semantic similarity between words from a text corpus based on lexico-syntactic patterns.☆27Updated 9 years ago
- We will process unstructured data from web (obtained by crawling some sample websites) by maybe: having a Apache SolR installation locall…☆17Updated 9 years ago
- The Spicy Project☆13Updated 9 years ago
- A library for creating n-grams, skip-grams, bag of words, bag of n-grams, bag of skip-grams.☆14Updated 3 years ago
- Netarchivesuite development☆22Updated this week
- Cyberinfrastructure Shell (CIShell) is an open source, community-driven framework/application for the integration and utilization of data…☆31Updated 6 years ago
- Launch NMT tasks on the cloud☆13Updated 2 years ago
- Wandora is a general purpose information extraction, management and publishing application based on Topic Maps and Java.☆133Updated last year
- Another next-generation event coding platform.☆76Updated 6 years ago
- A rule-based stream reasoning engine utilizing sliding windows☆10Updated 3 years ago
- DKPro JWPL (DKPro Java Wikipedia Library) is a free, Java-based application programming interface that facilitates access to all informat…☆87Updated this week
- A curated list to help all people who are interested in the transformation of the legal profession and industry, regardless of stage on t…☆22Updated 3 years ago
- Toolforge API to label Wikipedia articles with topics based on Wikidata properties☆11Updated 5 months ago
- The GATE Embedded core API and GATE Developer application☆88Updated 10 months ago
- Searching Open Library by keywords to return ISBNs☆207Updated 4 months ago
- A multilingual linked idioms data set.☆19Updated 7 years ago
- Transliteration data and models☆56Updated 8 years ago
- A service for downloading twitter streaming data. You can save the data either in text files on disk, or in a database (MongoDB).☆23Updated 6 years ago
- UIMA-based text classification framework built on top of DKPro Core and DKPro Lab.☆35Updated 2 years ago
- A set of workflows for corpus building through OCR, post-correction and normalisation☆49Updated 2 years ago
- ☆12Updated 4 years ago
- The WikiBrain Java library enables researchers and developers to incorporate state-of-the-art Wikipedia-based algorithms and technologies…☆95Updated 7 years ago
- Mirror of Apache OpenNLP Add-ons☆17Updated 2 weeks ago
- Examples for getting started using https://case.law☆67Updated 2 years ago
- This repository contains the Domain Discovery Tool (DDT) project. DDT is an interactive system that helps users explore and better unders…☆46Updated 3 years ago
- MinorThird is a collection of Java classes for storing text, annotating text, and learning to extract entities and categorize text.☆58Updated 7 years ago
- A simple collocation-driven recognition of rhymes. Contains pre-trained models for Czech, Dutch, English, French, German, Russian, and Sp…☆30Updated 2 months ago
- Apache UIMA Java SDK☆66Updated 6 months ago
- Common web archive utility code.☆56Updated last month
- The curation repository for the data behind Concepticon.☆39Updated last week