jodaiber/Annotated-WikiExtractor

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/jodaiber/Annotated-WikiExtractor)

jodaiber / Annotated-WikiExtractor

Simple Wikipedia plain text extractor with article link annotations and Hadoop support.

☆103

Alternatives and similar repositories for Annotated-WikiExtractor

Users that are interested in Annotated-WikiExtractor are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

wikilinks / neleval
View on GitHub
Entity disambiguation evaluation and error analysis tool
☆116Mar 19, 2023Updated 3 years ago
rloganiv / kglm-data
View on GitHub
Code used to create the Linked WikiText-2 dataset
☆16May 22, 2023Updated 3 years ago
WikiExtractor / wikiextractor
View on GitHub
A tool for extracting plain text from Wikipedia dumps
☆3,996Updated this week
hasibi / TAGME-Reproducibility
View on GitHub
Reproducibility of the TAGME entity linking system
☆60May 10, 2019Updated 7 years ago
matthewfl / nlp-entity-convnet
View on GitHub
Convolutional network for entity linking (Naacl 2016)
☆58Jul 19, 2016Updated 10 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
bwbaugh / wikipedia-extractor
View on GitHub
This is a mirror of the script by Giuseppe Attardi, and contains history before the official repo started: https://github.com/attardi/wik…
☆260Aug 17, 2016Updated 9 years ago
minwoo / TriCRF
View on GitHub
Triangular-chain CRF
☆25Jul 27, 2015Updated 10 years ago
jeffheaton / article-code
View on GitHub
☆23Oct 11, 2019Updated 6 years ago
faraday / wikiprep-esa
View on GitHub
ESA implementation using Wikiprep output
☆56Oct 18, 2013Updated 12 years ago
saffsd / wikidump
View on GitHub
Tools to manipulate and extract data from wikipedia dumps
☆47May 21, 2013Updated 13 years ago
wikilinks / conll03_nel_eval
View on GitHub
Python evaluation scripts for AIDA-formatted CoNLL data
☆20Aug 4, 2014Updated 11 years ago
dalab / pboh-entity-linking
View on GitHub
Source code for the paper "Probabilistic Bag-Of-Hyperlinks Model for Entity Linking" , http://dl.acm.org/citation.cfm?id=2882988
☆58Oct 28, 2018Updated 7 years ago
ryansb / ipylogue
View on GitHub
A git-backed store for ipython notebooks
☆13Apr 13, 2015Updated 11 years ago
coastalcph / supersense-data-twitter
View on GitHub
Tweets annotated with coarse-grained sense labels (supersenses)
☆13Jun 13, 2014Updated 12 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
timvieira / rl
View on GitHub
Reference implementation of algorithms for reinforcement learning and Markov decision processes.
☆12Jan 28, 2021Updated 5 years ago
G-Research / dgraph-dbpedia
View on GitHub
Pre-processing DBpedia datasets to load into Dgraph
☆13Mar 6, 2022Updated 4 years ago
cttsai / illinois-cross-lingual-wikifier
View on GitHub
☆24Sep 28, 2017Updated 8 years ago
hasibi / EntityLinkingInQueries-Methods
View on GitHub
Entity Linking in Queries: Efficiency vs. Effectiveness
☆18Nov 16, 2017Updated 8 years ago
AdeDZY / SIGIR19-BERT-IR
View on GitHub
Repo of code and data for SIGIR-19 short paper "Deeper Text Understanding for IR with Contextual NeuralLanguage Modeling"
☆163Jan 3, 2020Updated 6 years ago
JonathanRaiman / wikipedia_ner
View on GitHub
Labeled examples from wiki dumps in Python
☆67Aug 8, 2016Updated 9 years ago
semanticize / semanticizer
View on GitHub
Entity Linking for the masses
☆57Nov 10, 2015Updated 10 years ago
rmit-ir / KrovetzStemmer
View on GitHub
Python binding to the KrovetzStemmer package (C++ version)
☆14Feb 12, 2023Updated 3 years ago
AntNLP / antu
View on GitHub
Universal data IO and neural network modules in NLP tasks.
☆18Apr 13, 2026Updated 3 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
c-amr / camr
View on GitHub
Transition-based tree-to-graph AMR Parser
☆127Feb 18, 2018Updated 8 years ago
bloomberg / cnn-rnf
View on GitHub
Convolutional Neural Networks with Recurrent Neural Filters
☆53Apr 15, 2019Updated 7 years ago
SUDA-LA / dep-cross-domain
View on GitHub
[ACL'19] Code for "Semi-supervised Domain Adaptation for Dependency Parsing"
☆14Jun 14, 2019Updated 7 years ago
google-research-datasets / wiki-split
View on GitHub
One million English sentences, each split into two sentences that together preserve the original meaning, extracted from Wikipedia edits.
☆125Jun 3, 2019Updated 7 years ago
U-Alberta / wned
View on GitHub
A sytem for Named Entity Disambiguation based on Random Walks and Learning to Rank.
☆19Feb 26, 2022Updated 4 years ago
dykang / adventure
View on GitHub
code for ACL 2018 paper by Kang et al., "AdvEntuRe: Adversarial Training for Textual Entailment with Knowledge-Guided Examples "
☆17Aug 30, 2019Updated 6 years ago
anuzzolese / oke-challenge-2016
View on GitHub
☆22Aug 24, 2017Updated 8 years ago
peitseyang / Altering_Facial_Features
View on GitHub
my graduation_project in CSIE
☆11Dec 20, 2018Updated 7 years ago
idio / wiki2vec
View on GitHub
Generating Vectors for DBpedia Entities via Word2Vec and Wikipedia Dumps. Questions? https://gitter.im/idio-opensource/Lobby
☆602Jan 11, 2018Updated 8 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
dbpedia-spotlight / dbpedia-spotlight
View on GitHub
DBpedia Spotlight is a tool for automatically annotating mentions of DBpedia resources in text.
☆759Mar 8, 2018Updated 8 years ago
JT-Ushio / ECNU17_Summer_Seminar
View on GitHub
ECNU NLP group learns CS224n in the form of seminars in the 2017 summer.
☆10Aug 12, 2017Updated 8 years ago
marcocor / bat-framework
View on GitHub
A framework to compare entity linking systems.
☆38Jul 29, 2018Updated 7 years ago
tomhosking / torchseq
View on GitHub
Pytorch Seq2Seq framework
☆27Feb 18, 2026Updated 5 months ago
diffbot / wikistatsextractor
View on GitHub
Extract statistics from Wikipedia Dump files.
☆26Aug 2, 2021Updated 4 years ago
aiUIUC / pyAIUtils
View on GitHub
Utility functions and classes for building Artificial Intelligence systems in Python
☆23Apr 14, 2017Updated 9 years ago
giuseppetotaro / ctakes-clinical-pipeline
View on GitHub
Clinical Pipeline Engine using Apache cTAKES
☆24Nov 9, 2015Updated 10 years ago