Hadoop jobs for WikiReverse project. Parses Common Crawl data for links to Wikipedia articles.
☆38Aug 12, 2018Updated 7 years ago
Alternatives and similar repositories for wikireverse
Users that are interested in wikireverse are comparing it to the libraries listed below
Sorting:
- FoGFaaS: Add serverless computing (faas) to ifogsim☆22Mar 30, 2025Updated 11 months ago
- Linking Entities in CommonCrawl Dataset onto Wikipedia Concepts☆59Sep 5, 2012Updated 13 years ago
- Deep learning spelling patterns with a recurrent neural network☆12Jun 5, 2017Updated 8 years ago
- ☆15Feb 8, 2015Updated 11 years ago
- Neural Response Ranker for Alana, Heriot-Watt University's Alexa Prize Socialbot☆12Nov 21, 2022Updated 3 years ago
- Creative Commons Media-Fingerprint Library☆12Sep 23, 2013Updated 12 years ago
- Fast structured perceptron sequential labeler☆15Dec 8, 2015Updated 10 years ago
- A sample application that consumes from twitter using HBC and producing into Amazon Kinesis☆12Oct 20, 2015Updated 10 years ago
- Cassandra based storage layer for JGit☆34Mar 5, 2011Updated 15 years ago
- The core NLP library for automatic question generation☆17Mar 7, 2017Updated 8 years ago
- ☆18Jun 16, 2025Updated 8 months ago
- ☆14Feb 26, 2022Updated 4 years ago
- Stand-alone service for fuzzy lookup of string labels of resources☆17May 8, 2017Updated 8 years ago
- VLog is a high-performance Datalog engine. It is highly memory efficient and can process large programs with thousands of rules.☆17Mar 22, 2018Updated 7 years ago
- Node bindings for MIT Information Extraction https://github.com/mit-nlp/MITIE☆22May 22, 2019Updated 6 years ago
- DBpedia Open Text Extraction Challenge - a never ending knowledge acquisition spiral☆19Aug 7, 2017Updated 8 years ago
- Exploratory data analysis on various datasets from FiveThirtyEight and Udacity coursework☆23Mar 8, 2017Updated 8 years ago
- NLP Utilities in Java☆43Dec 14, 2022Updated 3 years ago
- SKOS Support for Apache Lucene and Solr☆56May 12, 2021Updated 4 years ago
- ☆25Jul 6, 2023Updated 2 years ago
- ☆22Aug 24, 2017Updated 8 years ago
- Text classification code described in "SoPa: Bridging CNNs, RNNs, and Weighted Finite-State Machines" by Roy Schwartz, Sam Thomson and No…☆54Jul 7, 2022Updated 3 years ago
- Context Selection for Embedding Models☆27Nov 2, 2017Updated 8 years ago
- [EMNLP 2024 Findings] Benchmarking Language Model Agents for Data-Driven Science☆34Oct 25, 2024Updated last year
- Convert SPARQL results to a pandas dataframe☆28Oct 23, 2024Updated last year
- Frictionless Machine Learning on Kubernetes☆15Mar 7, 2023Updated 2 years ago
- Nordlys: Toolkit for entity-oriented and semantic search☆31Mar 23, 2021Updated 4 years ago
- ECS Scheduler for Running Massive Parallel Computations☆71Feb 2, 2016Updated 10 years ago
- Alternative implementation of the coreference scorer for the CoNLL-2011/2012 shared tasks on coreference resolution☆11Apr 29, 2021Updated 4 years ago
- Generates a set of property-specific entity embeddings from knowledge graphs using node2vec☆78Jul 30, 2021Updated 4 years ago
- TASU: A New Style of Alignment of Speech LLM with only Text Training Data, zero-shot on ASR and Other SU tasks☆22Jan 19, 2026Updated last month
- Based on Neural Amp Modeler 0.7.1 with some enhanced features☆12Apr 18, 2023Updated 2 years ago
- Neural network language models, including feed-forward neural network, recurrent neural network, long-short term memory neural network.☆11Aug 3, 2017Updated 8 years ago
- Mirror of official OpenEMR Sourceforge repository☆18Updated this week
- Data source of the Energy Transition Model☆18Updated this week
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests☆41Oct 14, 2022Updated 3 years ago
- ☆12Feb 16, 2024Updated 2 years ago
- administrative software for local food networks, in django☆16Oct 7, 2020Updated 5 years ago
- ☆11Aug 17, 2014Updated 11 years ago