Analyze and extract Wikipedia article text and attributes and store them into an ElasticSearch index or to json files (multilingual support)
☆48Aug 14, 2023Updated 2 years ago
Alternatives and similar repositories for wikipedia-to-elastic
Users that are interested in wikipedia-to-elastic are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A field-tested Hebrew tokenizer for dirty texts (ben-yehuda project, bible, cc100, mc4, opensubs, oscar, twitter) focused on multi-word e…☆22Aug 13, 2022Updated 3 years ago
- Named Entity (NER) annotations of the Hebrew Treebank (Haaretz newspaper) corpus, including: morpheme and token level NER labels, nested …☆11Dec 27, 2021Updated 4 years ago
- OKR: A Consolidated Open Knowledge Representation for Multiple Texts☆41Jan 25, 2018Updated 8 years ago
- ☆12Sep 30, 2022Updated 3 years ago
- A set of utility scripts to process Wikipedia related data☆38Jul 2, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆51May 11, 2022Updated 3 years ago
- Accompanying code for our EMNLP 2018 Demo paper "Interactive Instance-based Evaluation of Knowledge Base Question Answering"☆13Jul 29, 2019Updated 6 years ago
- A coreference evaluation package for the CoNLL and ARRAU datasets☆42Oct 3, 2020Updated 5 years ago
- Tool for parsing and converting various span encoding schemes.☆23Jan 13, 2024Updated 2 years ago
- A tool for extracting plain text and internal Wikipedia links from Wikipedia dumps☆11Apr 18, 2019Updated 7 years ago
- Detecting Trends in Job Advertisements☆20Aug 13, 2018Updated 7 years ago
- my graduation_project in CSIE☆11Dec 20, 2018Updated 7 years ago
- Web application for interactive graphs, anomaly highlighting and online monitoring.☆17Mar 15, 2016Updated 10 years ago
- Scalable Topic Modeling using Variational Inference in MapReduce☆149Oct 20, 2015Updated 10 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- 新词发现分布式机器学习算法。☆15Jul 21, 2014Updated 11 years ago
- ☆32Aug 4, 2021Updated 4 years ago
- Named Entity Disambiguation and Linking☆16May 24, 2024Updated last year
- Official library of images for the SIGIR 2019 Open-Source IR Replicability Challenge (OSIRRC 2019)☆13Jul 7, 2019Updated 6 years ago
- Extracting useful metadata from Wikipedia dumps in any language.☆26Sep 20, 2019Updated 6 years ago
- ☆15Mar 27, 2026Updated 3 weeks ago
- Non-distributional linguistic word vector representations.☆62Sep 15, 2017Updated 8 years ago
- Index and Search Your Private PDF Collection☆18Jan 16, 2016Updated 10 years ago
- Image search based on convolutional neural network feature extraction.☆14May 11, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A simple utility to index wikipedia dumps using Lucene.☆21Oct 13, 2020Updated 5 years ago
- DrQA with Tensorflow☆11Oct 28, 2017Updated 8 years ago
- Utilities for working with W&B and PyTorch Lightning in an educational context☆15Aug 4, 2021Updated 4 years ago
- Biblioteca que implementa o uso da API do articlemeta.☆11Oct 25, 2022Updated 3 years ago
- Weakly-supervised action segmentation in video☆16Feb 13, 2022Updated 4 years ago
- ☆14May 8, 2024Updated last year
- [VL/HCC 2017] TraceDiff: Debugging Unexpected Code Behavior Using Trace Divergences☆12Sep 2, 2017Updated 8 years ago
- Python tools for parsing Wikipedia/MediaWiki database dumps☆23Feb 28, 2013Updated 13 years ago
- ☆16Sep 3, 2021Updated 4 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- init☆13Feb 3, 2021Updated 5 years ago
- Crawled Wikipedia Tables with Passages☆13Aug 19, 2021Updated 4 years ago
- ☆14Apr 6, 2014Updated 12 years ago
- Session hijacking GUI tool☆15Oct 20, 2013Updated 12 years ago
- ☆21Oct 13, 2021Updated 4 years ago
- A Material design baking/cooking recipes app.☆11Feb 9, 2019Updated 7 years ago
- ☆11Dec 2, 2024Updated last year