AlonEirew/wikipedia-to-elastic

Analyze and extract Wikipedia article text and attributes and store them in an Elasticsearch index or in JSON files (multilingual support)

☆49

Alternatives and similar repositories for wikipedia-to-elastic

Users who are interested in wikipedia-to-elastic compare it with the repositories listed below.

eyaler / hebrew_tokenizer
A field-tested Hebrew tokenizer for dirty texts (ben-yehuda project, bible, cc100, mc4, opensubs, oscar, twitter) focused on multi-word e…
☆23 · Aug 13, 2022 · Updated 3 years ago
vered1986 / OKR
OKR: A Consolidated Open Knowledge Representation for Multiple Texts
☆41 · Jan 25, 2018 · Updated 8 years ago
BBN-E / Rapid-customization-events-acl19
☆12 · Sep 30, 2022 · Updated 3 years ago
kermitt2 / grisp
Knowledge Base stuff
☆23 · Mar 1, 2026 · Updated 4 months ago
aviclu / CDLM
☆51 · May 11, 2022 · Updated 4 years ago
ns-moosavi / coval
A coreference evaluation package for the CoNLL and ARRAU datasets
☆42 · Oct 3, 2020 · Updated 5 years ago
peitseyang / Altering_Facial_Features
my graduation_project in CSIE
☆11 · Dec 20, 2018 · Updated 7 years ago
lkaihua / Bipeline
A Web-Based Visualization Tool for Biclustering of Multivariate Time Series
☆10 · Feb 17, 2023 · Updated 3 years ago
blester125 / iobes
Tool for parsing and converting various span encoding schemes.
☆23 · Jan 13, 2024 · Updated 2 years ago
ariecattan / coref
☆37 · Jun 12, 2023 · Updated 3 years ago
irsyadpage / NoteFinder
Search comments and highlights annotations in PDF documents.
☆12 · May 4, 2023 · Updated 3 years ago
samuelbroscheit / wikiextractor-wikimentions
A tool for extracting plain text and internal Wikipedia links from Wikipedia dumps
☆11 · Apr 18, 2019 · Updated 7 years ago
D2KLab / relink
Context-enhanced Adaptive Entity Linking
☆13 · Mar 21, 2016 · Updated 10 years ago
chili-epfl / Trend-Detection
Detecting Trends in Job Advertisements
☆20 · Aug 13, 2018 · Updated 7 years ago
aribornstein / NLPToolkits2019Notebook
7 Amazing Open Source NLP Tools to Try With Notebooks in 2019
☆22 · Dec 5, 2020 · Updated 5 years ago
ghaddarAbs / WiNER
☆32 · Aug 4, 2021 · Updated 4 years ago
qurator-spk / sbb_ned
Named Entity Disambiguation and Linking
☆16 · May 24, 2024 · Updated 2 years ago
shaialon / elasticsearch-gdelt
Elasticsearch 6.x + Node.js - Visualize Gdelt data with Kibana & Elastic: http://www.gdeltproject.org/
☆25 · Jan 24, 2018 · Updated 8 years ago
osirrc / osirrc2019-library
Official library of images for the SIGIR 2019 Open-Source IR Replicability Challenge (OSIRRC 2019)
☆13 · Jul 7, 2019 · Updated 7 years ago
shyamupa / wikidump_preprocessing
Extracting useful metadata from Wikipedia dumps in any language.
☆26 · Sep 20, 2019 · Updated 6 years ago
zhaolewen / DrQA-TF
DrQA with Tensorflow
☆11 · Oct 28, 2017 · Updated 8 years ago
ryosuzuki / trace-diff
[VL/HCC 2017] TraceDiff: Debugging Unexpected Code Behavior Using Trace Divergences
☆12 · Sep 2, 2017 · Updated 8 years ago
oriern / SuperPAL
☆24 · May 31, 2024 · Updated 2 years ago
WatChMaL / WatChMaL
☆14 · Jun 16, 2026 · Updated last month
sdsc-ordes / kg-llm-interface
Langchain-powered natural language interface to knowledge-graphs.
☆17 · Nov 3, 2025 · Updated 8 months ago
EsraaMadi / similarity-search-weaviate
Text/Image search for similar products
☆12 · Aug 12, 2022 · Updated 3 years ago
iptc / extra-ext
API implementation, User Interface, and more modules of the IPTC EXTRA project
☆13 · Feb 14, 2022 · Updated 4 years ago
danielhers / tupa
Transition-based UCCA Parser
☆74 · Dec 14, 2020 · Updated 5 years ago
harshvardhanpro / E-Commerce
Basic functions of online shopping app like SignIn, SignUp, GoogleSignIn, Signout{Using Firebase Authentication}, Product details with Vi…
☆10 · Jun 14, 2017 · Updated 9 years ago
Aayushjn / E-Commerce-App
E-Commerce Android app written in Java
☆10 · Apr 18, 2018 · Updated 8 years ago
kiankd / events
Repository for *SEM Paper on Event Coreference Resolution in ECB+
☆22 · Oct 1, 2018 · Updated 7 years ago
susravan / Edge-and-light-detection-android-app
An android app that shows the edges and light sources in the live feed from the phone's camera
☆11 · Sep 11, 2017 · Updated 8 years ago
jiangycTarheel-zz / Adversarial-MultiHopQA
☆10 · Aug 22, 2023 · Updated 2 years ago
Virtual-Protocol / game-agentic-engine-module
☆14 · May 8, 2024 · Updated 2 years ago
ddddwee1 / sul
Simple but Useful Layers based on Tensorflow
☆14 · Mar 29, 2020 · Updated 6 years ago
allenai / unifew
Unifew: Unified Fewshot Learning Model
☆18 · Sep 10, 2021 · Updated 4 years ago
smallhadroncollider / dotfiles
-.. --- - ..-. .. .-.. . ...
☆11 · Sep 16, 2021 · Updated 4 years ago
kmc-jp / channel-killer
Kills (archives) Slack channels that have had no posts for a long time
☆10 · Jan 10, 2023 · Updated 3 years ago
SJSU272LabF17 / StockPredict-ML
Stock prediction using FB Prophet algorithm
☆13 · Dec 8, 2017 · Updated 8 years ago