Extracting useful metadata from Wikipedia dumps in any language.
☆26Sep 20, 2019Updated 6 years ago
Alternatives and similar repositories for wikidump_preprocessing
Users that are interested in wikidump_preprocessing are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆19Dec 19, 2018Updated 7 years ago
- Vectorizing knowledge bases for entity linking☆15Feb 21, 2021Updated 5 years ago
- pytorch model for cross-lingual entity linking.☆16Mar 13, 2019Updated 7 years ago
- Code for neural-el - EMNLP'17☆84Mar 24, 2023Updated 3 years ago
- Code for EMNLP 2018 paper https://arxiv.org/pdf/1808.09075.pdf☆38Aug 23, 2018Updated 7 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Neural Architectures for Fine Grained Entity Type Classification☆85Oct 7, 2017Updated 8 years ago
- ☆43Feb 3, 2019Updated 7 years ago
- Representation Learning of Entities and Documents from Knowledge Base Descriptions☆18Oct 6, 2018Updated 7 years ago
- A tool for extracting plain text from Wikipedia dumps☆15Oct 3, 2019Updated 6 years ago
- Ultra-Fine Entity Typing with Weak Supervision from a Masked Language Model☆18Aug 2, 2021Updated 4 years ago
- [deprecated] reference code for string segmentation using LSTM(tensorflow)☆19Feb 19, 2020Updated 6 years ago
- Implementation for Decision-focused Summarization (EMNLP2021)☆12Mar 14, 2022Updated 4 years ago
- ☆10May 1, 2025Updated 11 months ago
- [ACL2023] Source code for Dialogue Summarization with Static-Dynamic Structure Fusion Graph☆11Dec 17, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Awesome paper lists for "A Desideratum for Conversational Agents: Capabilities, Challenges, and Future Directions""☆32Apr 25, 2025Updated 11 months ago
- Recursive Bayesian Networks☆11May 11, 2025Updated 11 months ago
- Fine-Grained Entity Type Classification by Jointly Learning Representations and Label Embeddings☆19Feb 26, 2019Updated 7 years ago
- ☆11Apr 20, 2020Updated 5 years ago
- Streamlit apps on Cloud Run with Identity-Aware Proxy (IAP).☆10Mar 5, 2022Updated 4 years ago
- Public repo for the paper: "Modeling Intensification for Sign Language Generation: A Computational Approach" by Mert Inan*, Yang Zhong*, …☆14Mar 15, 2022Updated 4 years ago
- A Python library for variable type checker/validator/converter at a run time.☆17Oct 27, 2025Updated 5 months ago
- Censored tweets annotated for specificity; AAAI 2019 paper: Predicting and Analyzing Language Specificity in Social Media Posts☆11Oct 19, 2021Updated 4 years ago
- ☆15Aug 3, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- EMNLP 2022: Leveraging Locality in Abstractive Text Summarization☆18Oct 21, 2024Updated last year
- Indonesian part-of-speech (POS) tagging☆15Jul 24, 2022Updated 3 years ago
- A deliberately simple Django app for managing IT inventory☆13Jul 14, 2016Updated 9 years ago
- Curated list of Wikidata Projects☆24Mar 3, 2026Updated last month
- ☆14Feb 15, 2016Updated 10 years ago
- PyTorch implementation of "Variational Autoencoders with Jointly Optimized Latent Dependency Structure" [ICLR 2019]☆13Jul 14, 2019Updated 6 years ago
- ☆22Jan 5, 2024Updated 2 years ago
- An original implementation of the paper "CREPE: Open-Domain Question Answering with False Presuppositions"☆16Nov 5, 2024Updated last year
- ☆13Jun 30, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Code for the ICLR 2019 paper "Learning to Represent Edits"☆13Dec 8, 2022Updated 3 years ago
- ☆34Nov 29, 2016Updated 9 years ago
- The awesome papers for biomedical entitiy linking/entity alignment/NEL☆38Dec 2, 2019Updated 6 years ago
- LTL2PDDL tool☆11Jul 7, 2017Updated 8 years ago
- This is the official code for the paper 'Systematically Exploring Redundancy Reduction inSummarizing Long Documents'.☆16Apr 30, 2021Updated 4 years ago
- Named Entity Disambiguation for Noisy Text☆66Jun 26, 2017Updated 8 years ago
- ☆23Oct 15, 2022Updated 3 years ago