problemsniper / Crawl-Wiki-For-AcronymsLinks
Crawling Wikipedia to extract some Data
☆18Updated 2 years ago
Alternatives and similar repositories for Crawl-Wiki-For-Acronyms
Users that are interested in Crawl-Wiki-For-Acronyms are comparing it to the libraries listed below
Sorting:
- rasactl deploys Rasa X / Enterprise on your local or remote Kubernetes cluster and manages Rasa X / Enterprise deployments.☆15Updated 3 years ago
- Launch NMT tasks on the cloud☆13Updated 2 years ago
- The goal is to pilot Microsoft Cognitive Services to unlock the strategic value of UN unstructured content by building on AI and semantic…☆16Updated 2 years ago
- A simulator for zombie apocalypse with scipy☆24Updated 7 years ago
- This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-memcache☆13Updated 2 years ago
- Tools to construct and process Common Crawl webgraphs☆99Updated this week
- Citation bot is a tool to expand and format references at Wikipedia. It retrieves citation data from a variety of sources including Cross…☆65Updated this week
- This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-bigquery-connection☆29Updated 2 years ago
- Statistical text analysis and semantic networks with Python☆13Updated 7 years ago
- This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-monitoring-dashboards☆19Updated last year
- ☆43Updated 9 months ago
- OpenAPI definitions for the Federated Data Sharing Common API☆17Updated 3 years ago
- [archived]☆18Updated 4 years ago
- Repository for Discussions and Materials about The Carpentries Workbench☆19Updated last week
- Experiments to help discussion on Wikipedia talk pages☆67Updated last month
- This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-recommendations-ai☆18Updated 2 years ago
- Archive.org API Server☆38Updated last year
- This repository is used for maintaining the SDMX-ML format specification☆10Updated 5 months ago
- Language detection using Spacy and Fasttext☆57Updated last year
- How do we process data in different formats like docx, pdf etc and generate insights to be linked with structured data in database?This p…☆14Updated 5 years ago
- Experimental/test repo for the Canada.ca design library - the official repo is over here: https://github.com/canada-ca/design-system and …☆10Updated 3 years ago
- This directory gathers the tools developed by the Data Sourcing Working Group☆31Updated 4 years ago
- Issues and milestones for the DataCite organization☆47Updated last year
- ☆44Updated 4 years ago
- A field-tested Hebrew tokenizer for dirty texts (ben-yehuda project, bible, cc100, mc4, opensubs, oscar, twitter) focused on multi-word e…☆23Updated 3 years ago
- Awk based command-line tool to access some Wikimedia API functions☆36Updated 2 months ago
- DNS records for Jekyll properties. Uses octodns to sync.☆14Updated 2 weeks ago
- Proposed production data for CLDR data☆29Updated this week
- Retired repository for Machine Learning utils at the Wellcome Trust (now deprecated).☆31Updated 2 years ago
- Simple Python client for the Hugging Face Inference API☆75Updated 5 years ago