problemsniper / Crawl-Wiki-For-Acronyms
Crawling Wikipedia to extract some Data
☆18 Updated 2 years ago
Alternatives and similar repositories for Crawl-Wiki-For-Acronyms
Users who are interested in Crawl-Wiki-For-Acronyms are comparing it to the libraries listed below
- ☆45 Updated 4 years ago
- Proposed production data for CLDR data ☆29 Updated this week
- Login page and core auth library ☆20 Updated 11 months ago
- This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-memcache ☆13 Updated 2 years ago
- it's art ☆10 Updated 10 years ago
- Language data and utilities ☆18 Updated 2 weeks ago
- T5Patches is a set of tools for fast and targeted editing of generative language models built with T5X. ☆12 Updated last year
- rasactl deploys Rasa X / Enterprise on your local or remote Kubernetes cluster and manages Rasa X / Enterprise deployments. ☆15 Updated 3 years ago
- MozoLM: A language model (LM) serving library ☆47 Updated last week
- YT_subtitles - extracts subtitles from YouTube videos to raw text for Language Model training ☆47 Updated 5 years ago
- Original schema.org python-appengine codebase ☆19 Updated 3 years ago
- This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-monitoring-dashboards ☆19 Updated 2 years ago
- Fork from python/cpython ☆12 Updated 7 years ago
- Collaborative data curation for Glottolog ☆184 Updated last week
- This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-storage-transfer ☆12 Updated 2 years ago
- ☆27 Updated last week
- Tools to construct and process Common Crawl webgraphs ☆105 Updated last week
- PHOIBLE Online ☆46 Updated 3 years ago
- WikiLoop DoubleCheck: a web tool to help review Wikipedia edits easily and collaboratively. ☆82 Updated last year
- The Directory of Open Access Journals - website and directory software ☆62 Updated this week
- The World Atlas Of Language Structures Online ☆131 Updated 3 weeks ago
- Flat files containing available context annotation entities. ☆35 Updated 3 years ago
- This is a repository for NaijaSenti. A Lacuna Funded Project for the development of sentiment corpus for four Nigerian languages: Igbo, H… ☆36 Updated 3 months ago
- The Unicode Cookbook for Linguists ☆56 Updated 5 years ago
- This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-recommendations-ai ☆18 Updated 2 years ago
- Code for constructing TLDR corpus from Reddit dataset ☆27 Updated 4 years ago
- Analysis of gutenberg dataset ☆44 Updated 7 years ago
- Language detection using Spacy and Fasttext ☆57 Updated 2 years ago
- Titus 2 : Portable Format for Analytics (PFA) implementation for Python 3.4+ ☆23 Updated 3 years ago
- Samples used in Google Cloud Storage documentation. ☆18 Updated 7 years ago