problemsniper / Crawl-Wiki-For-AcronymsLinks
Crawling Wikipedia to extract some Data
☆18Updated 2 years ago
Alternatives and similar repositories for Crawl-Wiki-For-Acronyms
Users that are interested in Crawl-Wiki-For-Acronyms are comparing it to the libraries listed below
Sorting:
- rasactl deploys Rasa X / Enterprise on your local or remote Kubernetes cluster and manages Rasa X / Enterprise deployments.☆15Updated 3 years ago
- T5Patches is a set of tools for fast and targeted editing of generative language models built with T5X.☆12Updated last year
- ☆44Updated 4 years ago
- MozoLM: A language model (LM) serving library☆45Updated this week
- Launch NMT tasks on the cloud☆13Updated 2 years ago
- This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-memcache☆13Updated 2 years ago
- The Atlas of Pidgin and Creole Language Structures☆12Updated 2 years ago
- it's art☆10Updated 10 years ago
- The World Atlas Of Language Structures Online☆129Updated 6 months ago
- Collaborative data curation for Glottolog☆168Updated last week
- This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-recommendations-ai☆18Updated 2 years ago
- Code for constructing TLDR corpus from Reddit dataset☆25Updated 3 years ago
- Language data and utilities☆18Updated last week
- Code and scripts used to automate delivery of tool packages used in virtual-environments.☆22Updated 3 years ago
- Login page and core auth library☆19Updated 5 months ago
- Tools to construct and process Common Crawl webgraphs☆92Updated last week
- YT_subtitles - extracts subtitles from YouTube videos to raw text for Language Model training☆43Updated 4 years ago
- API schema of Jina command line interface exposed as JSON and YAML files.☆13Updated 5 months ago
- The Directory of Open Access Journals - website and directory software☆60Updated this week
- The goal is to pilot Microsoft Cognitive Services to unlock the strategic value of UN unstructured content by building on AI and semantic…☆13Updated 2 years ago
- Crawler for linguistic corpora☆205Updated last year
- Script for downloading GitHub.☆12Updated 4 years ago
- A machine readable JSON QAnon dataset, archiving all QAnon drops for research only☆26Updated 3 months ago
- Get the scholarly citation for any research product: software, preprint, paper, or dataset☆82Updated 2 years ago
- A curated list of awesome resources for COVID-19☆37Updated 5 years ago
- Simple Python client for the Hugging Face Inference API☆74Updated 4 years ago
- Experiments to help discussion on Wikipedia talk pages☆66Updated last week
- wordnik python3 library☆78Updated last year
- client app for the gRPC health-checking protocol☆100Updated 5 years ago
- A field-tested Hebrew tokenizer for dirty texts (ben-yehuda project, bible, cc100, mc4, opensubs, oscar, twitter) focused on multi-word e…☆23Updated 2 years ago