problemsniper / Crawl-Wiki-For-Acronyms
Crawling Wikipedia to extract some Data
☆17Updated last year
Alternatives and similar repositories for Crawl-Wiki-For-Acronyms:
Users that are interested in Crawl-Wiki-For-Acronyms are comparing it to the libraries listed below
- Generate language n-gram statistics☆17Updated 2 years ago
- This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-memcache☆13Updated last year
- List of (possible) English hedge words☆45Updated 2 years ago
- Towards Neural Phrase-based Machine Translation☆12Updated last year
- Open Use of Data Agreement - Removing Barriers to Data Innovation☆17Updated 3 years ago
- A dashboard that visualizes publicly released Google Vaccination Search data.☆19Updated 3 months ago
- Computational Use of Data Agreement - Removing Barriers to Data Innovation☆20Updated last year
- ☆43Updated 3 years ago
- Markdown builder functions.☆13Updated 2 years ago
- The source of the phonetic transcriptions is Oxford Advanced Learner's Dictionary (3rd ed.), available from the Oxford Text Archive (http…☆23Updated 7 years ago
- MozoLM: A language model (LM) serving library☆44Updated 2 months ago
- ☆10Updated 2 years ago
- PHOIBLE Online☆42Updated 2 years ago
- Zenodo Developers Site☆11Updated 7 months ago
- List of easy American-English words: The New Dale-Chall (1995)☆32Updated 2 years ago
- Scripts to take hand washing related text in (almost) any language and float it into a hand washing poster.☆9Updated 3 years ago
- Tesseract source code and API documentation☆13Updated 3 years ago
- An open, comprehensive catalog of scholarship, connecting papers, authors, institutions, and journals.☆10Updated last year
- A JS parser for (binary) `.npy` files.☆16Updated 2 years ago
- DCMI Usage Board - meeting record and decisions☆9Updated 9 months ago
- generate rules from lists of words☆16Updated 3 years ago
- Service to automatically generate specs from various source formats☆25Updated this week
- A VSCode extension that check links in Markdown to ensure they are valid.☆16Updated last year
- State of the Map 2020 website☆9Updated 2 months ago
- The socket.io layer of Overleaf for real-time editor interactions☆17Updated 3 years ago
- This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-storage-transfer☆12Updated last year
- T5Patches is a set of tools for fast and targeted editing of generative language models built with T5X.☆12Updated 7 months ago
- WikiLoop DoubleCheck: a web tool to help review Wikipedia edits easily and collaboratively.☆80Updated 7 months ago
- DNS records for Jekyll properties. Uses octodns to sync.☆13Updated 2 months ago
- An alternative approach for probabilistic topic modeling based on agglomerative clustering of topics (not documents)☆12Updated 3 years ago