google-research-datasets / common-crawl-domain-names
Corpus of domain names scraped from Common Crawl and manually annotated to add word boundaries (e.g. "commoncrawl" to "common crawl").
☆20 · Jun 16, 2025 · Updated 8 months ago
Alternatives and similar repositories for common-crawl-domain-names
Users who are interested in common-crawl-domain-names are comparing it to the repositories listed below.
- content.rdf.u8.gz ☆10 · Dec 15, 2020 · Updated 5 years ago
- The repository for the paper "When Do You Need Billions of Words of Pretraining Data?" ☆21 · Nov 10, 2020 · Updated 5 years ago
- For <Does It Make Sense? And Why? A Pilot Study for Sense Making and Explanation>. Accepted by ACL2019 ☆26 · Oct 23, 2020 · Updated 5 years ago
- A paper list of research conducted based on wikiHow ☆27 · Mar 5, 2022 · Updated 3 years ago
- ☆12 · Sep 22, 2015 · Updated 10 years ago
- Automatic subordinate clause extractor ☆11 · Jul 7, 2022 · Updated 3 years ago
- AutoBench: Benchmarking Automation for Intelligent Document Processing (IDP) with confidence ☆11 · Mar 18, 2025 · Updated 10 months ago
- lime-ner: extending LIME for Named Entity Recognition ☆10 · Aug 15, 2018 · Updated 7 years ago
- Visual Studio Solution Starter Kit to download and base any Content Hub Development on. It supports Intellisense, Sync, Debugging and Uni… ☆11 · Jan 14, 2026 · Updated last month
- Which elected officials of the Republic (deputies, ministers, mayors) still use x.com? ☆14 · Feb 8, 2026 · Updated last week
- ACK service controller for Amazon DynamoDB ☆14 · Jan 8, 2026 · Updated last month
- Use Python to Automate the PowerPoint Update ☆15 · May 28, 2023 · Updated 2 years ago
- ☆10 · Apr 28, 2021 · Updated 4 years ago
- Use Stage Variables in API Gateway to point to different version of AWS Lambda Functions. ☆12 · May 7, 2021 · Updated 4 years ago
- Certified Kubernetes Application Development training ☆11 · Feb 28, 2020 · Updated 5 years ago
- How to build your own policy engine ☆14 · Jul 24, 2022 · Updated 3 years ago
- Repo collects Homework code for DSCI552/INF552 @USC 20Fall Semester. ☆14 · Nov 27, 2020 · Updated 5 years ago
- Simple implementation of text-based Gridworld game. Intended for use with reinforcement learning algorithms. ☆15 · Apr 29, 2018 · Updated 7 years ago
- Repository for paper Decrypting Cryptic Crosswords ☆10 · Jan 15, 2022 · Updated 4 years ago
- Repo for this AWS Blogpost ☆14 · Jan 30, 2026 · Updated 2 weeks ago
- Website that helps you to grind leetcode based on your previous activity ☆11 · Oct 23, 2021 · Updated 4 years ago
- ☆14 · Apr 29, 2024 · Updated last year
- Rust client library for @tailscale. ☆11 · Oct 5, 2022 · Updated 3 years ago
- ☆12 · Jan 26, 2024 · Updated 2 years ago
- The Shifted and The Overlooked: A Task-oriented Investigation of User-GPT Interactions (EMNLP 2023) ☆13 · Dec 21, 2023 · Updated 2 years ago
- Natural Perturbation for Robust Question Answering ☆12 · Apr 7, 2020 · Updated 5 years ago
- A python library for easily querying morphological inflection models trained on Unimorph ☆13 · Oct 23, 2022 · Updated 3 years ago
- Platform API as Configuration ☆10 · Aug 18, 2020 · Updated 5 years ago
- Go package that wraps around OpenAI HTTP APIs ☆12 · Mar 2, 2023 · Updated 2 years ago
- Enhancing Sentence Embedding with Generalized Pooling ☆11 · Jul 26, 2018 · Updated 7 years ago
- Solving Logic Grid Puzzles with Part-of-Speech Tagging and First-Order Logic ☆11 · Dec 18, 2016 · Updated 9 years ago
- Cluster paraphrases by word sense ☆12 · Jan 3, 2019 · Updated 7 years ago
- SpExtor: Sparse Entity Extractor ☆11 · Feb 10, 2020 · Updated 6 years ago
- Code for EMNLP 2021 paper "Measuring Association Between Labels and Free-Text Rationales" ☆12 · Sep 12, 2023 · Updated 2 years ago
- Quickly open links from Kubernetes resources using jsonpath templates. ☆13 · Apr 3, 2024 · Updated last year
- An Amazon Kendra REST API CDK example with an API Gateway, including authentication with AWS Cognito and AWS X-Ray Tracing ☆17 · Apr 10, 2025 · Updated 10 months ago
- Collection of code examples, snippets, demos for running Python in Snowflake ☆24 · Aug 6, 2025 · Updated 6 months ago
- ☆10 · Feb 27, 2020 · Updated 5 years ago
- Improve Pod scheduling by making Kubernetes aware of its network topology, reducing latency and data transfer cost ☆71 · Sep 3, 2025 · Updated 5 months ago