songweige / Dmoz-DatasetView external linksLinks
content.rdf.u8.gz
☆10Dec 15, 2020Updated 5 years ago
Alternatives and similar repositories for Dmoz-Dataset
Users that are interested in Dmoz-Dataset are comparing it to the libraries listed below
Sorting:
- Corpus of domain names scraped from Common Crawl and manually annotated to add word boundaries (e.g. "commoncrawl" to "common crawl").☆20Jun 16, 2025Updated 7 months ago
- ☆24Jun 12, 2023Updated 2 years ago
- the source code of Multi-modal Circulant Fusion (MCF) for Temporal Activity Localization☆23Mar 10, 2019Updated 6 years ago
- Deep learning algorithms for web page classification written in Tensorflow (Python).☆23Oct 8, 2022Updated 3 years ago
- For <Does It Make Sense? And Why? A Pilot Study for Sense Making and Explanation>. Accepted by ACL2019☆26Oct 23, 2020Updated 5 years ago
- Code and Data for our EMNLP 2020 paper titled 'Learning to Explain: Datasets and Models for Identifying Valid Reasoning Chains in Multiho…☆28Feb 9, 2022Updated 4 years ago
- A paper list of research conducted based on wikiHow☆27Mar 5, 2022Updated 3 years ago
- arxiv daily for speech translation, legal. Ref: Vincentqyw/cv-arxiv-daily☆14Jan 6, 2025Updated last year
- ☆12Sep 22, 2015Updated 10 years ago
- Automatic subordinate clause extractor☆11Jul 7, 2022Updated 3 years ago
- Source code for paper "Looking Beyond Label Noise: Shifted Label Distribution Matters in Distantly Supervised Relation Extraction" (EMNLP…☆39Oct 29, 2019Updated 6 years ago
- Simple implementation of text-based Gridworld game. Intended for use with reinforcement learning algorithms.☆15Apr 29, 2018Updated 7 years ago
- Today's News Online (TNO) is a news aggregation system that takes in news sources of varying types and provides a single location for cli…☆16Updated this week
- NLP Preprocessing Pipeline Wrappers☆11May 12, 2023Updated 2 years ago
- DISCO: Comprehensive and Explainable Disinformation Detection, CIKM 2022☆10May 5, 2023Updated 2 years ago
- Official code repository for Findings of EMNLP 2022 paper: PseudoReasoner: Leveraging Pseudo Labels for Commonsense Knowledge Base Popula…☆11Oct 18, 2022Updated 3 years ago
- Natural Perturbation for Robust Question Answering☆12Apr 7, 2020Updated 5 years ago
- open-source Mandarian biased word dataset☆14Sep 21, 2023Updated 2 years ago
- ☆10Apr 28, 2021Updated 4 years ago
- lime-ner: extending LIME for Named Entity Recognition☆10Aug 15, 2018Updated 7 years ago
- Cluster paraphrases by word sense☆12Jan 3, 2019Updated 7 years ago
- Repo for the Unified Verbs Index Project☆12Feb 3, 2026Updated last week
- Repo collects Homework code for DSCI552/INF552 @USC 20Fall Semester.☆14Nov 27, 2020Updated 5 years ago
- ☆11Aug 28, 2017Updated 8 years ago
- The official implementation of the EMNLP 2023 paper "Paraphrase Types for Generation and Detection"☆12Oct 20, 2024Updated last year
- Code for "The Whole Truth and Nothing But the Truth: Faithful and Controllable Dialogue Response Generation with Dataflow Transduction an…☆11Apr 30, 2024Updated last year
- [COLING22] Text-to-Text Extraction and Verbalization of Biomedical Event Graphs☆10Nov 5, 2022Updated 3 years ago
- ☆14Feb 4, 2026Updated last week
- A python library for easily querying morphological inflection models trained on Unimorph☆13Oct 23, 2022Updated 3 years ago
- The Shifted and The Overlooked: A Task-oriented Investigation of User-GPT Interactions (EMNLP 2023))☆13Dec 21, 2023Updated 2 years ago
- Open-source, knowledge-grounded conversational assistant☆14Jun 30, 2025Updated 7 months ago
- ☆12Jan 7, 2020Updated 6 years ago
- Enhancing Sentence Embedding with Generalized Pooling☆11Jul 26, 2018Updated 7 years ago
- Solving Logic Grid Puzzles with Part-of-Speech Tagging and First-Order Logic☆11Dec 18, 2016Updated 9 years ago
- Plots a word association graph between the nouns in a given text with the adjectives and verbs in the text☆11Jul 19, 2019Updated 6 years ago
- Natural language processing tools developed by the World Bank's DECAT unit. A suite of text preprocessing and cleaning algorithms for NLP…☆10Jun 11, 2022Updated 3 years ago
- Sandbox for playing with Neo4J and graph approaches to NLP☆12Jul 12, 2017Updated 8 years ago
- Expletives vomiting library...☆13Apr 17, 2017Updated 8 years ago
- A repository for the updated version of CoinRun used to collect MUGEN, a multimodal video-audio-text dataset. This repo contains scripts …☆13Jul 13, 2022Updated 3 years ago