bit-ml / VeriDark
Dark Web Authorship Verification Dataset
☆12Updated last year
Alternatives and similar repositories for VeriDark:
Users that are interested in VeriDark are comparing it to the libraries listed below
- ☆19Updated 2 years ago
- An agent-based model for scientific inquiry based on abstract argumentation☆11Updated 3 years ago
- ☆11Updated 3 years ago
- The Union of Intersections Framework in Python☆14Updated this week
- ☆10Updated 4 years ago
- CORWA: A Citation-Oriented Related Work Annotation Dataset, NAACL 2022☆16Updated 5 months ago
- GisPy: A Tool for Measuring Gist Inference Score in Text https://aclanthology.org/2022.wnu-1.5/☆12Updated 9 months ago
- The repository for the paper "When Do You Need Billions of Words of Pretraining Data?"☆21Updated 4 years ago
- MultiCite code and data. Models are available on Huggingface.☆31Updated 2 years ago
- Code for the paper "Code-Mixing on Sesame Street: Dawn of the Adversarial Polyglots" (NAACL-HLT 2021)☆10Updated 3 years ago
- This repository hosts the code for a tokenizer of tweets.☆12Updated 6 years ago
- Data and Code for "The Values Encoded in Machine Learning Research"☆44Updated 2 years ago
- An example of how to use spaCy for extremely large files without running into memory issues☆36Updated 2 years ago
- Tool for the Automatic Assessment of Lexical Diversity☆11Updated 4 years ago
- Python library for extracting quantitative, reproducible metrics of multi-level alignment between speakers in naturalistic language corpo…☆45Updated 2 weeks ago
- Authorship Verification in Social Media via Attention-based Similarity Learning☆22Updated 3 years ago
- XAI based human-in-the-loop framework for automatic rule-learning.☆48Updated 9 months ago
- Replication materials for "Identifying the Development and Application of Artificial Intelligence in Scientific Text"☆12Updated 5 years ago
- A toolkit for social media information extraction using multi-task learning and active learning☆19Updated 2 years ago
- Symbolic Brittleness in Sequence Models: on Systematic Generalization in Symbolic Mathematics (AAAI 2022)☆14Updated 3 years ago
- Data and code related to the report "Truth, Lies, and Automation: How Language Models Could Change Disinformation"☆27Updated 3 years ago
- Learned string similarity for entity names using optimal transport.☆35Updated 4 years ago
- Relevant code for the "Show Your Work" paper, EMNLP 2019.☆18Updated 5 years ago
- Make the Best of Cross-lingual Transfer: Evidence from POS Tagging with over 100 Languages (ACL 2022)☆19Updated 2 years ago
- This repository hosts the dataset for the paper Computer Science Named Entity Recognition in the Open Research Knowledge Graph☆22Updated last year
- ☆33Updated 11 years ago
- Making a bridge between NLP models and Brain data☆18Updated 4 years ago
- A collection of utilities for writing labeling functions, transformation functions, and slicing functions.☆20Updated 4 years ago
- ☆30Updated 3 years ago
- Weakly Supervised Text-to-SQL Parsing through Question Decomposition☆22Updated last year