[LREC 2022] An off-the-shelf pre-trained Tweet NLP Toolkit (NER, tokenization, lemmatization, POS tagging, dependency parsing) + Tweebank-NER dataset
☆106Jan 24, 2024Updated 2 years ago
Alternatives and similar repositories for TweebankNLP
Users that are interested in TweebankNLP are comparing it to the libraries listed below.
- Automatically detect errors in annotated corpora.☆48Sep 8, 2023Updated 2 years ago
- Grammar test suite for masked language models☆10Jan 1, 2023Updated 3 years ago
- Repository for our paper "AbuseAnalyzer: Abuse Detection, Severity and Target Prediction for Gab Posts"☆11Jul 18, 2021Updated 4 years ago
- Replication code for "The Structure of Toxic Conversations on Twitter" (WWW'21)☆10May 25, 2021Updated 4 years ago
- Non-Metric Space (Approximate) Library in R☆12Feb 2, 2023Updated 3 years ago
- A tool for extracting plain text and internal Wikipedia links from Wikipedia dumps☆11Apr 18, 2019Updated 6 years ago
- NMS and Soft-NMS implemented in PyTorch and NumPy☆12Mar 22, 2021Updated 5 years ago
- BERTweet: A pre-trained language model for English Tweets (EMNLP-2020)☆606Jul 22, 2024Updated last year
- NELA Features for News Veracity. Used in multiple studies.☆10Oct 14, 2020Updated 5 years ago
- Cross-Linguistic Norms, Ratings, and Relations for Words and Concepts☆16Apr 2, 2026Updated last week
- CrossRE: A Cross-Domain Dataset for Relation Extraction (Findings of EMNLP 2022)☆50Aug 20, 2024Updated last year
- A repository to keep tools, scripts, data for SMART task.☆11May 24, 2022Updated 3 years ago
- [COLING 2022]: CommunityLM: Probing Partisan Worldviews from Language Models☆15Jan 31, 2023Updated 3 years ago
- 🕸 GlotWeb: Web Indexing for Minority Languages (WWW 2026)☆17Feb 27, 2026Updated last month
- Source code for CoNLL 2021 paper by Huebner et al. 2021☆21Jul 13, 2023Updated 2 years ago
- Easy black-box access to state-of-the-art language models☆16Jun 7, 2023Updated 2 years ago
- ☆75Jul 2, 2021Updated 4 years ago
- Suite of 500 procedurally-generated NLP tasks to study language model adaptability☆21Jul 16, 2022Updated 3 years ago
- A software for transferring pre-trained English models to foreign languages☆19Mar 20, 2023Updated 3 years ago
- Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0.☆105May 20, 2022Updated 3 years ago
- Code and dataset for the paper: Generating Literal and Implied Subquestions to Fact-check Complex Claims☆29May 30, 2023Updated 2 years ago
- ☆69May 1, 2025Updated 11 months ago
- ☆10Oct 2, 2024Updated last year
- Materials for PyCon 2016 in Portland, Oregon☆10Aug 30, 2015Updated 10 years ago
- Bias Benchmark for Natural Language Inference. Code repo for the Findings of NAACL 2022 paper "On Measuring Social Biases in Prompt-Based…☆15Apr 28, 2022Updated 3 years ago
- ☆15May 18, 2021Updated 4 years ago
- Research into identifying and correcting incorrect labels in the CoNLL-2003 corpus.☆12May 11, 2021Updated 4 years ago
- Official TensorFlow code for the paper "DeepWay: a Deep Learning Waypoint Estimator for Global Path Generation".☆11Jun 24, 2022Updated 3 years ago
- Code for reproducing our paper: LMSOC: An Approach for Socially Sensitive Pretraining☆13Oct 22, 2021Updated 4 years ago
- Evaluate language models using multiple choice items☆13Mar 6, 2026Updated last month
- A simple script to create geo-tagged image chips from high-resolution RS images for training deep learning models such as U-net.☆14Jun 29, 2021Updated 4 years ago
- Fact checking baseline combining dense retrieval and textual entailment☆30Aug 10, 2025Updated 8 months ago
- A PyTorch-based open-source framework that provides methods for improving the weakly annotated data and allows researchers to efficiently…☆108Sep 10, 2024Updated last year
- Transformer based Trigram Blocking implementation in Tensorflow☆11Feb 26, 2020Updated 6 years ago
- Winner system (DAMO-NLP) of SemEval 2022 MultiCoNER shared task over 10 out of 13 tracks.☆186Jan 10, 2023Updated 3 years ago
- ☆15May 30, 2017Updated 8 years ago
- CSS-LM: Contrastive Semi-supervised Fine-tuning of Pre-trained Language Models☆12Jul 1, 2023Updated 2 years ago
- Codebase for probing and visualizing multilingual models.☆49May 13, 2020Updated 5 years ago
- ☆14Updated this week