[LREC 2022] An off-the-shelf pre-trained Tweet NLP Toolkit (NER, tokenization, lemmatization, POS tagging, dependency parsing) + Tweebank-NER dataset
☆105Jan 24, 2024Updated 2 years ago
Alternatives and similar repositories for TweebankNLP
Users who are interested in TweebankNLP are comparing it to the libraries listed below.
- Accurate word segmentation for hashtags and text, powered by Transformers and Beam Search. A scalable alternative to heuristic splitters …☆77Jan 8, 2026Updated last month
- Automatically detect errors in annotated corpora.☆48Sep 8, 2023Updated 2 years ago
- CrossRE: A Cross-Domain Dataset for Relation Extraction (Findings of EMNLP 2022)☆49Aug 20, 2024Updated last year
- Easy black-box access to state-of-the-art language models☆16Jun 7, 2023Updated 2 years ago
- ☆69May 1, 2025Updated 10 months ago
- Software for transferring pre-trained English models to foreign languages☆19Mar 20, 2023Updated 2 years ago
- Suite of 500 procedurally-generated NLP tasks to study language model adaptability☆21Jul 16, 2022Updated 3 years ago
- ☆75Jul 2, 2021Updated 4 years ago
- NLP Examples using the 🤗 libraries☆40Feb 21, 2021Updated 5 years ago
- ☆10Oct 2, 2024Updated last year
- Temporal summarization framework☆10Dec 4, 2023Updated 2 years ago
- BERTweet: A pre-trained language model for English Tweets (EMNLP-2020)☆605Jul 22, 2024Updated last year
- Official TensorFlow code for the paper "DeepWay: a Deep Learning Waypoint Estimator for Global Path Generation".☆11Jun 24, 2022Updated 3 years ago
- Contains Colab notebooks that show cool use cases of different GCP ML APIs.☆10Nov 5, 2020Updated 5 years ago
- Non-Metric Space (Approximate) Library in R☆12Feb 2, 2023Updated 3 years ago
- Replication code for "The Structure of Toxic Conversations on Twitter" (WWW'21)☆10May 25, 2021Updated 4 years ago
- [COLING 2022]: CommunityLM: Probing Partisan Worldviews from Language Models☆15Jan 31, 2023Updated 3 years ago
- [NAACL 2022] TreeMix: Compositional Constituency-based Data Augmentation for Natural Language Understanding☆10Jul 15, 2023Updated 2 years ago
- Code for reproducing our paper: LMSOC: An Approach for Socially Sensitive Pretraining☆13Oct 22, 2021Updated 4 years ago
- A UI automation engine☆11Aug 14, 2025Updated 6 months ago
- Repository for our paper "AbuseAnalyzer: Abuse Detection, Severity and Target Prediction for Gab Posts"☆11Jul 18, 2021Updated 4 years ago
- Load What You Need: Smaller Multilingual Transformers for PyTorch and TensorFlow 2.0.☆105May 20, 2022Updated 3 years ago
- Discretized Integrated Gradients for Explaining Language Models (EMNLP 2021)☆28Mar 26, 2022Updated 3 years ago
- A PyTorch-based open-source framework that provides methods for improving the weakly annotated data and allows researchers to efficiently…☆108Sep 10, 2024Updated last year
- Code and dataset for the paper: Generating Literal and Implied Subquestions to Fact-check Complex Claims☆30May 30, 2023Updated 2 years ago
- Research into identifying and correcting incorrect labels in the CoNLL-2003 corpus.☆12May 11, 2021Updated 4 years ago
- NMS and Soft-NMS implemented in PyTorch and NumPy☆12Mar 22, 2021Updated 4 years ago
- Transformer based Trigram Blocking implementation in Tensorflow☆11Feb 26, 2020Updated 6 years ago
- A repository to keep tools, scripts, and data for the SMART task.☆11May 24, 2022Updated 3 years ago
- This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-storage-transfer☆12Sep 21, 2023Updated 2 years ago
- Structural Supervision & Human Psycholinguistic Data☆12Apr 16, 2021Updated 4 years ago
- Materials for PyCon 2016 in Portland, Oregon☆10Aug 30, 2015Updated 10 years ago
- 🕸 GlotWeb: Web Indexing for Minority Languages (WWW 2026)☆17Aug 13, 2025Updated 6 months ago
- LinkedIn Web Scraper☆10Mar 3, 2021Updated 5 years ago
- IsoBN: Fine-Tuning BERT with Isotropic Batch Normalization☆12Nov 23, 2021Updated 4 years ago
- [Konvens21] This repository contains the DFKI MobIE Corpus, a dataset of 3,232 German-language documents that have been annotated with fi…☆12Sep 17, 2024Updated last year
- A Simulator for Traffic Intersection based on Crossroads technique☆10Dec 4, 2019Updated 6 years ago
- Speaker diarization and speech to text☆14Dec 17, 2020Updated 5 years ago
- Code for "SEE-Few: Seed, Expand and Entail for Few-shot Named Entity Recognition", accepted at COLING 2022.☆12Nov 25, 2022Updated 3 years ago