GateNLP/broad_twitter_corpus

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/GateNLP/broad_twitter_corpus)

GateNLP / broad_twitter_corpus

The Broad Twitter Corpus, an NER dataset in English stratified for time, location, social media genre, socioeconomic factors (COLING 2016)

☆69

Alternatives and similar repositories for broad_twitter_corpus

Users that are interested in broad_twitter_corpus are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

CogComp / zoe
View on GitHub
Zero-Shot Open Entity Typing as Type-Compatible Grounding, EMNLP'18.
☆43Jan 16, 2020Updated 6 years ago
CogComp / multirc
View on GitHub
Reasoning over Multiple Sentences (Multi-RC)
☆34May 20, 2020Updated 6 years ago
swiseman / neighbor-tagging
View on GitHub
☆16Oct 24, 2021Updated 4 years ago
mlukasik / rumour-classification
View on GitHub
Code to reproduce experiments from the EMNLP 2015 paper about Rumour Stance Classification with Gaussian Processes.
☆37May 23, 2016Updated 10 years ago
leondz / entity_recognition
View on GitHub
framework for doing NER and other types of entity recognition, in Python
☆68Jun 21, 2022Updated 4 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
ArthurSpirling / UTokyo-TextAsData
View on GitHub
Code, data and slides for the UTokyo "text as data" course (June 3-4, 2017)
☆11Jun 5, 2017Updated 9 years ago
erikavaris / tokenizer
View on GitHub
Tokenizer for Twitter and Reddit data
☆45Apr 14, 2019Updated 7 years ago
sean-chester / generalised-brown
View on GitHub
C++ implementation of Generalised Brown clustering and python scripts for feature generation
☆41Apr 8, 2016Updated 10 years ago
marekrei / mltagger
View on GitHub
Multi-level tagger
☆24May 4, 2018Updated 8 years ago
dgarcia-eu / SocialMediaDataAnalysis
View on GitHub
Online materials for Social Media Data Analysis at the University of Konstanz
☆10Oct 13, 2025Updated 9 months ago
cdcrabtree / nomine
View on GitHub
Classify names by gender, U.S. ethnicity, or leaf nationality
☆19Oct 13, 2018Updated 7 years ago
fbkarsdorp / twitter-workshop
View on GitHub
Workshop materials for scraping Twitter with Python
☆13May 25, 2016Updated 10 years ago
epfl-dlab / unfun
View on GitHub
Code and data for the AAAI'19 paper "Reverse-Engineering Satire, or 'Paper on Computational Humor Accepted Despite Making Serious Advance…
☆14Feb 22, 2023Updated 3 years ago
cttsai / illinois-cross-lingual-wikifier
View on GitHub
☆24Sep 28, 2017Updated 8 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
leondz / emerging_entities_17
View on GitHub
Dataset for the Emerging & Novel Entity NER task (WNUT '17)
☆113May 9, 2022Updated 4 years ago
leondz / twokenize
View on GitHub
Python standalone tokenizer
☆14Nov 12, 2015Updated 10 years ago
mariask2 / PAL-A-tool-for-Pre-annotation-and-Active-Learning
View on GitHub
PAL: A tool for Pre-annotation and Active Learning
☆18Feb 1, 2021Updated 5 years ago
kmunger / Topic_Models
View on GitHub
Presentation for the NYU Data Lab December 2015
☆14Dec 2, 2015Updated 10 years ago
Aatlantise / syntactic-augmentation-nli
View on GitHub
Create augmentation examples from MultiNLI by subject-object inversion and passivizing.
☆17Feb 22, 2021Updated 5 years ago
Anterotesis / historical-texts
View on GitHub
Collections of english historical texts and data relating to them
☆19Mar 24, 2021Updated 5 years ago
brendano / mte
View on GitHub
MiTextExplorer - interactive browser of text and document covariates.
☆24Jun 17, 2015Updated 11 years ago
mayhewsw / pytorch-truecaser
View on GitHub
A simple neural truecaser written in pytorch and allennlp.
☆35Jun 17, 2024Updated 2 years ago
dimsum16 / dimsum-data
View on GitHub
Data for the DiMSUM shared task at SEMEVAL 2016
☆14Feb 8, 2016Updated 10 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
kenlimmj / fightin-words
View on GitHub
A scikit-learn compliant implementation of Monroe et al.'s Fightin' Words analysis method.
☆11May 26, 2026Updated last month
danieldeutsch / summarize
View on GitHub
☆12Nov 11, 2019Updated 6 years ago
DeFacto / WebCredibility
View on GitHub
Provides web credibility models (Likert scale) to assign a trustworthiness score to a given website.
☆11Sep 19, 2019Updated 6 years ago
Hyperparticle / LemmaTag
View on GitHub
A neural network that jointly part-of-speech tags and lemmatizes sentences, boosting accuracy for morphologically-rich languages (Czech, …
☆34Apr 5, 2019Updated 7 years ago
ProjectTw / TwitteR2Mongo
View on GitHub
R Package to stream and analyze tweets using a mongodb
☆13Mar 1, 2016Updated 10 years ago
fghjorth / lqrps17
View on GitHub
Information about and materials for graduate course "Logic of Quantitative Research in Political Science" at the University of Copenhagen…
☆17Feb 14, 2017Updated 9 years ago
ClimbsRocks / data-formatter
View on GitHub
Takes raw csv input and formats it to be ready for neural networks
☆19Mar 22, 2016Updated 10 years ago
kbenoit / ITAUR-Short
View on GitHub
A Brief Introduction to Text Analysis Using R
☆15Oct 27, 2016Updated 9 years ago
USC-CSSL / TACIT
View on GitHub
We introduce TACIT: An Open-Source Text Analysis, Crawling and Interpretation Tool. TACIT's plugin architecture has three main components…
☆112Mar 27, 2019Updated 7 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
YahooArchive / YaraParser
View on GitHub
Yara K-Beam Arc-Eager Dependency Parser
☆57Apr 21, 2016Updated 10 years ago
AKSW / Mandolin
View on GitHub
❇️ The best modules for Markov Logic Networks condensed in one framework.
☆13Dec 20, 2017Updated 8 years ago
bltlab / seqscore
View on GitHub
SeqScore: Scoring for named entity recognition and other sequence labeling tasks
☆23Jul 16, 2026Updated last week
mjpost / bin
View on GitHub
bin files
☆13Jan 30, 2025Updated last year
stapelberg / goturbopfor
View on GitHub
Teaching implementation of the TurboPFor integer compression algorithm
☆23Feb 5, 2019Updated 7 years ago
kg-construct / resources
View on GitHub
Resources for KGC: languages/tools/evaluation-systems
☆15Oct 13, 2020Updated 5 years ago
MilaNLProc / bertlang
View on GitHub
A web interface to understand language-specific BERT-models
☆18Apr 16, 2024Updated 2 years ago