TimeLMs: Diachronic Language Models from Twitter
☆113 · Mar 5, 2024 · Updated 2 years ago
Alternatives and similar repositories for timelms
Users that are interested in timelms are comparing it to the libraries listed below.
- Repository for TweetEval ☆398 · Jul 8, 2022 · Updated 3 years ago
- EMNLP 2021 - Frustratingly Simple Pretraining Alternatives to Masked Language Modeling ☆34 · Nov 21, 2021 · Updated 4 years ago
- Leaderboards are widely used in NLP and push the field forward. While leaderboards are a straightforward ranking of NLP models, this simp… ☆18 · Mar 30, 2022 · Updated 4 years ago
- ☆10 · May 5, 2017 · Updated 9 years ago
- Probing task; contextual embeddings -> textual definitions (EMNLP19) ☆12 · Apr 22, 2021 · Updated 5 years ago
- REST and STREAMING crawlers of Twitter (java) ☆14 · May 7, 2018 · Updated 8 years ago
- Text Classification Dataset for Turkish Language ☆10 · Nov 16, 2021 · Updated 4 years ago
- This is a repository for NaijaSenti. A Lacuna Funded Project for the development of sentiment corpus for four Nigerian languages: Igbo, H… ☆39 · Oct 14, 2025 · Updated 8 months ago
- Text Extraction Formulation + Feedback Loop for state-of-the-art WSD (EMNLP 2021) ☆54 · Mar 15, 2022 · Updated 4 years ago
- ☆19 · Feb 7, 2020 · Updated 6 years ago
- ICML 18 workshop - A Novel Hybrid Machine Learning Model for Auto-Classification of Retinal Diseases ☆15 · Jul 18, 2018 · Updated 7 years ago
- Repository for "Propagating Knowledge Updates to LMs Through Distillation" (NeurIPS 2023). ☆27 · Aug 25, 2024 · Updated last year
- Simple Telegram bot to annotate and verify automatic speech recognition datasets ☆12 · Mar 30, 2021 · Updated 5 years ago
- Embedding Recycling for Language models ☆38 · Jul 11, 2023 · Updated 2 years ago
- ☆11 · Apr 23, 2023 · Updated 3 years ago
- Adaptive Passage Encoder for Open-domain Question Answering ☆15 · Jun 1, 2021 · Updated 5 years ago
- Corpus of Online Medical EnTities: the cometA corpus ☆51 · Mar 6, 2025 · Updated last year
- Repository of paper "Establishing Trustworthy LLM Evaluation via Shortcut Neuron Analysis" (ACL 2025 Main) ☆19 · Jul 19, 2025 · Updated 10 months ago
- A Python package to compute HONEST, a score to measure hurtful sentence completions in language models. Published at NAACL 2021. ☆21 · Apr 8, 2025 · Updated last year
- X2Static embeddings ☆15 · Jul 15, 2021 · Updated 4 years ago
- ☆14 · Aug 3, 2022 · Updated 3 years ago
- (ICML 2021) Mandoline: Model Evaluation under Distribution Shift ☆30 · Jun 14, 2021 · Updated 5 years ago
- R package for 'Efficient Learning of Word Representations and Sentence Classification' ☆45 · May 8, 2026 · Updated last month
- ☆10 · Aug 31, 2022 · Updated 3 years ago
- Dataset and code for paper: "Can LLM Generate Culturally Relevant Commonsense QA Data? Case Study in Indonesian and Sundanese". ☆16 · Nov 21, 2024 · Updated last year
- [NAACL 2021] This is the code for our paper `Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self… ☆206 · Aug 17, 2022 · Updated 3 years ago
- [EMNLP 2022] TemporalWiki: A Lifelong Benchmark for Training and Evaluating Ever-Evolving Language Models ☆75 · May 15, 2024 · Updated 2 years ago
- GlossBERT: BERT for Word Sense Disambiguation with Gloss Knowledge (EMNLP 2019) ☆97 · Oct 9, 2022 · Updated 3 years ago
- GC4LM: A Colossal (Biased) language model for German ☆13 · May 2, 2021 · Updated 5 years ago
- The official implementation of "BERT is to NLP what AlexNet is to CV: Can Pre-Trained Language Models Identify Analogies?, ACL 2021 main … ☆25 · May 28, 2023 · Updated 3 years ago
- The InterScript dataset contains interactive user feedback on scripts generated by a T5-XXL model. ☆12 · Dec 15, 2021 · Updated 4 years ago
- Mr. Right: Multimodal Retrieval on Representation of ImaGe witH Text ☆24 · Aug 15, 2022 · Updated 3 years ago
- OptimSeed - Seed Word Selection for Weakly-Supervised Text Classification [NAACL SRW 2021] ☆14 · Mar 29, 2021 · Updated 5 years ago
- Stochastic gradient descent with model building ☆27 · Feb 15, 2023 · Updated 3 years ago
- Python + JavaScript workaround for mturk's rejection of CSV files with Emoji ☆18 · Mar 8, 2018 · Updated 8 years ago
- BioCoder: A Benchmark for Bioinformatics Code Generation with Large Language Models https://arxiv.org/abs/2308.16458 ☆58 · Jul 31, 2025 · Updated 10 months ago
- PyTorch Implementation of Prompt-augmented Temporal Point Process for Streaming Event Sequence, NeurIPS 2023 ☆14 · Dec 9, 2023 · Updated 2 years ago
- Papers, code and datasets about Cross-lingual Word Embeddings ☆21 · Jan 23, 2022 · Updated 4 years ago
- This repository contains a dataset for hate speech detection on social media platforms. ☆75 · Dec 9, 2022 · Updated 3 years ago