dell-research-harvard / AmericanStoriesLinks
The official Github for the American Stories dataset as in {link}
☆127Updated last year
Alternatives and similar repositories for AmericanStories
Users that are interested in AmericanStories are comparing it to the libraries listed below
Sorting:
- Project repository of the paper "Less Annotating, More Classifying – Addressing the Data Scarcity Issue of Supervised Machine Learning wi…☆34Updated last year
- Learning from Neighbors: Unsupervised Text Classification☆17Updated 3 years ago
- Package to extract connotation frames☆91Updated 2 years ago
- Making Patent Citations Uncool Again☆112Updated 2 years ago
- Tools to train and explore diachronic word embeddings from Big Historical Data☆28Updated 11 months ago
- legisTaiwan: An Interface to Access Taiwan Legislative API in R 台灣立法院國會系統 API☆42Updated last month
- Code for the CUP Elements on text analysis in Python for social scientists☆138Updated 3 years ago
- The Harvard USPTO Patent Dataset☆79Updated 2 years ago
- Introducing gpt_annotate: an easy-to-use python package designed to streamline automated text annotation using LLMs for different tasks a…☆29Updated last year
- Nesta's Skills Extractor Library☆150Updated 6 months ago
- Text-Based Ideal Points☆46Updated 2 years ago
- An automation webcrawler based on Selenium library for retrieving parliamentary questions on The Website of Taiwan Legislative Yuan (http…☆11Updated 2 years ago
- code base for constructing narrative statements from text☆116Updated 2 years ago
- PatentSBERTa: A Deep NLP based Hybrid Model for Patent Distance and Classification using Augmented SBERT☆102Updated last year
- Code for measuring novelty in science using publication text☆32Updated 9 months ago
- A convenient way to link, deduplicate, aggregate and cluster data(frames) in Python using deep learning☆133Updated last month
- HDBSCAN Tuning for BERTopic Models☆49Updated 2 years ago
- Course repository for the session "Hands-on Transformers: Fine-Tune your own BERT and GPT" of the Data Science Summer School 2023☆90Updated 2 years ago
- A BERT-based application for reusable text classification at scale☆38Updated 2 years ago
- A python package to enrich Twitter Data☆75Updated 2 years ago
- A simple toolkit for conducting analyses using corpus methods☆27Updated 4 years ago
- This repository contains data of TikTok videos related to the 2024 U.S. Elections☆32Updated 10 months ago
- Code for the paper "Content Analysis of Textbooks via Natural Language Processing".☆62Updated 2 years ago
- ☆61Updated last week
- potato: portable text annotation tool☆357Updated 3 weeks ago
- Twitter dataset for 2022 Russian and Ukrainian crisis☆48Updated 3 years ago
- Noise-robust de-duplication at scale☆19Updated 2 years ago
- ☆55Updated last year
- Neural Language Models for Historical Research☆29Updated last year
- A scikit-learn compliant implementation of Monroe et al.'s Fightin' Words analysis method.☆11Updated 6 years ago