dell-research-harvard / AmericanStories
The official GitHub for the American Stories dataset as in {link}
☆120 · Updated last year
Alternatives and similar repositories for AmericanStories
Users who are interested in AmericanStories are comparing it to the libraries listed below.
- legisTaiwan: An Interface to Access Taiwan Legislative API in R 台灣立法院國會系統 API ☆42 · Updated last month
- Code for measuring novelty in science using publication text ☆27 · Updated 3 months ago
- Noise-robust de-duplication at scale ☆19 · Updated 2 years ago
- code base for constructing narrative statements from text ☆108 · Updated last year
- Learning from Neighbors: Unsupervised Text Classification ☆17 · Updated 2 years ago
- The Harvard USPTO Patent Dataset ☆68 · Updated last year
- A Package for Cantonese Tokenisation ☆18 · Updated 3 years ago
- Tools to train and explore diachronic word embeddings from Big Historical Data ☆23 · Updated 4 months ago
- Introducing gpt_annotate: an easy-to-use python package designed to streamline automated text annotation using LLMs for different tasks a… ☆28 · Updated 9 months ago
- An automation webcrawler based on Selenium library for retrieving parliamentary questions on The Website of Taiwan Legislative Yuan (http… ☆11 · Updated 2 years ago
- Making Patent Citations Uncool Again ☆110 · Updated last year
- A convenient way to link, deduplicate, aggregate and cluster data(frames) in Python using deep learning ☆119 · Updated 2 months ago
- Project repository of the paper "Less Annotating, More Classifying – Addressing the Data Scarcity Issue of Supervised Machine Learning wi… ☆32 · Updated last year
- Searching in-memory corpus with Corpus Query Language (CQL) ☆19 · Updated 6 months ago
- A CWN Python binding with graph structure ☆31 · Updated 2 years ago
- Open data of Cofacts collaborative fact-checking database ☆49 · Updated last year
- Natural Language Processing for Political Science ☆20 · Updated 7 years ago
- This project aims to curate and provide a comprehensive collection of prompts designed specifically for generative AI models in the conte… ☆33 · Updated 2 months ago
- A simple toolkit for conducting analyses using corpus methods ☆25 · Updated 3 years ago
- BLOOM-zh is a modification from BLOOM. BLOOM-zh is trained extendedly on larger amounts of Traditional Chinese text data while it still m… ☆9 · Updated 2 years ago
- Package to extract connotation frames ☆85 · Updated last year
- Innovation across ages ☆70 · Updated 2 years ago
- How are words loaded with meaning? Repository accompanying research by Alina Arseniev-Koehler and Jacob G. Foster, titled "Machine learn… ☆41 · Updated last year
- This offers a Jupyter Notebook introduction on how to use Large Language Models for text analysis within the social sciences. ☆65 · Updated last year
- A shared repository for data cleaning scripts used for innovation data. ☆33 · Updated 4 years ago
- A scikit-learn compliant implementation of Monroe et al.'s Fightin' Words analysis method. ☆11 · Updated 6 years ago
- PatentSBERTa: A Deep NLP based Hybrid Model for Patent Distance and Classification using Augmented SBERT ☆85 · Updated 7 months ago
- ☆36 · Updated 6 years ago
- HDBSCAN Tuning for BERTopic Models ☆47 · Updated 2 years ago
- ☆22 · Updated last year