dell-research-harvard / AmericanStories
The official GitHub repository for the American Stories dataset as in {link}
☆109 Updated 8 months ago
Related projects
Alternatives and complementary repositories for AmericanStories
- Code base for constructing narrative statements from text ☆96 Updated last year
- Introducing gpt_annotate: an easy-to-use Python package designed to streamline automated text annotation using LLMs for different tasks a… ☆28 Updated 2 months ago
- A BERT-based application for reusable text classification at scale ☆37 Updated last year
- Course repository for the session "Hands-on Transformers: Fine-Tune your own BERT and GPT" of the Data Science Summer School 2023 ☆84 Updated last year
- Package to extract connotation frames ☆80 Updated 11 months ago
- Innovation across ages ☆66 Updated last year
- Making Patent Citations Uncool Again ☆108 Updated last year
- ☆82 Updated 6 months ago
- A Package for Cantonese Tokenisation ☆17 Updated 3 years ago
- A convenient way to link, deduplicate, aggregate and cluster data(frames) in Python using deep learning ☆105 Updated 5 months ago
- Noise-robust de-duplication at scale ☆15 Updated last year
- A Python package to enrich Twitter Data ☆74 Updated last year
- HDBSCAN Tuning for BERTopic Models ☆42 Updated last year
- LLM4Data is a Python library designed to facilitate the application of large language models (LLMs) and artificial intelligence for devel… ☆48 Updated 8 months ago
- Edu-ConvoKit: An Open-Source Framework for Education Conversation Data ☆77 Updated 3 months ago
- Raw text of 申報 ☆19 Updated 2 years ago
- Code for measuring novelty in science using publication text ☆15 Updated 3 weeks ago
- ☆53 Updated 10 months ago
- Tools to train and explore diachronic word embeddings from Big Historical Data ☆19 Updated last month
- This offers a Jupyter Notebook introduction on how to use Large Language Models for text analysis within the social sciences. ☆57 Updated 7 months ago
- potato: portable text annotation tool ☆298 Updated 3 weeks ago
- Powerful topic model visualization in Python ☆103 Updated 2 months ago
- A simple toolkit for conducting analyses using corpus methods ☆24 Updated 3 years ago
- Blazing fast topic modelling for short texts. ☆31 Updated last month
- Text-Based Ideal Points ☆44 Updated last year
- Learning from Neighbors: Unsupervised Text Classification ☆17 Updated 2 years ago
- ☆30 Updated 4 months ago
- Tools for interactive visual exploration of semantic embeddings. ☆29 Updated 2 months ago
- ☆160 Updated last year
- ☆71 Updated 5 months ago