dell-research-harvard / AmericanStories
The official Github for the American Stories dataset as in {link}
☆116Updated last year
Alternatives and similar repositories for AmericanStories:
Users that are interested in AmericanStories are comparing it to the libraries listed below
- A model(ing framework) for sample efficient OCR☆57Updated last year
- legisTaiwan: An Interface to Access Taiwan Legislative API in R 台灣立法院國會系統 API☆36Updated 2 months ago
- Noise-robust de-duplication at scale☆18Updated last year
- A convenient way to link, deduplicate, aggregate and cluster data(frames) in Python using deep learning☆118Updated last month
- Introducing gpt_annotate: an easy-to-use python package designed to streamline automated text annotation using LLMs for different tasks a…☆28Updated 6 months ago
- code base for constructing narrative statements from text☆106Updated last year
- ☆80Updated 9 months ago
- Making Patent Citations Uncool Again☆110Updated last year
- Project repository of the paper "Less Annotating, More Classifying – Addressing the Data Scarcity Issue of Supervised Machine Learning wi…☆31Updated last year
- Code for measuring novelty in science using publication text☆24Updated 3 weeks ago
- Innovation across ages☆69Updated 2 years ago
- Package to extract connotation frames☆83Updated last year
- This offers a Jupyter Notebook introduction on how to use Large Language Models for text analysis within the social sciences.☆64Updated 11 months ago
- An automation webcrawler based on Selenium library for retrieving parliamentary questions on The Website of Taiwan Legislative Yuan (http…☆11Updated last year
- A BERT-based application for reusable text classification at scale☆38Updated last year
- This project aims to curate and provide a comprehensive collection of prompts designed specifically for generative AI models in the conte…☆32Updated 2 weeks ago
- Tools to train and explore diachronic word embeddings from Big Historical Data☆22Updated last month
- A Flexible Deep Learning Approach to Fuzzy String Matching☆144Updated 5 months ago
- Code for the paper 'Conversations at Scale: Robust AI-led Interviews with a Simple Open-Source Platform'☆31Updated last month
- ☆21Updated last year
- ☆119Updated 2 months ago
- A curated list of digital things related to the field of Chinese studies.☆32Updated 4 years ago
- A Package for Cantonese Tokenisation☆17Updated 3 years ago
- Lectures and "flipped session" materials from my NYU DS "Text as Data" course, spring 2021☆136Updated 3 years ago
- Neural Language Models for Historical Research☆25Updated 5 months ago
- LLM4Data is a Python library designed to facilitate the application of large language models (LLMs) and artificial intelligence for devel…☆57Updated 3 months ago
- 🗺️ Data Cleaning and Textual Data Visualization 🗺️☆165Updated 9 months ago
- Course repository for the session "Hands-on Transformers: Fine-Tune your own BERT and GPT" of the Data Science Summer School 2023☆83Updated last year
- Nesta's Skills Extractor Library☆129Updated 4 months ago
- A python package to enrich Twitter Data☆75Updated last year