dell-research-harvard / AmericanStories
The official Github for the American Stories dataset as in {link}
☆112Updated 10 months ago
Alternatives and similar repositories for AmericanStories:
Users that are interested in AmericanStories are comparing it to the libraries listed below
- Course repository for the session "Hands-on Transformers: Fine-Tune your own BERT and GPT" of the Data Science Summer School 2023☆83Updated last year
- An automation webcrawler based on Selenium library for retrieving parliamentary questions on The Website of Taiwan Legislative Yuan (http…☆11Updated last year
- code base for constructing narrative statements from text☆100Updated last year
- Learning from Neighbors: Unsupervised Text Classification☆17Updated 2 years ago
- A CWN Python binding with graph structure☆27Updated last year
- legisTaiwan: An Interface to Access Taiwan Legislative API in R 台灣立法院國會系統 API☆25Updated last week
- Project repository of the paper "Less Annotating, More Classifying – Addressing the Data Scarcity Issue of Supervised Machine Learning wi…☆28Updated 10 months ago
- A convenient way to link, deduplicate, aggregate and cluster data(frames) in Python using deep learning☆109Updated 7 months ago
- Open data of Cofacts collaborative fact-checking database☆49Updated last year
- Code for measuring novelty in science using publication text☆19Updated 3 weeks ago
- ☆54Updated last year
- Tool for probabilistically linking the records of individual entities (e.g. people) within and across datasets☆109Updated last month
- BLOOM-zh is a modification from BLOOM. BLOOM-zh is trained extendedly on larger amounts of Traditional Chinese text data while it still m…☆10Updated last year
- Noise-robust de-duplication at scale☆15Updated last year
- A Package for Cantonese Tokenisation☆17Updated 3 years ago
- Innovation across ages☆67Updated last year
- Making Patent Citations Uncool Again☆110Updated last year
- 🗺️ Data Cleaning and Textual Data Visualization 🗺️☆159Updated 7 months ago
- CKIP CoreNLP Toolkits☆118Updated last year
- A scikit-learn compliant implementation of Monroe et al.'s Fightin' Words analysis method.☆12Updated 5 years ago
- Introducing gpt_annotate: an easy-to-use python package designed to streamline automated text annotation using LLMs for different tasks a…☆28Updated 4 months ago
- Package to extract connotation frames☆81Updated last year
- Ethnicolr implementation with new models in pytorch☆10Updated last month
- HDBSCAN Tuning for BERTopic Models☆42Updated last year
- ☆35Updated 6 years ago
- A Traditional-Chinese instruction-following model with datasets based on Alpaca.☆136Updated last year
- Design, conduct and analyze results of AI-powered surveys and experiments. Simulate social science and market research with large numbers…☆210Updated this week
- A BERT-based application for reusable text classification at scale☆37Updated last year