dell-research-harvard / AmericanStories
The official GitHub repository for the American Stories dataset as in {link}
☆118 Updated last year
Alternatives and similar repositories for AmericanStories
Users who are interested in AmericanStories are comparing it to the repositories listed below
- legisTaiwan: An Interface to Access Taiwan Legislative API in R (Taiwan Legislative Yuan parliamentary system API) ☆41 Updated 2 weeks ago
- Noise-robust de-duplication at scale ☆19 Updated 2 years ago
- An automation webcrawler based on Selenium library for retrieving parliamentary questions on The Website of Taiwan Legislative Yuan (http… ☆11 Updated last year
- Code for measuring novelty in science using publication text ☆26 Updated 2 months ago
- code base for constructing narrative statements from text ☆107 Updated last year
- A convenient way to link, deduplicate, aggregate and cluster data(frames) in Python using deep learning ☆118 Updated last month
- Innovation across ages ☆69 Updated 2 years ago
- Course repository for the session "Hands-on Transformers: Fine-Tune your own BERT and GPT" of the Data Science Summer School 2023 ☆86 Updated last year
- Open data of Cofacts collaborative fact-checking database ☆49 Updated last year
- A Package for Cantonese Tokenisation ☆17 Updated 3 years ago
- Package to extract connotation frames ☆85 Updated last year
- CKIP CoreNLP Toolkits ☆122 Updated 2 years ago
- A Flexible Deep Learning Approach to Fuzzy String Matching ☆145 Updated 7 months ago
- An R package for Keyword Assisted Topic Models ☆107 Updated last month
- Making Patent Citations Uncool Again ☆110 Updated last year
- ☆36 Updated 6 years ago
- Tools to train and explore diachronic word embeddings from Big Historical Data ☆23 Updated 3 months ago
- A CWN Python binding with graph structure ☆31 Updated 2 years ago
- Code and data files for the book 《民意調查資料分析的R實戰手冊》 ("A Practical R Handbook for Analyzing Public Opinion Survey Data", Wu-Nan, 2018) ☆16 Updated last year
- PatentSBERTa: A Deep NLP based Hybrid Model for Patent Distance and Classification using Augmented SBERT ☆83 Updated 6 months ago
- Natural Language Processing for Political Science ☆20 Updated 7 years ago
- Tool for probabilistically linking the records of individual entities (e.g. people) within and across datasets ☆113 Updated 5 months ago
- Scripts to fit and explore word embedding models augmented with political metadata. ☆25 Updated 10 months ago
- Project repository of the paper "Less Annotating, More Classifying – Addressing the Data Scarcity Issue of Supervised Machine Learning wi… ☆32 Updated last year
- 🗺️ Data Cleaning and Textual Data Visualization 🗺️ ☆172 Updated last week
- The course introduces the use of open-source large language models (LLMs) from the Hugging Face ecosystem for research in the behavioral … ☆20 Updated 3 months ago
- Learning from Neighbors: Unsupervised Text Classification ☆17 Updated 2 years ago
- Analysis and visualization of Taiwan’s COVID-19 data ☆28 Updated last year
- Introducing gpt_annotate: an easy-to-use python package designed to streamline automated text annotation using LLMs for different tasks a… ☆28 Updated 8 months ago
- Lectures and "flipped session" materials from my NYU DS "Text as Data" course, spring 2021 ☆136 Updated 3 years ago