dell-research-harvard / AmericanStories
The official GitHub for the American Stories dataset as in {link}
☆121Updated last year
Alternatives and similar repositories for AmericanStories
Users who are interested in AmericanStories are comparing it to the libraries listed below
- legisTaiwan: An Interface to Access Taiwan Legislative API in R 台灣立法院國會系統 API☆42Updated last month
- Package to extract connotation frames☆86Updated last year
- Introducing gpt_annotate: an easy-to-use python package designed to streamline automated text annotation using LLMs for different tasks a…☆28Updated 10 months ago
- Twitter dataset for 2022 Russian and Ukrainian crisis☆48Updated 2 years ago
- potato: portable text annotation tool☆340Updated this week
- 🗺️ Data Cleaning and Textual Data Visualization 🗺️☆180Updated last month
- Tools to train and explore diachronic word embeddings from Big Historical Data☆25Updated 5 months ago
- This is a step by step tutorial for text analysts who want an easy start to basic and common techniques in NLP, Text Analysis, Machine…☆19Updated 2 years ago
- PatentSBERTa: A Deep NLP based Hybrid Model for Patent Distance and Classification using Augmented SBERT☆87Updated 8 months ago
- HDBSCAN Tuning for BERTopic Models☆48Updated 2 years ago
- A convenient way to link, deduplicate, aggregate and cluster data(frames) in Python using deep learning☆125Updated 3 months ago
- An automation webcrawler based on Selenium library for retrieving parliamentary questions on The Website of Taiwan Legislative Yuan (http…☆11Updated 2 years ago
- Course repository for the session "Hands-on Transformers: Fine-Tune your own BERT and GPT" of the Data Science Summer School 2023☆87Updated last year
- Learning from Neighbors: Unsupervised Text Classification☆17Updated 2 years ago
- Neural Language Models for Historical Research☆26Updated 9 months ago
- Nesta's Skills Extractor Library☆140Updated last month
- A BERT-based application for reusable text classification at scale☆38Updated last year
- Making Patent Citations Uncool Again☆110Updated 2 years ago
- Noise-robust de-duplication at scale☆20Updated 2 years ago
- Code for the CUP Elements on text analysis in Python for social scientists☆136Updated 2 years ago
- The course introduces the use of open-source large language models (LLMs) from the Hugging Face ecosystem for research in the behavioral …☆53Updated last year
- This is my 2024 course for TAP Institute on Vector Databases and Semantic Searching.☆12Updated 11 months ago
- Open data of Cofacts collaborative fact-checking database☆50Updated last year
- ☆11Updated 7 months ago
- A simple toolkit for conducting analyses using corpus methods☆25Updated 3 years ago
- Code for measuring novelty in science using publication text☆30Updated 4 months ago
- A python package to enrich Twitter Data☆75Updated 2 years ago
- code base for constructing narrative statements from text☆110Updated last year
- The Harvard USPTO Patent Dataset☆69Updated last year
- Fast, flexible extraction of moral information from textual input data.☆110Updated 2 years ago