dell-research-harvard / AmericanStories
The official GitHub repository for the American Stories dataset as in {link}
☆125 Updated last year
Alternatives and similar repositories for AmericanStories
Users who are interested in AmericanStories compare it to the libraries listed below
- A convenient way to link, deduplicate, aggregate and cluster data(frames) in Python using deep learning ☆127 Updated 4 months ago
- Project repository of the paper "Less Annotating, More Classifying – Addressing the Data Scarcity Issue of Supervised Machine Learning wi… ☆32 Updated last year
- The Harvard USPTO Patent Dataset ☆69 Updated last year
- A Package for Cantonese Tokenisation ☆18 Updated 4 years ago
- Neural Language Models for Historical Research ☆28 Updated 10 months ago
- HDBSCAN Tuning for BERTopic Models ☆49 Updated 2 years ago
- PatentSBERTa: A Deep NLP based Hybrid Model for Patent Distance and Classification using Augmented SBERT ☆91 Updated 9 months ago
- Introducing gpt_annotate: an easy-to-use python package designed to streamline automated text annotation using LLMs for different tasks a… ☆29 Updated 11 months ago
- potato: portable text annotation tool ☆349 Updated last month
- code base for constructing narrative statements from text ☆111 Updated last year
- Concept Induction: Analyzing Unstructured Text with High-Level Concepts Using LLooM (CHI 2024 paper). LLooM automatically surfaces high-l… ☆121 Updated 2 months ago
- legisTaiwan: An Interface to Access Taiwan Legislative API in R 台灣立法院國會系統 API ☆42 Updated 2 months ago
- Tools to train and explore diachronic word embeddings from Big Historical Data ☆27 Updated 7 months ago
- A BERT-based application for reusable text classification at scale ☆38 Updated 2 years ago
- Nesta's Skills Extractor Library ☆141 Updated 2 months ago
- An automation webcrawler based on Selenium library for retrieving parliamentary questions on The Website of Taiwan Legislative Yuan (http… ☆11 Updated 2 years ago
- Code for the paper 'Conversations at Scale: Robust AI-led Interviews with a Simple Open-Source Platform' ☆41 Updated 6 months ago
- 🗺️ Data Cleaning and Textual Data Visualization 🗺️ ☆185 Updated 3 months ago
- Tool for probabilistically linking the records of individual entities (e.g. people) within and across datasets ☆117 Updated 8 months ago
- Text-Based Ideal Points ☆45 Updated 2 years ago
- A python package to enrich Twitter Data ☆75 Updated 2 years ago
- Code for the CUP Elements on text analysis in Python for social scientists ☆137 Updated 2 years ago
- MoodCat😼 classifies the mood of English sentences. ☆14 Updated 3 years ago
- Making Patent Citations Uncool Again ☆111 Updated 2 years ago
- This is a step-by-step tutorial for text analysts who want an easy start to basic and common techniques in NLP, Text Analysis, Machine… ☆20 Updated 2 years ago
- Package to extract connotation frames ☆87 Updated last year
- Given a job title and job description, the algorithm assigns a standard occupational classification (SOC) code to the job. ☆74 Updated last year
- Asent is a python library for performing efficient and transparent sentiment analysis using spaCy. ☆118 Updated last year
- A shared repository for data cleaning scripts used for innovation data. ☆33 Updated 4 years ago
- ☆55 Updated last year