dell-research-harvard/AmericanStories

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/dell-research-harvard/AmericanStories)

dell-research-harvard / AmericanStories

The official Github for the American Stories dataset as in {link}

☆135

Alternatives and similar repositories for AmericanStories

Users that are interested in AmericanStories are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Living-with-machines / DiachronicEmb-BigHistData
View on GitHub
Tools to train and explore diachronic word embeddings from Big Historical Data
☆31Apr 18, 2026Updated 3 months ago
dell-research-harvard / NEWS-COPY
View on GitHub
Noise-robust de-duplication at scale
☆19Apr 9, 2023Updated 3 years ago
dell-research-harvard / effocr
View on GitHub
A model(ing framework) for sample efficient OCR
☆65Apr 7, 2023Updated 3 years ago
qurator-spk / neat
View on GitHub
Named entity annotation tool
☆28Jul 6, 2023Updated 3 years ago
NathanGodey / headless-lm
View on GitHub
Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…
☆29Apr 17, 2024Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
relatio-nlp / relatio
View on GitHub
code base for constructing narrative statements from text
☆125Jan 13, 2026Updated 6 months ago
nervjack2 / Speech2Unit
View on GitHub
☆13Sep 25, 2024Updated last year
Harry-Chan / seq2seqlm-on-qg
View on GitHub
☆13Feb 9, 2022Updated 4 years ago
voidful / MMLM
View on GitHub
Toward Multi Modality Language Model - implementation of GPT-4o/Project Astra
☆16Dec 10, 2024Updated last year
DanielLin94144 / DUAL-textless-SQA
View on GitHub
Textless (ASR-transcript free) Spoken Question Answering. The official release of NMSQA dataset and the implementation of "DUAL: Textless…
☆35Aug 10, 2023Updated 2 years ago
DEFI-COLaF / LADaS
View on GitHub
Layout Analysis Dataset with Segmonto (LADaS)
☆25May 29, 2026Updated 2 months ago
olami-developers / olami-api-quickstart-curl-samples
View on GitHub
OLAMI API Quickstart cURL Samples (in bash)
☆11Jan 26, 2018Updated 8 years ago
cneud / ocr-conversion
View on GitHub
Conversions between various OCR formats
☆84Feb 13, 2026Updated 5 months ago
jbollen / rise_and_fall_of_rationality_in_language
View on GitHub
Data and code for "Rise and Fall of Rationality in Language" by Marten Scheffer, Ingrid van de Leemput, Els Weinans, and Johan Bollen
☆17Oct 27, 2021Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
maxdotio / neural-solr
View on GitHub
Neural Solr = Solr 9 + Mighty Inference + Node
☆18Jun 9, 2022Updated 4 years ago
kylebutts / ssaggregate
View on GitHub
Create industry-level aggregates for shift-share IV following Borusyak, Hull, and Jaravel (2022)
☆28Nov 2, 2025Updated 8 months ago
JMSLab / Template
View on GitHub
Template for research repository using scons.
☆15Updated this week
DDMAL / IIIF-AV-player
View on GitHub
IIIF Audio/Video Player
☆14Oct 26, 2023Updated 2 years ago
stressosaurus / raw-data-google-ngram
View on GitHub
This will download and process the Google Ngram data.
☆25Nov 29, 2022Updated 3 years ago
elliottash / text_econ_2022
View on GitHub
Materials for PhD course on text data in economics
☆109Sep 12, 2023Updated 2 years ago
darinchristensen / conley-se
View on GitHub
Code to estimate Conley's Standard Errors in R
☆17Sep 2, 2019Updated 6 years ago
bldavies / nberwp
View on GitHub
R package containing data on NBER working papers
☆26Nov 20, 2022Updated 3 years ago
UChicago-pol-methods / plsc-40601-CI-ML
View on GitHub
Advanced Topics in Causal Inference PLSC 40601
☆37May 6, 2026Updated 2 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
omeka-s-modules / Datascribe
View on GitHub
An Omeka S module for the transcription of structured data.
☆16May 1, 2026Updated 2 months ago
AlexandraKapp / 30daymapchallenge
View on GitHub
☆28Nov 30, 2020Updated 5 years ago
umatter / datahandling
View on GitHub
☆28Dec 10, 2025Updated 7 months ago
swiss-ai / parity-aware-bpe
View on GitHub
Parity-Aware Byte-Pair Encoding: Improving Cross-lingual Fairness in Tokenization [ACL 2026]
☆20Apr 18, 2026Updated 3 months ago
mcaceresb / tablefill
View on GitHub
Fill in LyX, LaTeX, and Markdown tables using placeholder system
☆13Jan 20, 2026Updated 6 months ago
adapter-hub / hgiyt
View on GitHub
Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"
☆28Oct 3, 2021Updated 4 years ago
skhiggins / ra_guide
View on GitHub
Guidelines for research assistants
☆39Aug 11, 2025Updated 11 months ago
umd-mith / ndnp_iiif
View on GitHub
convert NDNP data to IIIF
☆12Jun 7, 2016Updated 10 years ago
webis-de / scidata22-stereo-scientific-text-reuse
View on GitHub
☆11Dec 2, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
suzgunmirac / hupd
View on GitHub
The Harvard USPTO Patent Dataset
☆87Dec 14, 2023Updated 2 years ago
gojiplus / statqa
View on GitHub
Extract Stats Q/A from Tables With Provenance
☆26Dec 27, 2025Updated 7 months ago
JonnoB / enhance_ocod
View on GitHub
A library for working with the OCOD dataset for analysis of property in England and Wales owned by offshore companies
☆14May 13, 2026Updated 2 months ago
mattblackwell / gov-january-math-refresher
View on GitHub
☆10Dec 14, 2022Updated 3 years ago
qurator-spk / sbb_ner
View on GitHub
Named Entity Recognition
☆19Feb 13, 2026Updated 5 months ago
generall / hnsw-python
View on GitHub
hnsw implemented by python
☆22Nov 28, 2019Updated 6 years ago
fspv / leetcode-swagger
View on GitHub
Swagger file for leetcode API
☆12Oct 26, 2021Updated 4 years ago