The official Github for the American Stories dataset as in {link}
☆129Mar 7, 2024Updated 2 years ago
Alternatives and similar repositories for AmericanStories
Users that are interested in AmericanStories are comparing it to the libraries listed below
Sorting:
- ☆13Dec 3, 2017Updated 8 years ago
- ☆14Feb 9, 2022Updated 4 years ago
- Official GitHub repository for paper "SAKURA: On the Multi-hop Reasoning of Large Audio-Language Models Based on Speech and Audio Informa…☆22Aug 14, 2025Updated 6 months ago
- Noise-robust de-duplication at scale☆19Apr 9, 2023Updated 2 years ago
- ☆13Sep 25, 2024Updated last year
- A slides collection for Go Conference 2016 Spring☆10Nov 6, 2017Updated 8 years ago
- OLAMI API Quickstart cURL Samples (in bash)☆11Jan 26, 2018Updated 8 years ago
- Swagger file for leetcode API☆12Oct 26, 2021Updated 4 years ago
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"☆28Oct 3, 2021Updated 4 years ago
- Named entity annotation tool☆28Jul 6, 2023Updated 2 years ago
- ☆12Dec 6, 2024Updated last year
- CyberCan is a lexicon of contemporary Cantonese based on more than 100 million pieces of internet texts from discussion forums in Hong Ko…☆12Aug 24, 2021Updated 4 years ago
- Mulidimensional hyperbolic embeddings of real networks☆20May 15, 2025Updated 9 months ago
- Neural Solr = Solr 9 + Mighty Inference + Node☆18Jun 9, 2022Updated 3 years ago
- ☆14Feb 20, 2024Updated 2 years ago
- Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"☆30Apr 2, 2022Updated 3 years ago
- ☆19Sep 16, 2025Updated 5 months ago
- Toward Multi Modality Language Model - implementation of GPT-4o/Project Astra☆16Dec 10, 2024Updated last year
- Meta Representation Transformation for Low-resource Cross-lingual Learning☆41May 5, 2021Updated 4 years ago
- A list of awesome machine question answering dataset - 機器問答數據集☆15Dec 24, 2019Updated 6 years ago
- This repository is part of an NLP course for humanities and cultural studies. This course uses historical newspapers as a source and appl…☆19Jun 5, 2025Updated 9 months ago
- Dataset of temperatures in 6 rooms of my home recorded once per minute☆16Oct 20, 2018Updated 7 years ago
- ☆20Jul 22, 2021Updated 4 years ago
- Named Entity Recognition☆19Feb 13, 2026Updated 3 weeks ago
- Code and dataset "ZEST" from "Learning from task descriptions", Weller et al, EMNLP 2020☆17Mar 15, 2021Updated 4 years ago
- Animal Adoption Platform by Django☆44Feb 13, 2014Updated 12 years ago
- ☆18Updated this week
- An Omeka S module for the transcription of structured data.☆16Nov 5, 2025Updated 4 months ago
- ⚙️Tool for NLP - handle file and text☆15Feb 16, 2025Updated last year
- Python code for producing emotionality scores from Gennaro and Ash (2021).☆20Dec 12, 2021Updated 4 years ago
- ISCC - Codec & Algorithms☆23Mar 2, 2026Updated last week
- Common tools for data processing☆22Dec 8, 2025Updated 3 months ago
- Text pattern search using marisa-trie☆18Jan 26, 2025Updated last year
- ☆18Nov 8, 2022Updated 3 years ago
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…☆28Apr 17, 2024Updated last year
- FHIR Ontop OMOP Implementation☆20May 8, 2024Updated last year
- Ergonomic line-by-line transcription of scanned text.☆54Feb 2, 2026Updated last month
- Materials for "Language Models for Law and Social Science" (ETH Zurich), Spring 2024☆29May 6, 2024Updated last year
- Multilingual Open Text☆25May 8, 2025Updated 10 months ago