wilkens-teaching / info3350-f20View external linksLinks
Cornell INFO 3350: Text mining for history and literature, Fall 2020
☆10Jan 14, 2021Updated 5 years ago
Alternatives and similar repositories for info3350-f20
Users that are interested in info3350-f20 are comparing it to the libraries listed below
Sorting:
- A Python wrapper around the topic modeling functions of MALLET.☆105Nov 1, 2024Updated last year
- Keywords and phrases that can be used for identifying mental-health-related conversation on Twitter☆12Jun 18, 2020Updated 5 years ago
- ☆12Aug 14, 2019Updated 6 years ago
- Tools for normalizing the use of some characters and checking file consistencies☆11Jan 12, 2026Updated last month
- Two-Step Approach to OCR Post-Correction☆14May 24, 2024Updated last year
- Scrapes headlines from CNN and FOX, then has ChatGPT do cross-analysis☆11Apr 19, 2023Updated 2 years ago
- ☆14Oct 21, 2022Updated 3 years ago
- Probe how GPT-n performs on statutory reasoning☆10Sep 17, 2024Updated last year
- Documents the style side of the short-story Creative Writing LLM benchmark: we generated many short stories with a range of LLMs, then an…☆22Dec 18, 2025Updated last month
- Extracts per-sentence subtitles + audio from a subtitle file + video file.☆12Oct 1, 2019Updated 6 years ago
- Workshop "Analyzing Social Media Data" at the Big Data and Development Conference☆11Sep 11, 2023Updated 2 years ago
- Code related to "Explaining and Generalizing Skip-Gram through Exponential Family Principal Component Analysis" (EACL 2017)☆11Feb 5, 2018Updated 8 years ago
- Tropy plugin for exporting items into Omeka☆11Apr 20, 2023Updated 2 years ago
- This is an introduction to Chinese words segmentation using Jieba.☆14May 31, 2018Updated 7 years ago
- Geolocation Inference for Reddit☆12Jun 17, 2024Updated last year
- a repository containing the details of natural language inference dataset in Hindi☆14Dec 28, 2020Updated 5 years ago
- ☆13Jan 8, 2021Updated 5 years ago
- Teaching materials for the deep learning course.☆17Feb 2, 2026Updated 2 weeks ago
- Uncertain natural language inference☆15Jun 12, 2023Updated 2 years ago
- Create handwritten word embeddings from a text recognition Seq2Seq system.☆10Dec 1, 2022Updated 3 years ago
- This code implements a basic, Twitter-aware tokenizer.☆12Feb 8, 2024Updated 2 years ago
- ☆12Apr 1, 2025Updated 10 months ago
- ☆14Apr 19, 2022Updated 3 years ago
- L&S 88-5 Connector Course to Data 8☆15Apr 12, 2018Updated 7 years ago
- VIAF via Python☆13Jun 3, 2025Updated 8 months ago
- ☆18Jan 27, 2026Updated 3 weeks ago
- Senior A.I. project to generate realistic news articles like those found on CNN, NYTimes, Fox News, etc. Future research will involve con…☆15Apr 26, 2019Updated 6 years ago
- A scikit-learn compliant implementation of Monroe et al.'s Fightin' Words analysis method.☆11Mar 10, 2019Updated 6 years ago
- An implementation of Tiling and Corruption (TACo) Augmentations for OCR/HTR☆15Dec 4, 2021Updated 4 years ago
- Code and data related to "Efficient, Compositional, Order-Sensitive n-gram Embeddings" (EACL 2017)☆15Apr 6, 2017Updated 8 years ago
- This is the course repository for the Spring 2021 iteration of MACS 30123 "Large-Scale Computing for the Social Sciences" at the Universi…☆15May 25, 2021Updated 4 years ago
- Scrapes the web. Gets the news.☆13Sep 6, 2016Updated 9 years ago
- Data and code for Natural Language Inference with Multiple Premises☆13May 15, 2019Updated 6 years ago
- A large (>5k) collection of search questions asked about Coronavirus 🦠☆14Mar 21, 2020Updated 5 years ago
- ☆14Jul 11, 2022Updated 3 years ago
- Simple Python wrapper for querying data with TikTok's research API☆13Dec 25, 2023Updated 2 years ago
- TikTok-Teller: A TikTok Video Scraping and Content Analysis Tool☆19Nov 20, 2023Updated 2 years ago
- Data and code for analyzing language associated with fictional characters.☆15Jan 6, 2018Updated 8 years ago
- Collections of english historical texts and data relating to them☆19Mar 24, 2021Updated 4 years ago