A collection of small corpuses of interesting data for the creation of bots and similar stuff.
☆5,096 · Jan 19, 2026 · Updated 4 months ago
Alternatives and similar repositories for corpora
Users that are interested in corpora are comparing it to the libraries listed below.
- A simple Python interface for Darius Kazemi's Corpora Project. ☆124 · Feb 7, 2020 · Updated 6 years ago
- Tracery: a story-grammar generation library for javascript ☆2,196 · Nov 3, 2024 · Updated last year
- Python port of Kate Compton's Tracery text expansion library. ☆257 · Mar 8, 2024 · Updated 2 years ago
- National Novel Generation Month, 2015 edition. ☆340 · Sep 30, 2023 · Updated 2 years ago
- Notebooks and other materials for Reading and Writing Electronic Text ☆206 · Apr 10, 2026 · Updated 2 months ago
- A small module meant for use in text generators that lets you filter strings for bad words. ☆225 · Jun 26, 2023 · Updated 2 years ago
- National Novel Generation Month, 2016 edition. ☆161 · Sep 30, 2023 · Updated 2 years ago
- RiTa: the generative language toolkit ☆353 · Dec 14, 2022 · Updated 3 years ago
- I have this big list of links to text stuff that I like, so I thought I'd make it into a repository. ☆72 · Feb 28, 2018 · Updated 8 years ago
- National Novel Generation Month, 2017 edition. ☆185 · Sep 30, 2023 · Updated 2 years ago
- National Novel Generation Month, 2018 edition. ☆112 · Sep 30, 2023 · Updated 2 years ago
- National Novel Generation Month, 2014 edition. ☆256 · Sep 30, 2023 · Updated 2 years ago
- I wanted all of plaintext Project Gutenberg in an easy-to-use format, so I made this ☆234 · Apr 27, 2023 · Updated 3 years ago
- ☆81 · Dec 29, 2021 · Updated 4 years ago
- modest natural-language processing ☆12,121 · Jun 9, 2026 · Updated last week
- RiTa: the generative language toolkit (in JS) ☆268 · Dec 2, 2022 · Updated 3 years ago
- An informal syllabus for 'The Fundamentals of Computing' workshop at ITP, 2016 ☆16 · Aug 16, 2022 · Updated 3 years ago
- A bare-bones simulation-driven narrative framework ☆86 · Dec 2, 2018 · Updated 7 years ago
- Tracery: a story-grammar generation library for javascript ☆131 · Nov 18, 2024 · Updated last year
- National Novel Generation Month, 2019 edition. ☆97 · Sep 30, 2023 · Updated 2 years ago
- Experiments conducted for NaNoGenMo 2014 ☆25 · Mar 4, 2024 · Updated 2 years ago
- Creative Coding: Generative Art, Data visualization, Interaction Design, Resources. ☆14,918 · Jun 10, 2026 · Updated last week
- A dataset containing story plots from Wikipedia (books, movies, etc.) and the code for the extractor. ☆322 · Sep 26, 2017 · Updated 8 years ago
- Bitmap & tilemap generation from a single example with the help of ideas from quantum mechanics ☆25,134 · Mar 22, 2026 · Updated 2 months ago
- The Big List of Naughty Strings is a list of strings which have a high probability of causing issues when used as user-input data. ☆47,678 · Apr 18, 2024 · Updated 2 years ago
- A simple interface for the CMU pronouncing dictionary ☆321 · Apr 17, 2026 · Updated 2 months ago
- Syllabus and example code for 7-week class at NYU/ITP ☆48 · Mar 10, 2017 · Updated 9 years ago
- National Novel Generation Month, 2021 edition. ☆44 · Sep 30, 2023 · Updated 2 years ago
- National Novel Generation Month. Because. ☆184 · Sep 30, 2023 · Updated 2 years ago
- p5.js is a client-side JS platform that empowers artists, designers, students, and anyone to learn to code and express themselves creativ… ☆23,737 · Updated this week
- Better twitterbots for all your friends~ ☆968 · Jun 25, 2018 · Updated 7 years ago
- A tool for authoring and implementing tracery grammars in Twine. ☆37 · Jul 23, 2018 · Updated 7 years ago
- Making Sense of Social Data / ITP Class ☆31 · Nov 14, 2016 · Updated 9 years ago
- A chrome extension for remote performances on other people's computers ☆49 · Mar 21, 2022 · Updated 4 years ago
- Lectures used in my pedagogy ☆312 · Mar 9, 2026 · Updated 3 months ago
- Find whole sentences matching a regex in Project Gutenberg ☆32 · Feb 5, 2023 · Updated 3 years ago
- Open source, experimental, and tiny tools roundup ☆1,784 · Aug 13, 2024 · Updated last year
- Use Markov chain generators in Tracery/cheapbotsdonequick bots ☆17 · Jul 6, 2018 · Updated 7 years ago
- Material for a class at SFPC ☆74 · Apr 3, 2016 · Updated 10 years ago