A collection of small corpuses of interesting data for the creation of bots and similar stuff.
☆5,099 · Jan 19, 2026 · Updated 5 months ago
Alternatives and similar repositories for corpora
Users that are interested in corpora are comparing it to the libraries listed below.
- Tracery: a story-grammar generation library for javascript ☆2,197 · Nov 3, 2024 · Updated last year
- Python port of Kate Compton's Tracery text expansion library. ☆257 · Mar 8, 2024 · Updated 2 years ago
- National Novel Generation Month, 2015 edition. ☆340 · Sep 30, 2023 · Updated 2 years ago
- Notebooks and other materials for Reading and Writing Electronic Text ☆206 · Apr 10, 2026 · Updated 2 months ago
- A small module meant for use in text generators that lets you filter strings for bad words. ☆225 · Jun 26, 2023 · Updated 3 years ago
- National Novel Generation Month, 2016 edition. ☆161 · Sep 30, 2023 · Updated 2 years ago
- RiTa: the generative language toolkit ☆352 · Dec 14, 2022 · Updated 3 years ago
- I have this big list of links to text stuff that I like, so I thought I'd make it into a repository. ☆72 · Feb 28, 2018 · Updated 8 years ago
- National Novel Generation Month, 2017 edition. ☆185 · Sep 30, 2023 · Updated 2 years ago
- National Novel Generation Month, 2018 edition. ☆112 · Sep 30, 2023 · Updated 2 years ago
- National Novel Generation Month, 2014 edition. ☆256 · Sep 30, 2023 · Updated 2 years ago
- I wanted all of plaintext Project Gutenberg in an easy-to-use format, so I made this ☆234 · Apr 27, 2023 · Updated 3 years ago
- modest natural-language processing ☆12,126 · Jun 23, 2026 · Updated last week
- RiTa: the generative language toolkit (in JS) ☆267 · Dec 2, 2022 · Updated 3 years ago
- A bare-bones simulation-driven narrative framework ☆86 · Dec 2, 2018 · Updated 7 years ago
- Tracery: a story-grammar generation library for javascript ☆131 · Nov 18, 2024 · Updated last year
- National Novel Generation Month, 2019 edition. ☆97 · Sep 30, 2023 · Updated 2 years ago
- Experiments conducted for NaNoGenMo 2014 ☆25 · Mar 4, 2024 · Updated 2 years ago
- Creative Coding: Generative Art, Data visualization, Interaction Design, Resources. ☆14,985 · Jun 10, 2026 · Updated 3 weeks ago
- A dataset containing story plots from Wikipedia (books, movies, etc.) and the code for the extractor. ☆322 · Sep 26, 2017 · Updated 8 years ago
- Bitmap & tilemap generation from a single example with the help of ideas from quantum mechanics ☆25,158 · Mar 22, 2026 · Updated 3 months ago
- The Big List of Naughty Strings is a list of strings which have a high probability of causing issues when used as user-input data. ☆47,693 · Apr 18, 2024 · Updated 2 years ago
- A simple interface for the CMU pronouncing dictionary ☆321 · Apr 17, 2026 · Updated 2 months ago
- Syllabus and example code for 7-week class at NYU/ITP ☆48 · Mar 10, 2017 · Updated 9 years ago
- National Novel Generation Month, 2021 edition. ☆44 · Sep 30, 2023 · Updated 2 years ago
- National Novel Generation Month. Because. ☆184 · Sep 30, 2023 · Updated 2 years ago
- p5.js is a client-side JS platform that empowers artists, designers, students, and anyone to learn to code and express themselves creativ… ☆23,756 · Jun 24, 2026 · Updated last week
- Better twitterbots for all your friends~ ☆968 · Jun 25, 2018 · Updated 8 years ago
- Making Sense of Social Data / ITP Class ☆31 · Nov 14, 2016 · Updated 9 years ago
- A chrome extension for remote performances on other people's computers ☆49 · Mar 21, 2022 · Updated 4 years ago
- Find whole sentences matching a regex in Project Gutenberg ☆32 · Feb 5, 2023 · Updated 3 years ago
- Open source, experimental, and tiny tools roundup ☆1,789 · Aug 13, 2024 · Updated last year
- Use Markov chain generators in Tracery/cheapbotsdonequick bots ☆17 · Jul 6, 2018 · Updated 7 years ago
- Material for a class at SFPC ☆74 · Apr 3, 2016 · Updated 10 years ago
- A simple example Twitter bot using NodeJS. ☆223 · Jun 26, 2023 · Updated 3 years ago
- My NaNoGenMo project for 2014 ☆18 · Nov 30, 2014 · Updated 11 years ago
- A grunt init template for making Twitter bots, preloaded with some useful libs. ☆60 · Oct 29, 2015 · Updated 10 years ago
- (no description) ☆83 · Jul 27, 2017 · Updated 8 years ago
- Repository for ITP Fall 2015 Course ☆100 · Nov 4, 2016 · Updated 9 years ago