A collection of small corpuses of interesting data for the creation of bots and similar stuff.
☆5,092 Jan 19, 2026 Updated 4 months ago
Alternatives and similar repositories for corpora
Users that are interested in corpora are comparing it to the libraries listed below.
- A simple Python interface for Darius Kazemi's Corpora Project. ☆124 Feb 7, 2020 Updated 6 years ago
- Tracery: a story-grammar generation library for javascript ☆2,199 Nov 3, 2024 Updated last year
- Python port of Kate Compton's Tracery text expansion library. ☆257 Mar 8, 2024 Updated 2 years ago
- National Novel Generation Month, 2015 edition. ☆340 Sep 30, 2023 Updated 2 years ago
- Notebooks and other materials for Reading and Writing Electronic Text ☆206 Apr 10, 2026 Updated last month
- A small module meant for use in text generators that lets you filter strings for bad words. ☆225 Jun 26, 2023 Updated 2 years ago
- National Novel Generation Month, 2016 edition. ☆161 Sep 30, 2023 Updated 2 years ago
- RiTa: the generative language toolkit ☆353 Dec 14, 2022 Updated 3 years ago
- I have this big list of links to text stuff that I like, so I thought I'd make it into a repository. ☆72 Feb 28, 2018 Updated 8 years ago
- National Novel Generation Month, 2017 edition. ☆185 Sep 30, 2023 Updated 2 years ago
- National Novel Generation Month, 2018 edition. ☆112 Sep 30, 2023 Updated 2 years ago
- National Novel Generation Month, 2014 edition. ☆256 Sep 30, 2023 Updated 2 years ago
- I wanted all of plaintext Project Gutenberg in an easy-to-use format, so I made this ☆231 Apr 27, 2023 Updated 3 years ago
- ☆81 Dec 29, 2021 Updated 4 years ago
- modest natural-language processing ☆12,093 Feb 25, 2026 Updated 2 months ago
- RiTa: the generative language toolkit (in JS) ☆268 Dec 2, 2022 Updated 3 years ago
- An informal syllabus for 'The Fundamentals of Computing' workshop at ITP, 2016 ☆16 Aug 16, 2022 Updated 3 years ago
- A bare-bones simulation-driven narrative framework ☆86 Dec 2, 2018 Updated 7 years ago
- Tracery: a story-grammar generation library for javascript ☆131 Nov 18, 2024 Updated last year
- National Novel Generation Month, 2019 edition. ☆97 Sep 30, 2023 Updated 2 years ago
- Experiments conducted for NaNoGenMo 2014 ☆25 Mar 4, 2024 Updated 2 years ago
- Creative Coding: Generative Art, Data visualization, Interaction Design, Resources. ☆14,817 Apr 1, 2026 Updated last month
- A dataset containing story plots from Wikipedia (books, movies, etc.) and the code for the extractor. ☆321 Sep 26, 2017 Updated 8 years ago
- The Big List of Naughty Strings is a list of strings which have a high probability of causing issues when used as user-input data. ☆47,635 Apr 18, 2024 Updated 2 years ago
- Bitmap & tilemap generation from a single example with the help of ideas from quantum mechanics ☆25,068 Mar 22, 2026 Updated 2 months ago
- A simple interface for the CMU pronouncing dictionary ☆321 Apr 17, 2026 Updated last month
- Syllabus and example code for 7-week class at NYU/ITP ☆48 Mar 10, 2017 Updated 9 years ago
- National Novel Generation Month, 2021 edition. ☆44 Sep 30, 2023 Updated 2 years ago
- National Novel Generation Month. Because. ☆184 Sep 30, 2023 Updated 2 years ago
- p5.js is a client-side JS platform that empowers artists, designers, students, and anyone to learn to code and express themselves creativ… ☆23,695 Updated this week
- Better twitterbots for all your friends~ ☆968 Jun 25, 2018 Updated 7 years ago
- A tool for authoring and implementing tracery grammars in Twine. ☆37 Jul 23, 2018 Updated 7 years ago
- Making Sense of Social Data / ITP Class ☆31 Nov 14, 2016 Updated 9 years ago
- A chrome extension for remote performances on other people's computers ☆49 Mar 21, 2022 Updated 4 years ago
- Lectures used in my pedagogy ☆310 Mar 9, 2026 Updated 2 months ago
- Find whole sentences matching a regex in Project Gutenberg ☆32 Feb 5, 2023 Updated 3 years ago
- Open source, experimental, and tiny tools roundup ☆1,776 Aug 13, 2024 Updated last year
- Use Markov chain generators in Tracery/cheapbotsdonequick bots ☆17 Jul 6, 2018 Updated 7 years ago
- Material for a class at SFPC ☆74 Apr 3, 2016 Updated 10 years ago