EleutherAI / datasets
🤗 The largest hub of ready-to-use NLP datasets for ML models with fast, easy-to-use and efficient data manipulation tools
☆12 · Jan 2, 2021 · Updated 5 years ago
Alternatives and similar repositories for datasets
Users who are interested in datasets are comparing it to the repositories listed below
- A python web scraper built on Selenium to gather profile data from okcupid.com ☆12 · Oct 15, 2022 · Updated 3 years ago
- This Sample iOS App demonstrates the vast capabilities of EnableX's WebRTC platform APIs and iOS Toolkit in building multi-party video co… ☆11 · Jul 29, 2025 · Updated 6 months ago
- Crash and burn the Gibson to take out the Da Vinci virus ☆12 · Dec 20, 2020 · Updated 5 years ago
- Unit support for numbers ☆24 · Jun 9, 2015 · Updated 10 years ago
- A project to render the entire Monero blockchain as fractal art. ☆12 · Oct 19, 2020 · Updated 5 years ago
- ☆11 · Mar 19, 2022 · Updated 3 years ago
- Simple bot for commenting on the stocks of items posted on /r/buildapcsales from microcenter.com ☆13 · Jan 20, 2018 · Updated 8 years ago
- Notes from my research for forest-lang ☆12 · Feb 28, 2022 · Updated 3 years ago
- A gopher daemon ☆12 · May 16, 2022 · Updated 3 years ago
- threejs mandelbrot viewer ☆10 · Mar 4, 2023 · Updated 2 years ago
- Learn to play guitar, bass, piano, synthesizer or drums using MIDI files. ☆14 · Jun 23, 2022 · Updated 3 years ago
- A chatbot for tourists ☆11 · Feb 8, 2019 · Updated 7 years ago
- (Work in progress) Code in C# for an educational game that uses MIDI within the Unity 3D game engine. Ultimately intended to replicate me… ☆12 · Aug 21, 2018 · Updated 7 years ago
- info-beamer hosted package SDK ☆11 · Dec 2, 2025 · Updated 2 months ago
- ☆21 · Jan 23, 2016 · Updated 10 years ago
- Deprecated, see: https://github.com/librato/statsd-librato-backend ☆15 · Aug 30, 2012 · Updated 13 years ago
- Code for the paper "Language Models are Unsupervised Multitask Learners" ☆10 · Oct 28, 2019 · Updated 6 years ago
- Snoop on Linux VFS I/O using bpftrace ☆11 · Jan 6, 2026 · Updated last month
- A Guitar Hero clone ☆11 · Jul 13, 2019 · Updated 6 years ago
- ☆22 · Oct 18, 2025 · Updated 3 months ago
- Scraper for PhET Science & Math Interactive Simulations ☆13 · Updated this week
- A tool to make Duck Duck Go easier to use on Android ☆23 · Jan 12, 2019 · Updated 7 years ago
- automatic data race analysis for Linux device drivers ☆12 · Jul 27, 2016 · Updated 9 years ago
- OSM mirror - full stack ☆18 · Feb 15, 2017 · Updated 8 years ago
- image sharpening algorithm ☆10 · Jul 9, 2018 · Updated 7 years ago
- Cuda-based matrix/vector computations ☆12 · Feb 25, 2020 · Updated 5 years ago
- Basic starter pack of voice commands for use with Talon Voice ☆11 · Mar 7, 2020 · Updated 5 years ago
- consume data from Environment and Climate Change Canada ☆13 · Jul 20, 2020 · Updated 5 years ago
- A Ruby library for interacting with the awesome javascript SourceMaps. ☆39 · Sep 5, 2017 · Updated 8 years ago
- ☆10 · Dec 19, 2020 · Updated 5 years ago
- Mastodon bot for generating Blast HardCheese-like names ☆10 · Oct 24, 2017 · Updated 8 years ago
- A wrapper around PStore to make persisting Ruby objects as easy as possible. ☆25 · Aug 1, 2016 · Updated 9 years ago
- Generate an RSS feed for Patreon posts ☆12 · Jan 21, 2024 · Updated 2 years ago
- A collection of papers on reinforcement learning applied to NLP ☆14 · Sep 7, 2018 · Updated 7 years ago
- Georgia Tech - OMSCS - CS7641 - Machine Learning Repository ☆12 · Nov 25, 2019 · Updated 6 years ago
- A LibreOffice Calc extension that clusters the rows in a table and colors them to indicate the clusters. ☆11 · Aug 11, 2025 · Updated 6 months ago
- A curated list of Natural Language Generation papers, tutorials, and blogs. ☆12 · Dec 13, 2018 · Updated 7 years ago
- Browser game inspired by Guitar Hero ☆13 · Aug 8, 2016 · Updated 9 years ago
- A concise open code of conduct for live Algorave events ☆16 · Jun 14, 2019 · Updated 6 years ago