An index of all open-source data
☆4,826Oct 6, 2025Updated 7 months ago
Alternatives and similar repositories for data
Users who are interested in data are comparing it to the libraries listed below.
- An unofficial repository of National Park Service data.☆1,256Apr 29, 2026Updated 2 weeks ago
- Assorted data from the General Services Administration.☆2,279Apr 17, 2024Updated 2 years ago
- ID3-based implementation of the ML Decision Tree algorithm☆1,481Oct 31, 2018Updated 7 years ago
- Cool links & research papers related to Machine Learning applied to source code (MLonCode)☆6,577Dec 3, 2020Updated 5 years ago
- Principal Component Analysis on music loops☆785May 11, 2017Updated 9 years ago
- Large-scale linear classification, regression and ranking in Python☆1,776Jul 18, 2023Updated 2 years ago
- Ruby gem to calculate the similarity between texts using tf*idf☆779Feb 26, 2024Updated 2 years ago
- ☆5,995Nov 19, 2023Updated 2 years ago
- Data and code behind the articles and graphics at FiveThirtyEight☆17,361Feb 25, 2025Updated last year
- A TensorFlow implementation of French-to-English machine translation using DeepMind's ByteNet.☆620Oct 8, 2021Updated 4 years ago
- A curated list of awesome computer vision resources☆23,258May 17, 2024Updated 2 years ago
- A curated list of resources dedicated to Natural Language Processing (NLP)☆18,522May 11, 2026Updated last week
- Apache Hadoop☆15,545Updated this week
- A curated list of awesome deep learning applications in the field of computational biology☆1,976Nov 7, 2021Updated 4 years ago
- Reinforcement learning resources curated☆9,758May 25, 2023Updated 2 years ago
- fast.ai Courses☆5,735Aug 2, 2024Updated last year
- Simple tutorials using Google's TensorFlow Framework☆6,027Aug 20, 2023Updated 2 years ago
- Shōgun☆3,066Dec 19, 2023Updated 2 years ago
- ☆2,851Jun 22, 2024Updated last year
- From the basics to slightly more interesting applications of Tensorflow☆5,667Dec 11, 2021Updated 4 years ago
- Theano was a Python library that allows you to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays…☆9,991Jan 15, 2024Updated 2 years ago
- machine learning and deep learning tutorials, articles and other resources☆17,805Jun 12, 2024Updated last year
- A toolkit for developing and comparing reinforcement learning algorithms.☆37,202Mar 26, 2026Updated last month
- TensorFlow - A curated list of dedicated resources http://tensorflow.org☆17,531Feb 8, 2026Updated 3 months ago
- scikit-learn: machine learning in Python☆66,096Updated this week
- A curated list of awesome Deep Learning tutorials, projects and communities.☆28,171May 26, 2025Updated 11 months ago
- PredictionIO, a machine learning server for developers and ML engineers.☆12,527Jan 9, 2021Updated 5 years ago
- A curated list of awesome Machine Learning frameworks, libraries and software.☆72,491May 12, 2026Updated last week
- StarCraft II Learning Environment☆8,284Jul 23, 2024Updated last year
- An Open Source Machine Learning Framework for Everyone☆195,139Updated this week
- Open Data Sources☆516May 8, 2018Updated 8 years ago
- Apache Spark - A unified analytics engine for large-scale data processing☆43,260Updated this week
- A toolkit for making real world machine learning and data analysis applications in C++☆14,380May 7, 2026Updated last week
- All of our computational notebooks☆712Dec 16, 2021Updated 4 years ago
- Fun with the Social Security Administration's baby name data☆646Dec 7, 2022Updated 3 years ago
- Census Reporter is a Knight News Challenge-funded project to make it easier for journalists to write stories using information from the U…☆811Apr 22, 2026Updated 3 weeks ago
- The Open Source Data Science Masters☆26,110Dec 3, 2023Updated 2 years ago
- Source code for the CERN Open Data portal☆769May 11, 2026Updated last week
- Deeplake is AI Data Runtime for Agents. It provides serverless postgres with a multimodal datalake, enabling scalable retrieval and train…☆9,120May 12, 2026Updated last week