Code snippets for data acquisition and organization in data science.
☆22 · Jun 3, 2016 · Updated 9 years ago
Alternatives and similar repositories for data-science
Users that are interested in data-science are comparing it to the libraries listed below.
- ☆62 · Jan 15, 2019 · Updated 7 years ago
- Vizlinc ☆15 · Jan 14, 2016 · Updated 10 years ago
- Jupyter Notebook tips and tricks for the Berkeley Institute for Data Science lecture. http://bids.berkeley.edu/ ☆28 · Jan 20, 2016 · Updated 10 years ago
- simple ansible playbook to take clean ubuntu 18.04 to CUDA 10, PyTorch 1.0, fastai, miniconda heaven ☆12 · Dec 16, 2018 · Updated 7 years ago
- Exploration of spark streaming based on the BigData.be project 2 ☆15 · Sep 2, 2013 · Updated 12 years ago
- Meta-repository of big data tools -- source and essential plugins for hadoop, pig, wukong, storm, kafka etc. ☆30 · Jun 29, 2014 · Updated 11 years ago
- Part-of-speech tagger for the English language ☆10 · Jul 31, 2018 · Updated 7 years ago
- ☆12 · Dec 19, 2016 · Updated 9 years ago
- A Java binding for MeCab ☆11 · Nov 24, 2020 · Updated 5 years ago
- ☆13 · Apr 23, 2017 · Updated 8 years ago
- A work-in-progress book on Dask ☆12 · Jul 15, 2023 · Updated 2 years ago
- A benchmark of globally-optimal anonymization methods for biomedical data ☆16 · Dec 11, 2014 · Updated 11 years ago
- A paper comparing Dask and Spark ☆10 · Dec 9, 2022 · Updated 3 years ago
- Dataiku DSS plugin template with continuous integration. Test your plugins, release them faster ⚡️ ☆11 · Sep 23, 2025 · Updated 6 months ago
- The code that I used in Click-Through Rate Prediction (http://www.kaggle.com/c/avazu-ctr-prediction/) (C++). It implements the Follow The… ☆12 · Mar 2, 2015 · Updated 11 years ago
- Contains information and instructions for the first Data Mining lab session for 2017 Fall. ☆14 · Oct 7, 2018 · Updated 7 years ago
- Spacy model trained based on Norwegian corpus converted from OBT to Universal dep. ☆13 · Jan 31, 2018 · Updated 8 years ago
- R coding and notes ☆47 · Jun 25, 2022 · Updated 3 years ago
- Question Answering system based on Skip-Thought Memory Networks ☆17 · Mar 25, 2020 · Updated 6 years ago
- R package for accessing the StatisticsNZ API ☆10 · Feb 20, 2023 · Updated 3 years ago
- Applied Machine Learning projects from my site: http://www.appliedprogramming.net/machine-learning/home.html ☆15 · Jul 23, 2020 · Updated 5 years ago
- Python code to seasonally adjust data using the census X12-ARIMA program: http://www.census.gov/srd/www/x12a/ ☆11 · Mar 22, 2012 · Updated 14 years ago
- Legoo: A collection of automation modules to build analytics infrastructure ☆20 · Jul 24, 2020 · Updated 5 years ago
- Webapp for fetching vehicle positions from a SIRI Vehicle Monitoring (VM)-feed and displaying a map via Leaflet w/ plugins. ☆18 · Feb 2, 2017 · Updated 9 years ago
- Solution Accelerator: Using Logic Apps & Form Recognizer ☆15 · Sep 22, 2023 · Updated 2 years ago
- BlackOut and Adaptive Softmax for language models by Chainer ☆11 · Oct 20, 2017 · Updated 8 years ago
- DataNode Volumes Rebalancing tool for Apache Hadoop HDFS (HDFS-1312) ☆23 · Dec 12, 2017 · Updated 8 years ago
- This repo contains all the codes and sample files for the "Short and Long-term Pattern Discovery Over Large-Scale Geo-Spatiotemporal Data… ☆13 · May 19, 2022 · Updated 3 years ago
- Lazily regularized updates for Adagrad with sparse features. Implemented in Cython for efficiency. ☆11 · Jan 2, 2021 · Updated 5 years ago
- Create hadoop cluster in aws ec2 for development ☆11 · Sep 8, 2017 · Updated 8 years ago
- A demo for VR-SGD (Comparing to some state-of-the-art algorithms). ☆13 · Mar 5, 2018 · Updated 8 years ago
- Implement GRU or CNN in python/theano to learn the sentences representation for coherence, answer selecting or dialogue. ☆15 · Apr 22, 2017 · Updated 8 years ago
- Simple A/B testing framework for Python ☆27 · Nov 7, 2011 · Updated 14 years ago
- Language model using an RNN in PyTorch ☆18 · Jun 3, 2018 · Updated 7 years ago
- A starting point for GTD implementation. This describes a general approach to implementing Getting Things Done by David Allen, but using… ☆11 · Sep 3, 2024 · Updated last year
- Most recent/important talks given at conferences/meetups ☆14 · Nov 27, 2020 · Updated 5 years ago
- R interface to Kusto/Azure Data Explorer. Submit issues and PRs at https://github.com/Azure/AzureKusto ☆18 · Oct 13, 2023 · Updated 2 years ago
- Kaggle Competition BNP Paribas Cardif Claims Management: Rank 133 out of 2,926 (Top 5%) ☆14 · May 10, 2016 · Updated 9 years ago
- Visual programming solution No-Code ☆21 · Sep 10, 2024 · Updated last year