The repository for the CMU Data Pipeline course. This year's course should use branch 2017
☆40May 2, 2017Updated 9 years ago
Alternatives and similar repositories for data
Users that are interested in data are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Plugin to create fake visits, websites, users and goals to populate Matomo reports☆21Apr 9, 2026Updated last month
- Implementation of Web Log Analysis in Scala and Apache Spark☆10Feb 8, 2015Updated 11 years ago
- A Real-time Apache log monitor using Kafka & Spark Streaming, with fake log generator.☆24Feb 19, 2020Updated 6 years ago
- Counting Twitter hashtags using Spark Streaming and Cassandra☆41Feb 16, 2015Updated 11 years ago
- This library is a wrapper for sklearn and works with data stored using Pandas module.☆17Mar 2, 2016Updated 10 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆11Dec 9, 2016Updated 9 years ago
- Real-time Machine Learning with Apache Spark on Twitter Public Stream☆68Apr 27, 2017Updated 9 years ago
- Genetic Algorithm Feature Engineering☆15Oct 3, 2017Updated 8 years ago
- This repo demonstrates how to load a sample Parquet formatted file from an AWS S3 Bucket. A python job will then be submitted to a Apach…☆19Jun 23, 2016Updated 9 years ago
- This data analysis provided information for the March 6th, 2018, NYC Open Data Week event hosted by the Two Sigma Data Clinic, "The State…☆13Jan 9, 2025Updated last year
- python interface to bnlearn and other probabilistic graphical model libraries☆10Mar 26, 2020Updated 6 years ago
- We use policy gradient to help agents learn optimal policies in a competitive multi-agent contextual bandit setting☆12Mar 9, 2018Updated 8 years ago
- CEVAE with VampPrior☆11Jul 18, 2018Updated 7 years ago
- My Data Engineering project @ Insight Data Science☆10Jul 23, 2018Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Differentiable Tree Ensembles☆21Feb 5, 2020Updated 6 years ago
- Freddie Mac Single Loan Data Analysis & Machine Learning (Regression / Classification)☆12Jun 11, 2017Updated 8 years ago
- Nonparametric estimators of the average treatment effect with doubly-robust confidence intervals and hypothesis tests☆20Jan 4, 2023Updated 3 years ago
- Open-source software for tracking and analyzing CarMax vehicle data☆13May 29, 2018Updated 7 years ago
- ☆13Sep 30, 2018Updated 7 years ago
- ☆15Aug 13, 2024Updated last year
- Notes and code for the second part of Econ 722 at UPenn☆18Feb 2, 2021Updated 5 years ago
- github upload file☆16Sep 20, 2016Updated 9 years ago
- Code and data for SciPy 2018 talk on missing data☆21Jun 29, 2018Updated 7 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A collection of Python scripts☆12Feb 7, 2020Updated 6 years ago
- My talk at Strata 2014 in Santa Clara, CA☆73Feb 18, 2014Updated 12 years ago
- Interview record☆15Mar 16, 2017Updated 9 years ago
- 编译语言实现模式例程☆11Nov 22, 2014Updated 11 years ago
- Mirror kept for legacy. Moved to https://github.com/llvm/llvm-project☆17Dec 14, 2016Updated 9 years ago
- ☆13Oct 23, 2018Updated 7 years ago
- Doing research on top of Jalangi☆12Sep 9, 2016Updated 9 years ago
- A key/value database based on SkimpyStash.☆13Jun 11, 2015Updated 10 years ago
- ☆10Dec 20, 2024Updated last year
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Multithreaded HTTP Download Accelerator☆23Jul 27, 2014Updated 11 years ago
- Classify Traffic Signs.☆10Jan 31, 2017Updated 9 years ago
- ☆40Sep 3, 2015Updated 10 years ago
- This is a repository created by Lei Huang to record Leetcode SQL practice.☆17Jun 27, 2020Updated 5 years ago
- ChatGPT Chrome Extension using Reactjs and TailwindCSS☆11Jan 5, 2023Updated 3 years ago
- Parallel programs with OpenMPI☆10Apr 1, 2015Updated 11 years ago
- Sourcecode & CAD drawings of NimbRo-OP☆28Oct 30, 2012Updated 13 years ago