jmankoff/data

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/jmankoff/data)

jmankoff / data

The repository for the CMU Data Pipeline course. This year's course should use branch 2017

☆40

Alternatives and similar repositories for data

Users that are interested in data are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

matomo-org / plugin-VisitorGenerator
View on GitHub
Plugin to create fake visits, websites, users and goals to populate Matomo reports
☆22Updated this week
skrusche63 / spark-weblog
View on GitHub
Implementation of Web Log Analysis in Scala and Apache Spark
☆10Feb 8, 2015Updated 11 years ago
xiaorancs / feature-select
View on GitHub
featselector是一个基于统计分析和模型选择的特征选择器.
☆14Mar 4, 2019Updated 7 years ago
nitish6174 / video-search-engine
View on GitHub
Flask-based application using MySQL, MongoDB and Neo4j for storing video data and provides interface to search video and show related vid…
☆11Apr 23, 2017Updated 9 years ago
amir-rahnama / pyspark-twitter-stream-mining
View on GitHub
Real-time Machine Learning with Apache Spark on Twitter Public Stream
☆69Apr 27, 2017Updated 9 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
bythebay / pipeline
View on GitHub
Complete Pipeline Training at Big Data Scala By the Bay
☆71Oct 27, 2015Updated 10 years ago
cs224 / pybnl
View on GitHub
python interface to bnlearn and other probabilistic graphical model libraries
☆10Mar 26, 2020Updated 6 years ago
CSKrishna / Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting
View on GitHub
We use policy gradient to help agents learn optimal policies in a competitive multi-agent contextual bandit setting
☆12Mar 9, 2018Updated 8 years ago
shree-ranga / DSTakehomeChallenges
View on GitHub
Solutions to the book "Collection of Data Science TakeHome Challenges" in Python.
☆10Nov 15, 2017Updated 8 years ago
AndreyBozhko / TaxiOptimizer
View on GitHub
My Data Engineering project @ Insight Data Science
☆10Jul 23, 2018Updated 8 years ago
mdlindsey / DealerData
View on GitHub
Open-source software for tracking and analyzing CarMax vehicle data
☆13May 29, 2018Updated 8 years ago
NoRaincheck / TreeGrad
View on GitHub
Differentiable Tree Ensembles
☆21May 25, 2026Updated 2 months ago
LangYujian / DataScience
View on GitHub
☆10May 10, 2017Updated 9 years ago
liguigui / kamyu104-LeetCode
View on GitHub
☆13Sep 30, 2018Updated 7 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
5agado / knowledge-extraction
View on GitHub
From Natural Language Text to Graph Database
☆31Mar 3, 2016Updated 10 years ago
benb116 / PennCourseSearch
View on GitHub
A web app designed to help Penn students find classes and make schedules
☆13Oct 25, 2019Updated 6 years ago
fditraglia / econ722
View on GitHub
Notes and code for the second part of Econ 722 at UPenn
☆19Feb 2, 2021Updated 5 years ago
deniederhut / safe-handling-instructions-for-missing-data
View on GitHub
Code and data for SciPy 2018 talk on missing data
☆21Jun 29, 2018Updated 8 years ago
nickola / stock-prices-storage
View on GitHub
Simple storage for stock prices with adjusted prices calculation based on Center for Research in Security Prices (CRSP) standards
☆12Feb 15, 2018Updated 8 years ago
woniesong92 / humanjobs
View on GitHub
HumanJobs is a ChatGPT Plugin that lets ChatGPT create job postings only for humans
☆14Apr 15, 2023Updated 3 years ago
devnan / interview
View on GitHub
Interview record
☆15Mar 16, 2017Updated 9 years ago
llvm-mirror / klee
View on GitHub
Mirror kept for legacy. Moved to https://github.com/llvm/llvm-project
☆17Dec 14, 2016Updated 9 years ago
hammer / avro-tools
View on GitHub
A collection of tools that help me work with Avro
☆23Jan 7, 2010Updated 16 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
seanbaxter / gtc_2014
View on GitHub
Companion source code for GTC 2014 talk
☆11Mar 25, 2014Updated 12 years ago
8080labs / showcases
View on GitHub
A collection of Python scripts
☆12Feb 7, 2020Updated 6 years ago
amilypku / Leetcode-SQL-rewrite-using-Python-
View on GitHub
This is a repository created by Lei Huang to record Leetcode SQL practice.
☆17Jun 27, 2020Updated 6 years ago
JonathanJohann / Independent_Research
View on GitHub
Welcome to my independent research repository!
☆17Nov 18, 2016Updated 9 years ago
Shao-Jie / SkimpyStash
View on GitHub
A key/value database based on SkimpyStash.
☆13Jun 11, 2015Updated 11 years ago
paramaggarwal / CarND-Traffic-Sign-Classifier-Project
View on GitHub
Classify Traffic Signs.
☆10Jan 31, 2017Updated 9 years ago
Alexoner / vehicle-assistance
View on GitHub
vehicle-assistance including lane and vehicle detection and track.
☆11Sep 29, 2013Updated 12 years ago
tomfaulhaber / geo-window
View on GitHub
Simple spatio-temporal windowing in Kafka Streams
☆13Jul 14, 2016Updated 10 years ago
nukui-s / tfautoencoder
View on GitHub
Auto Encoder on Tensorflow
☆12Oct 18, 2017Updated 8 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
seguri / GetForegroundActivity
View on GitHub
Gets name of current foreground activity
☆13Jan 21, 2017Updated 9 years ago
MiuLab / Spk-Dialogue
View on GitHub
Speaker Role Contextual Model for Dialogues
☆15Sep 30, 2017Updated 8 years ago
NYUBigDataProject / SparkClean
View on GitHub
A Scalable Data Cleaning Library for PySpark.
☆29Apr 4, 2019Updated 7 years ago
epfl-labos / eagle
View on GitHub
☆13Jan 16, 2019Updated 7 years ago
UltravioletAnalytics / text-features
View on GitHub
Jupyter notebook containing code from text preprocessing blog post
☆10Nov 29, 2016Updated 9 years ago
MillionIntegrals / ESL
View on GitHub
Algorithms from the book "Elements of Statistical Learning", implemented in Python
☆13Mar 29, 2015Updated 11 years ago
HouJP / my-mllib
View on GitHub
The project implemented some machine learning algorithms on spark which is written in scala and it also included standalone implementatio…
☆16Jan 3, 2022Updated 4 years ago