Csv2Hive is an useful CSV schema finder for the Big Data. It discovers automatically schemas in big CSV files, generates the 'CREATE TABLE' statements and creates Hive tables. You don't need to writes any schemas at all. Csv2Hive is a really fast solution for integrating the whole CSV files into your DataLake.
☆27Oct 13, 2017Updated 8 years ago
Alternatives and similar repositories for Csv2Hive
Users that are interested in Csv2Hive are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Business Data Analysis by HiPIC of CalStateLA☆21Oct 26, 2018Updated 7 years ago
- Apache Zeppelin notebooks for Recommendation Engines using Keras and Machine Learning on Apache Spark☆32Nov 21, 2017Updated 8 years ago
- User-friendly HBase API for Scala☆15Nov 20, 2020Updated 5 years ago
- PySpark for Elastic Search☆55Mar 22, 2017Updated 9 years ago
- An extension of the kafka-python package that adds features like multiprocess consumers.☆39Aug 24, 2023Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Python scripts to facilitate easy working☆11Mar 23, 2026Updated 2 months ago
- Computer Science, Data Science and ML Fundamentals☆11May 30, 2025Updated last year
- Parses Facebook chat messages into Python objects to enable convenient analysis.☆10Jan 3, 2018Updated 8 years ago
- Social Context Analysis aNd Emotion Recognition☆12Jul 11, 2017Updated 8 years ago
- Sends public ip through e-mail. Command-line standalone.☆16Oct 16, 2016Updated 9 years ago
- Java code for Apache Nifi processors☆11Jun 5, 2017Updated 9 years ago
- A simple example for PySpark based project.☆11Jun 3, 2016Updated 10 years ago
- the 64th base of rfc4648☆21Feb 28, 2019Updated 7 years ago
- ☆12May 11, 2018Updated 8 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A NiFi client library for JVM languages☆13Mar 18, 2016Updated 10 years ago
- Keras implementation of the article "Solving internal covariate shift in deep learning with linked neurons"☆13Dec 8, 2017Updated 8 years ago
- DEPRECATED: Simple, fast user news feeds for Django☆52Jan 2, 2019Updated 7 years ago
- giter8 template for Scala projects using sbt☆39Nov 20, 2016Updated 9 years ago
- Simple command to colorize the stderr of a target program☆12Sep 20, 2017Updated 8 years ago
- ☆12Aug 29, 2015Updated 10 years ago
- A look at regulatory challenges and recommendation in the age of AI. Investigating topics like monopoly formation, machine learning audit…☆14Jun 7, 2019Updated 7 years ago
- A Scala Swing component that wraps javax.swing.JTree☆15Feb 4, 2013Updated 13 years ago
- Mainly on text documents. Implemented a Mini Search Engine using different algorithms and then summaried documents using lexrank.☆11Jan 19, 2018Updated 8 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- This is a skeleton of a Scala project with maven to start using Spark☆44May 2, 2015Updated 11 years ago
- This repository contains all the material for the MLTrain NIPS workshop☆10Dec 9, 2017Updated 8 years ago
- Python scripts to read a Portuguese Wikipedia XML dump file, parse it and generate plain text files.☆14Mar 12, 2014Updated 12 years ago
- Mirror of https://picoforge.int-evry.fr/cgi-bin/twiki/view/Gpucv/Web☆14Dec 10, 2015Updated 10 years ago
- Kaggle's AXA Driver Telematics Analysis☆10Mar 16, 2015Updated 11 years ago
- Machine Learning based model to predict Insurance Pure Premium☆13Jan 24, 2017Updated 9 years ago
- Lecture notes for the "Programming with Python" course I have taught in Spring 2015. at The University of Manchester☆10Dec 21, 2015Updated 10 years ago
- Introduction to Pandas, Scikit-Learn and Keras☆14Aug 27, 2019Updated 6 years ago
- 🌍 Configuration files for Jupyter features.☆11Nov 12, 2018Updated 7 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Machine Learning for Internet of Things☆12Jul 24, 2019Updated 6 years ago
- ☆13Jun 30, 2019Updated 6 years ago
- Get creation time of files for any platform - no external dependencies☆15May 28, 2019Updated 7 years ago
- Minimum Entropy is a DDL hosted question/answer site for beginners who need answers to Data Science questions.☆16Jul 11, 2016Updated 9 years ago
- A hypothetical proof-of-concept book recommendation system for Project Gutenberg, using Natural Language Processing.☆11Mar 17, 2016Updated 10 years ago
- Implementation of W3C's R2RML and Direct Mapping specifications☆10Oct 12, 2020Updated 5 years ago
- Dependency and data pipeline management framework for Spark and Scala☆15Apr 8, 2017Updated 9 years ago