A tool to automatically infer column data types in .csv files
☆37 Jan 28, 2023 Updated 3 years ago
Alternatives and similar repositories for csv-schema-inference
Users who are interested in csv-schema-inference are comparing it to the libraries listed below.
- The goal of this project is to offer an AWS EMR template using Spot Fleet and On-Demand Instances that you can use quickly. Just focus on… ☆28 Jun 13, 2022 Updated 3 years ago
- real time log event processing using spark, kafka & cassandra ☆13 Dec 4, 2014 Updated 11 years ago
- Exploring Chicago crimes dataset with Jupyter notebooks, DuckDB, Malloy and new Panel/PyScript data and dashboard tools. ☆39 Jan 29, 2023 Updated 3 years ago
- TrafficAdvisor: a Real-Time Traffic Monitoring System ☆14 Sep 10, 2018 Updated 7 years ago
- A simple pipeline utilising cron, Postgres, AWS EC2, and Metabase ☆12 Jul 9, 2024 Updated last year
- Example project for building scalable data pipelines with Kedro and Ibis. ☆14 Dec 10, 2025 Updated 4 months ago
- ☆12 Aug 10, 2025 Updated 8 months ago
- Backup zfs snapshots to S3. ☆14 Oct 7, 2020 Updated 5 years ago
- Insight Data Engineering Project ☆15 Jun 1, 2021 Updated 4 years ago
- Overlapping-generations macroeconomic model for evaluating fiscal policy in the United States ☆28 Apr 4, 2026 Updated 2 weeks ago
- ☆17 May 22, 2024 Updated last year
- macOS Artifact Intelligence Tool ☆13 Apr 30, 2019 Updated 6 years ago
- Pyspark Spotify ETL ☆17 Aug 19, 2021 Updated 4 years ago
- Data Engineering pipeline hosted entirely in the AWS ecosystem utilizing DocumentDB as the database ☆14 Oct 26, 2021 Updated 4 years ago
- Prototype your Jupyter Widget in the browser with anywidget and JupyterLite 💡 ☆17 Apr 7, 2025 Updated last year
- sneakernet ☆80 Jun 17, 2010 Updated 15 years ago
- Documents versions for France services. Maintained collaboratively by volunteer contributors. ☆13 Apr 13, 2026 Updated last week
- Code base for the practitioner's guide to the ONC algorithm paper published with the Journal of Financial Data Science ☆20 Jun 8, 2023 Updated 2 years ago
- IBM Applied Data Science Capstone Project ☆22 Jun 9, 2025 Updated 10 months ago
- Example of an ELF parser to learn about the ELF format ☆11 Oct 6, 2024 Updated last year
- EverLoader for EverSD ☆12 Jul 3, 2024 Updated last year
- A Python utility that will transform an image of the Earth to a template that you can cut and assemble by yourself ☆14 Jul 1, 2024 Updated last year
- JobAnalytics system consumes data from multiple sources and provides valuable information to both job hunters and recruiters. ☆31 Dec 8, 2022 Updated 3 years ago
- Login Area Finder: scans host/s for login panels ☆14 Sep 21, 2014 Updated 11 years ago
- A software held to act as a charm to avert spam and bring good messages ☆10 Mar 30, 2024 Updated 2 years ago
- ☆16 May 10, 2023 Updated 2 years ago
- Sulci is a French textmining toolkit based on Libération corpus and thesaurus. ☆23 Jul 21, 2022 Updated 3 years ago
- Automatically discover any webmentions and send them after every production build ☆27 Mar 5, 2021 Updated 5 years ago
- A multidisciplinary working tool for cultivating rituals of peer-to-peer practice. ☆13 Nov 25, 2025 Updated 4 months ago
- Spark app to merge different schemas ☆23 Dec 21, 2020 Updated 5 years ago
- Curated list of yield farms and tools 🤑 ☆12 Jul 15, 2021 Updated 4 years ago
- Parisian sidewalks ☆13 Mar 1, 2021 Updated 5 years ago
- Kalman Filter, Smoother, and EM Algorithm for Python ☆12 Sep 4, 2023 Updated 2 years ago
- A real-time event pipeline around Kafka Ecosystem for Chicago Transit Authority. ☆32 Aug 14, 2023 Updated 2 years ago
- Using DuckDB with AWS Lambda to process Delta Lake data ☆33 Jan 26, 2025 Updated last year
- exemplar code to download all option chains for a symbol using pyetrade (V1 Etrade API) ☆11 Sep 28, 2021 Updated 4 years ago
- Just starting your DE journey or along the way already? I will be sharing a short list of DATA-ENGINEERING-CENTRED books that covers the… ☆34 Jul 4, 2022 Updated 3 years ago
- Python library for MIME type parsing, normalisation and grouping. ☆13 Nov 13, 2024 Updated last year
- CTI-URLScan is a command line tool to enable analysts to search URLscan.io submissions. Pull screenshot and DOM content. As well as, auto… ☆11 Mar 2, 2021 Updated 5 years ago