pmgraham / datagrunt
Datagrunt is a Python library designed to simplify the way you work with CSV files. It provides a streamlined approach to reading, processing, and transforming your data into various formats, making data manipulation efficient and intuitive.
☆9 · Updated this week
Alternatives and similar repositories for datagrunt
Users who are interested in datagrunt are comparing it to the libraries listed below.
- Data API and micro orm for DuckDB and MotherDuck ☆10 · Updated 6 months ago
- API for distributing Data Lake Data ☆11 · Updated 3 months ago
- Use cases examples using Versions ☆11 · Updated 2 months ago
- The code to follow along our tutorials for the dlt rest_api source ☆10 · Updated last year
- Data Quality Monitor (DQM) - Continuously validate your data with easy, customizable rules. ☆37 · Updated last year
- ☆24 · Updated 5 months ago
- A pipeline to convert contextual knowledge stored in documents and databases into text embeddings, and store them in a vector store ☆19 · Updated 3 months ago
- Boiling Insights - From raw S3 data to charts in seconds ☆19 · Updated 7 months ago
- ☆8 · Updated 11 months ago
- ☆12 · Updated last year
- dApp authentication with Amazon Cognito and Web3 proxy with Amazon API Gateway ☆13 · Updated 11 months ago
- Build a directory full of files into a SQLite database ☆12 · Updated last year
- HTTPFS extension for DuckDB. Adds support for an HTTPFileSystem and S3FileSystem. ☆18 · Updated 8 months ago
- duckdb-etl-framework ☆12 · Updated 6 months ago
- Identify and tokenize sensitive data automatically using Cloud DLP and Dataflow ☆43 · Updated 2 weeks ago
- Chaos Engineering Framework across Private / Public / Hybrid Cloud Environments ☆13 · Updated last year
- This Script gets CSV file from Amazon S3 using Python Library Boto3 and converts it to Parquet Format before uploading the new Parquet Ve… ☆9 · Updated 2 months ago
- A collection of DuckDB queries for tutorial/getting started purpose ☆37 · Updated 4 months ago
- This repo contains examples of high throughput ingestion using Apache Spark and Apache Iceberg. These examples cover IoT and CDC scenario… ☆26 · Updated 2 weeks ago
- ☆10 · Updated 10 months ago
- A collection of real-time detection methods built with Tinybird. Methods include rate-of-change, out-of-range, timeout, Z-score, and Inte… ☆11 · Updated last year
- ☆11 · Updated 5 months ago
- DuckDB WebMacro: Share and Load your SQL Macros via gists ☆12 · Updated 7 months ago
- Samples, tutorials, and demos for Conversational AI on Google Cloud ☆73 · Updated last month
- Using Amazon Comprehend, Amazon Elasticsearch with Kibana, Amazon S3, Amazon Cognito to search over large number of documents. ☆24 · Updated last year
- aws-solutions-library-samples / guidance-for-text-generation-using-embeddings-from-enterprise-data-on-aws: This Guidance demonstrates question answering using Retrieval Augmented Generation (RAG) with foundation models in Amazon SageMaker JumpS… ☆10 · Updated 8 months ago
- Mastering Long Document Insights: Advanced Summarization with Amazon Bedrock and Anthropic Claude Foundation Model ☆15 · Updated last year
- ☆21 · Updated 4 months ago
- Lecture notes, scripts, and material for the lecture of Selected Statistics Topics in the Autonomous University of Querétaro ☆12 · Updated 8 months ago
- A curated list of Mintlify resources and examples. ☆15 · Updated 8 months ago