Butch78 / 1BillionRowChallengeLinks
I saw this [Blog Post](https://www.morling.dev/blog/one-billion-row-challenge/) on a Billion Row challenge for Java so naturally I tried implementing a solution in Python & Rust using mainly polars
β14Updated last year
Alternatives and similar repositories for 1BillionRowChallenge
Users that are interested in 1BillionRowChallenge are comparing it to the libraries listed below
Sorting:
- a cherishable saas pythonic reflex template for humans, cherryblossom-inspired perfection, as cherishable as cherry blossom πΈβ17Updated last year
 - Open Source Note GPT. Turn your photos and images into text notes (in obsidian)β95Updated 8 months ago
 - Cost Efficient Data Pipelines with DuckDBβ58Updated 5 months ago
 - β15Updated 8 months ago
 - Tools for LLM agents.β60Updated 10 months ago
 - Python port of part of the TypeAgent repoβ262Updated last week
 - A Python-based parallel file chunking system designed for processing large codebases into LLM-friendly chunks.β44Updated 2 months ago
 - Serverless for data practitioners. The fastest β‘οΈ way to run your code in the cloud. Effortlessly run scripts, functions, and Jupyter notβ¦β40Updated last year
 - β20Updated last year
 - β39Updated last year
 - β17Updated 2 years ago
 - Deploy production grade application to AWS and GCP in minutes.β62Updated 2 months ago
 - A dev container with ollama and ollama examples with the Python OpenAI SDKβ61Updated last year
 - β67Updated 11 months ago
 - Quick overview of duckdb, pandas and polars through a simple data pipeline.β13Updated 2 years ago
 - Heimdall is a data orchestration and job execution platformβ62Updated this week
 - A semantic search system for Airbnb listings in Stockholm, built with Superlinked and Qdrant. It leverages multi-attribute vector search β¦β24Updated 4 months ago
 - β86Updated 2 months ago
 - Python SDK for Inngest: Durable functions and workflows in Python, hosted anywhereβ143Updated this week
 - Duckdb extension for parsing the metadata and contents of the embedded data mode in PowerBI pbix filesβ30Updated last week
 - β30Updated 9 months ago
 - β46Updated last year
 - Run transcriptions using the OpenAI Whisper APIβ26Updated last year
 - β51Updated last week
 - β92Updated this week
 - Intro to Polars Tutorialβ22Updated 2 years ago
 - Production-ready Python library for multi-provider LLM orchestrationβ38Updated 3 weeks ago
 - AutoStreamlit Studio is an intelligent assistant designed to streamline the creation of Streamlit applications. Whether you're a seasonedβ¦β18Updated last year
 - Notebooks that appear in our YouTube seriesβ26Updated 2 months ago
 - Python Script for Structuring data from SEC Form D filings using DuckDB and Python with a display layer using Evidenceβ28Updated last year