Butch78 / 1BillionRowChallenge
I saw this [blog post](https://www.morling.dev/blog/one-billion-row-challenge/) on a Billion Row challenge for Java, so naturally I tried implementing a solution in Python and Rust, mainly using polars.
☆14 · Updated last year
Alternatives and similar repositories for 1BillionRowChallenge
Users that are interested in 1BillionRowChallenge are comparing it to the libraries listed below
- Time series forecasting with DuckDB and Evidence ☆39 · Updated 6 months ago
- Serverless for data practitioners. The fastest ⚡️ way to run your code in the cloud. Effortlessly run scripts, functions, and Jupyter not… ☆39 · Updated last year
- Demo that extends the FastUI example & adds database persistence ☆13 · Updated last year
- LLM plugin for models hosted by Anyscale Endpoints ☆33 · Updated last year
- Create embeddings for LLM using the Nomic API ☆23 · Updated 5 months ago
- ☆24 · Updated last month
- ☆8 · Updated 10 months ago
- rust-for-data ☆45 · Updated last year
- ☆37 · Updated 3 weeks ago
- Adding Marimo to Datasette ☆20 · Updated last month
- Git scrapers for scraping the fediverse ☆16 · Updated this week
- Quick overview of duckdb, pandas and polars through a simple data pipeline. ☆14 · Updated last year
- Versatile Metrics Collection for Python ☆19 · Updated last year
- Access llamafile localhost models via LLM ☆19 · Updated last year
- Python implementation of Age-Partitioned Bloom Filter with S3 periodic backup support. ☆11 · Updated 3 months ago
- Concatenated documentation for use with LLMs ☆31 · Updated this week
- QLLM: A powerful CLI for seamless interaction with multiple Large Language Models. Simplify AI workflows, streamline development, and unl… ☆33 · Updated last month
- Your new best friend. Puppy is the easiest way to get started with modern python on any platform, install packages in virtual environmen… ☆51 · Updated last week
- Run transcriptions using the OpenAI Whisper API ☆24 · Updated 6 months ago
- Have UV deal with all your Jupyter deps. ☆25 · Updated 8 months ago
- Slipstream provides a data-flow model to simplify development of stateful streaming applications. ☆36 · Updated 3 weeks ago
- Cloud Benchmarker automates performance testing of cloud instances, offering insightful charts and tracking over time. ☆35 · Updated last year
- Lightweight, open source, locally-hosted Modern Data Stack ☆14 · Updated last month
- Load GitHub repository contents as LLM fragments ☆37 · Updated this week
- A text-to-SQL prototype on the northwind sqlite dataset ☆12 · Updated 7 months ago
- convert natural language into technical diagrams ☆14 · Updated 5 months ago
- A Python library for real-time PostgreSQL event-driven cache invalidation. ☆22 · Updated 3 weeks ago
- Demo converting streamlit uber nyc rides to use duckdb ☆29 · Updated 2 years ago
- Dockerized FastAPI wrapper around the recognize-anything image recognition models ☆25 · Updated last year
- ☆12 · Updated last year