Butch78 / 1BillionRowChallenge
I saw this [Blog Post](https://www.morling.dev/blog/one-billion-row-challenge/) on a Billion Row challenge for Java so naturally I tried implementing a solution in Python & Rust using mainly polars
☆14Updated last year
Alternatives and similar repositories for 1BillionRowChallenge:
Users that are interested in 1BillionRowChallenge are comparing it to the libraries listed below
- Time series forecasting with DuckDB and Evidence☆39Updated 4 months ago
- Cloud Benchmarker automates performance testing of cloud instances, offering insightful charts and tracking over time.☆35Updated last year
- ☆37Updated 2 weeks ago
- rust-for-data☆44Updated last year
- Build super simple end-to-end data & ETL pipelines for your vector databases and Generative AI applications☆91Updated 5 months ago
- Create embeddings for LLM using the Nomic API☆22Updated 4 months ago
- API Framework heavily relying on the power of DuckDB and DuckDB extensions. Ready to build performant and cost-efficient APIs on top of B…☆27Updated this week
- Analyzing hacker news in real-time with Bytewax and Proton☆39Updated last year
- Portfolio rebalancing tool for investors☆17Updated 7 months ago
- Ipython notebook copy of Andrej Karpathy's llama2.c☆23Updated last year
- Prototyping a question and answer bot over PDFs☆39Updated last year
- ☆27Updated 6 months ago
- Demo converting streamlit uber nyc rides to use duckdb☆29Updated last year
- Simple Workflow Framework - Hamilton + APScheduler = FlowerPower☆15Updated this week
- ☆24Updated 4 months ago
- Serverless for data practitioners. The fastest ⚡️ way to run your code in the cloud. Effortlessly run scripts, functions, and Jupyter not…☆39Updated last year
- Template to quickstart streaming analytics using Apache Kafka for ingestion, QuestDB for time-series storage and analytics, Grafana for n…☆83Updated 3 months ago
- Ez API, ez life.☆24Updated 5 months ago
- Chrome Extension for exploring Hugging Face datasets 🔎☆49Updated 6 months ago
- Streamable multi-format serialization with schema☆22Updated 3 months ago
- ☆10Updated last year
- 🛠 Self-hosted, fast, and consistent remote configuration for apps.☆14Updated 2 years ago
- A Python library for real-time PostgreSQL event-driven cache invalidation.☆21Updated last month
- Explore Crime in Toronto by Neighbourhood.☆12Updated 11 months ago
- Python Script for Structuring data from SEC Form D filings using DuckDB and Python with a display layer using Evidence☆28Updated 7 months ago
- Web crawler for Burplist, a search engine for craft beers in Singapore☆14Updated this week