Butch78 / 1BillionRowChallenge
I saw this [blog post](https://www.morling.dev/blog/one-billion-row-challenge/) on a Billion Row Challenge for Java, so naturally I tried implementing a solution in Python and Rust, mainly using polars.
☆14 Updated 10 months ago
Related projects
Alternatives and complementary repositories for 1BillionRowChallenge
- Cloud Benchmarker automates performance testing of cloud instances, offering insightful charts and tracking over time. ☆33 Updated last year
- ☆36 Updated last month
- Time series forecasting with DuckDB and Evidence ☆36 Updated 3 weeks ago
- Serverless for data practitioners. The fastest ⚡️ way to run your code in the cloud. Effortlessly run scripts, functions, and Jupyter not… ☆39 Updated 9 months ago
- Access llamafile localhost models via LLM ☆14 Updated 7 months ago
- Connect to your customer data using any LLM and gain actionable insights. IdentityRAG creates a single comprehensive customer 360 view (g… ☆25 Updated 2 weeks ago
- A dev container with ollama and ollama examples with the Python OpenAI SDK ☆42 Updated 3 months ago
- A Python library for real-time PostgreSQL event-driven cache invalidation. ☆18 Updated 7 months ago
- Example usages of the Scaffoldly toolchain. ☆13 Updated last week
- A CLI tool for managing OpenAI batch processing jobs with ease. ☆27 Updated 2 months ago
- Find Python Packages on PyPI with the help of vector embeddings ☆42 Updated 4 months ago
- DuckDB Community Extension to prompt LLMs from SQL ☆22 Updated last week
- Streamable multi-format serialization with schema ☆23 Updated 2 months ago
- Object-oriented data visualization library with integrated data analysis and style management ☆13 Updated this week
- Prototyping a question and answer bot over PDFs ☆38 Updated last year
- Compression suite for data frames and tabular data files (CSV, Excel, etc.), using the LZHW algorithm. ☆30 Updated 3 months ago
- ☆26 Updated 2 months ago
- ☆21 Updated 3 weeks ago
- A simple and streamlined Python script to extract and filter links from a remote HTML resource. ☆24 Updated this week
- Ez API, ez life. ☆23 Updated last month
- GPT-4o-Realtime based AI Podcast Generator ☆19 Updated last month
- Have a function doing stuff too long? Just distribute! ☆15 Updated 4 months ago
- Ssebowa is a free and open-source Python library that provides generative AI models. ☆14 Updated 9 months ago
- Explore Crime in Toronto by Neighbourhood. ☆12 Updated 7 months ago
- 🛠 Self-hosted, fast, and consistent remote configuration for apps. ☆12 Updated 2 years ago
- Create embeddings for LLM using the Nomic API ☆16 Updated 7 months ago
- recipes for BASH, Docker and more ☆13 Updated 6 months ago
- Tailored cloud solutions based on use case, cost, and preferences using natural language with Agentic AI to research, design, price, diag… ☆18 Updated 3 weeks ago
- CLI for running files through AWS Textract ☆53 Updated 7 months ago
- pglineage is a tool to create data flow diagrams for PostgreSQL by analyzing SQL ☆15 Updated 7 months ago