rberenguel/pyspark-arrow-pandas

Presentation about PySpark and how Arrow makes it faster

☆22

Alternatives and similar repositories for pyspark-arrow-pandas

Users who are interested in pyspark-arrow-pandas are comparing it to the repositories listed below.

conda-forge / arrow-cpp-feedstock
A conda-smithy repository for arrow-cpp.
☆12 · Updated this week
singingwolfboy / webhookdb
Replicates GitHub's database via HTTP webhooks
☆16 · Oct 15, 2015 · Updated 10 years ago
drabastomek / learningPySpark_video
Learning PySpark video series
☆11 · Mar 5, 2018 · Updated 8 years ago
joelparkerhenderson / social-value-orientation
Social value orientation (SVO) notes for pro-social pro-self concepts
☆13 · Apr 14, 2025 · Updated last year
tecton-ai / apply-workshop-2022
☆17 · Aug 5, 2022 · Updated 3 years ago
jupyterhub / simpervisor
Simple Python3 Supervisor library
☆14 · Jul 1, 2026 · Updated 2 weeks ago
RahulBhalley / favorite-research-papers
Listing my favorite research papers 📝 from different fields as I read them.
☆10 · Oct 17, 2019 · Updated 6 years ago
dedupeio / doublemetaphone
Python wrapper for a C++ Double Metaphone
☆15 · Jan 12, 2026 · Updated 6 months ago
xhochy / libfuzzymatch
C++11 library for fast fuzzy searching
☆15 · Jun 9, 2015 · Updated 11 years ago
xhochy / fletcher
Pandas ExtensionDType/Array backed by Apache Arrow
☆232 · Feb 22, 2023 · Updated 3 years ago
altendky / graham
Graham, making s'mores with attrs and marshmallow.
☆12 · Sep 24, 2024 · Updated last year
datasciencecampus / pprl_toolkit
The privacy-preserving record linkage toolkit: a proof-of-concept public demo of next-gen data linkage techniques.
☆16 · May 22, 2024 · Updated 2 years ago
yennanliu / spark-etl-pipeline
Various data stream/batch process demo with Apache Scala Spark 🚀
☆12 · Feb 28, 2020 · Updated 6 years ago
lihaoyi / autojit
☆13 · Mar 23, 2019 · Updated 7 years ago
CSLDepend / exploits
We store attacks and exploits that we've found useful in our research
☆13 · Jun 4, 2015 · Updated 11 years ago
OpenITI / openiti
python library
☆13 · Nov 25, 2025 · Updated 7 months ago
SrdjanLL / SensibleNILM
Energy disaggregation - Deep learning approach.
☆11 · Feb 2, 2018 · Updated 8 years ago
waynegraham / photoscan_scripts
Scripts for the Python API in PhotoScan
☆17 · Sep 1, 2015 · Updated 10 years ago
ONSdigital / ons-crow
This repository contains CROW, the Clerical Resolution Online Widget, an open-source project designed to help data linkers with their cle…
☆11 · Updated this week
tcharding / self_learning
Text books and programming problem websites
☆12 · Apr 22, 2026 · Updated 2 months ago
serebrov / udacity_hadoop_intro
Notes and tasks code for Cloudera / Udacity hadoop course
☆16 · Jul 31, 2015 · Updated 10 years ago
ajinabraham / Xenotix-xBOT
Xenotix xBOT is a Cross Platform PoC Bot that abuses certain Google Services to implement its C&C
☆28 · Jun 18, 2018 · Updated 8 years ago
Query-farm / pyroscope
DuckDB Pyroscope Extension for Continuous Profiling
☆21 · Feb 18, 2026 · Updated 5 months ago
brianray / bokeh_roc_slider
Receiver operating characteristic chart in Bokeh
☆14 · Sep 2, 2019 · Updated 6 years ago
manovotny / chrome-developer-tools-skins
Custom skins for Chrome Developer Tools
☆21 · May 30, 2013 · Updated 13 years ago
alan-turing-institute / defoe
Code to analyse books and newspapers data using Apache Spark.
☆16 · Feb 11, 2022 · Updated 4 years ago
bbvadata / bbvadata_papers
☆12 · Apr 2, 2018 · Updated 8 years ago
databricks-industry-solutions / auto-data-linkage
Low effort linking and easy de-duplication. Databricks ARC provides a simple, automated, lakehouse integrated entity resolution solution …
☆54 · Oct 28, 2024 · Updated last year
OpenTechSchool / sql-tutorial
Beginners' tutorial on how to extract information from databases with SQL
☆22 · Aug 29, 2017 · Updated 8 years ago
sayurin / optipng-zopfli
☆18 · Mar 4, 2013 · Updated 13 years ago
softapalvelin / kobo-grive-sync
Scripts for automatic Google Drive synchronization on Kobo Touch. Depends on kobo-grive.
☆18 · Jan 23, 2014 · Updated 12 years ago
xhochy / scrobbler
(RETIRED) Scrobbler is a wrapper for the audioscrobbler (last.fm) web services.
☆21 · Apr 7, 2012 · Updated 14 years ago
jake-swanson / isituppy
Custom slash command to use isitup.org to check if a site is up from within Slack
☆12 · Mar 29, 2016 · Updated 10 years ago
PublicSectorAPI / listing
A crowdsourced list of public sector API
☆12 · May 8, 2015 · Updated 11 years ago
Hotell / skate-starter
skatejs 5 + typescript + webpack
☆11 · May 26, 2020 · Updated 6 years ago
data-engineering-collective / minimalkv
A minimal key-value store interface for binary data (maintained fork of simplekv).
☆17 · Updated this week
lcdm-uiuc / ML-SQL
The ML-SQL repository was created to explore and research a SQL-like language for Machine Learning.
☆16 · Sep 1, 2016 · Updated 9 years ago
CartoDB / bigmetadata
[ARCHIVED] Historical bigmetadata project - no longer maintained
☆43 · Jan 5, 2026 · Updated 6 months ago
tomahawk-player / tomahawk-contrib
Third-party contributions to Tomahawk
☆21 · Feb 5, 2016 · Updated 10 years ago