A library for reading text files over multiple cores.
☆1,053Sep 5, 2023Updated 2 years ago
Alternatives and similar repositories for paratext
Users that are interested in paratext are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Feather: fast, interoperable binary data frame storage for Python, R, and more powered by Apache Arrow☆2,756Dec 8, 2025Updated 4 months ago
- Visual profiler for Python☆3,980Jul 15, 2022Updated 3 years ago
- Bringing the python data stack to the shell prompt☆792Feb 1, 2021Updated 5 years ago
- Simplified interface for TensorFlow (mimicking Scikit Learn) for Deep Learning☆3,171Aug 30, 2021Updated 4 years ago
- A columnar data container that can be compressed.☆960Oct 27, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A Python stream processing engine modeled after Yahoo! Pipes☆1,601Dec 28, 2021Updated 4 years ago
- ☆1,569Nov 3, 2021Updated 4 years ago
- TrailDB is an efficient tool for storing and querying series of events☆1,091Jan 24, 2021Updated 5 years ago
- dplyr for python☆761Dec 30, 2016Updated 9 years ago
- A data science IDE for Python☆3,897Apr 16, 2018Updated 8 years ago
- Write reproducible reports in Markdown☆440Dec 21, 2018Updated 7 years ago
- Compiled, automatically parallel Python for data science☆489Mar 25, 2017Updated 9 years ago
- Plotting library for IPython/Jupyter notebooks☆3,688Updated this week
- A Python library for creating fast, repeatable and self-documenting data analysis pipelines.☆245Mar 26, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Partitioned storage system based on blosc. **No longer actively maintained.**☆157Nov 21, 2016Updated 9 years ago
- Open source time series library for Python☆2,141Oct 24, 2023Updated 2 years ago
- Tools for exploratory data analysis in Python☆649Aug 5, 2025Updated 8 months ago
- Deep Scalable Sparse Tensor Network Engine (DSSTNE) is an Amazon developed library for building Deep Learning (DL) machine learning (ML) …☆4,396Mar 2, 2020Updated 6 years ago
- Data Migration for the Blaze Project☆1,005Jul 15, 2022Updated 3 years ago
- A Pandas Styler class for making beautiful tables☆414Jan 8, 2023Updated 3 years ago
- A library for defensive data analysis.☆502Jan 6, 2020Updated 6 years ago
- Python helpers for building dashboards using Flask and React☆2,269Jun 2, 2025Updated 11 months ago
- A probabilistic programming language in TensorFlow. Deep generative models, variational inference.☆4,842Mar 18, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.☆10,048Sep 11, 2025Updated 7 months ago
- Declarative visualization library for Python☆10,355Updated this week
- A Python tool that automatically cleans data sets and readies them for analysis.☆1,081May 22, 2019Updated 6 years ago
- Library for fast text representation and classification.☆26,519Mar 22, 2024Updated 2 years ago
- A minimal benchmark for scalability, speed and accuracy of commonly used open source implementations (R packages, Python scikit-learn, H2…☆1,895Sep 16, 2022Updated 3 years ago
- PySpark + Scikit-learn = Sparkit-learn☆1,150Dec 31, 2020Updated 5 years ago
- knitpy: Elegant, flexible and fast dynamic report generation with python☆367Apr 25, 2021Updated 5 years ago
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead☆54Jul 3, 2018Updated 7 years ago
- Open Machine Intelligence Framework for Hackers. (GPU/CPU)☆5,545Mar 20, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆3,171Nov 16, 2021Updated 4 years ago
- Import C++ files directly from Python!☆1,226Apr 17, 2026Updated 2 weeks ago
- Fast, flexible and easy to use probabilistic modelling in Python.☆3,525Mar 6, 2025Updated last year
- Instructions for setting up the software on your deep learning machine☆1,966Aug 23, 2018Updated 7 years ago
- SFrame: Scalable tabular and graph data-structures built for out-of-core data analysis and machine learning.☆902Sep 30, 2018Updated 7 years ago
- Kafka-based Job Queue for Python☆574Feb 4, 2022Updated 4 years ago
- Embrace the APIs of the future. Hug aims to make developing APIs as simple as possible, but no simpler.☆6,893Jul 4, 2024Updated last year