Similarity encoding of dirty categorical variables (strings)
☆20Jan 22, 2019Updated 7 years ago
Alternatives and similar repositories for spark-dirty-cat
Users that are interested in spark-dirty-cat are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is the implementation of the Recursive Nearest (Neighbor) Agglomeration☆11Oct 9, 2020Updated 5 years ago
- Interactive parametric benchmarks in Python☆17Apr 18, 2021Updated 5 years ago
- Conda packages from flit information☆10Dec 10, 2021Updated 4 years ago
- Scripts for paper "Encoding high-cardinality string categorical variables"☆24Sep 11, 2019Updated 6 years ago
- Material for the practical of the DS3 course on "Representing and comparing probabilities with kernels"☆26Feb 5, 2019Updated 7 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Public repo containing code to train, visualize, and evaluate semi-supervised topic models and baselines for regression/classification on…☆11Apr 15, 2020Updated 6 years ago
- RAMP packages: database, backend, frontend, utilities☆15Mar 18, 2024Updated 2 years ago
- Matrix Methods In Data Analysis, Signal Processing, And Machine Learning☆10Sep 2, 2018Updated 7 years ago
- Integration of Pydantic with Kedro.☆12Aug 5, 2024Updated last year
- LaTeX source code for the slides☆24Jul 15, 2021Updated 4 years ago
- Reproducible Self-Publishing - Demo Publications in the Most Common Formats☆14Nov 10, 2023Updated 2 years ago
- Accelerated Confergence for Counterfactual Learning to Rank☆17Jan 21, 2022Updated 4 years ago
- Curated collection of DE1's favorite kedro pieces.☆12Apr 5, 2024Updated 2 years ago
- SXL: Spatially explicit learning of geographic processes with auxiliary tasks☆15Nov 26, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Python module to perform under sampling and over sampling with various techniques.☆38Apr 11, 2026Updated last week
- An example of graph embeddings for wikipedia page recommendations☆11Aug 26, 2021Updated 4 years ago
- An offline evaluation framework for sequence-based recommender systems☆13May 17, 2019Updated 6 years ago
- ☆16May 24, 2023Updated 2 years ago
- PyCodeHash is a generic data and code hashing library that facilitates downstream caching.☆13Jan 26, 2026Updated 2 months ago
- Machine learning with dataframes☆1,591Apr 10, 2026Updated last week
- KEN: Relational Data Embeddings☆34Jan 2, 2024Updated 2 years ago
- Reinforcement Learning Algorithms☆14May 28, 2018Updated 7 years ago
- Multi-Armed Bandit algorithms applied to the MovieLens 20M dataset☆57Aug 9, 2020Updated 5 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ModelXplore, a python based model exploration☆16Jun 1, 2018Updated 7 years ago
- Multimodal data loader compatible with pytorch and tensorflow☆12Aug 14, 2024Updated last year
- Mars craters detection and classification RAMP starting kit☆22Dec 10, 2018Updated 7 years ago
- Simplest example of flask, pandas and plotly.☆16Dec 29, 2015Updated 10 years ago
- A library to compute (multi)fractal dimensions of images written in Python.☆24Feb 15, 2018Updated 8 years ago
- Docker image for the Open Source Routing Machine (OSRM) osrm-backend☆18Nov 10, 2019Updated 6 years ago
- State-of-The-Art Rating-based RECOmmendation system: pytorch lightning implementation☆13Oct 10, 2023Updated 2 years ago
- Closed form Entropic OT for balanced and unbalanced Gaussians☆17Sep 29, 2021Updated 4 years ago
- ARCHIVED Generate Code from BNF Grammars☆12May 10, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 📌 Track & manage metadata, visualize & compare Kedro pipelines in a nice UI.☆18Aug 5, 2024Updated last year
- A toolkit of functions and classes to help build isometric games with Lua☆16Apr 21, 2025Updated 11 months ago
- ☆29Nov 29, 2019Updated 6 years ago
- Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engine☆31Dec 12, 2021Updated 4 years ago
- ☆14Sep 25, 2020Updated 5 years ago
- ☆11Jan 28, 2019Updated 7 years ago
- Install micromamba, and optionally create a base conda environment.☆10Apr 5, 2025Updated last year