Three little Python scripts for data preparation: remove commas, add commas, concatenate files
☆16Jul 26, 2017Updated 8 years ago
Alternatives and similar repositories for data-prep-minitoolkit
Users that are interested in data-prep-minitoolkit are comparing it to the libraries listed below.
- Personal website for Rachael. Based on https://academicpages.github.io, which is a fork of mmistakes/minimal-mistakes☆12Jan 13, 2023Updated 3 years ago
- PANiC - PAraphrasing Noun-Compounds☆15Apr 6, 2018Updated 7 years ago
- Demo server for TREC LiveQA competition☆11Dec 7, 2016Updated 9 years ago
- Code for the paper "Greed is All You Need: An Evaluation of Tokenizer Inference Methods"☆13Nov 26, 2024Updated last year
- Code for the ILNewsDiff Twitter account☆10May 23, 2023Updated 2 years ago
- Praat script for automatic formant optimization☆15Jan 27, 2023Updated 3 years ago
- Exploring NAs in a small rodent dataset☆16May 19, 2020Updated 5 years ago
- Distinguishing between anime and hentai☆16Jan 29, 2017Updated 9 years ago
- BPE modification that removes intermediate tokens during tokenizer training.☆27Nov 25, 2024Updated last year
- Code for SaGe subword tokenizer (EACL 2023)☆28Nov 30, 2024Updated last year
- The Second Place Solution of Topcoder: Neptune - Facial Marathon Match☆11Aug 20, 2019Updated 6 years ago
- Run delayed and risky choice (DARC) experiments using Bayesian Adaptive Design☆10Sep 12, 2019Updated 6 years ago
- ☆14Oct 21, 2018Updated 7 years ago
- New York Times Word Innovation Types dataset☆21Dec 1, 2020Updated 5 years ago
- ☆44Feb 11, 2026Updated last month
- Content for tutorial on using jspsych at the 2015 Cognitive Science Society meeting☆12Jul 10, 2015Updated 10 years ago
- Source code of the Scalable Brain Atlas website (scalablebrainatlas.incf.org)☆15Mar 18, 2016Updated 10 years ago
- Source code for the paper "Multilingual Neural Machine Translation with Soft Decoupled Encoding"☆29Jun 2, 2021Updated 4 years ago
- tools for phoneticians and phonologists☆32Dec 5, 2018Updated 7 years ago
- ☆15Feb 11, 2019Updated 7 years ago
- Calculate mean of pairwise weighted distances between points using great circle metric.☆11Jul 6, 2023Updated 2 years ago
- Documentation Assets☆12Jul 5, 2023Updated 2 years ago
- ☆14Jun 27, 2019Updated 6 years ago
- 2nd place submission to the MEG decoding competition https://www.kaggle.com/c/decoding-the-human-brain☆18Aug 5, 2014Updated 11 years ago
- Video encoding & classification using tensorflow 2.0☆10Nov 12, 2019Updated 6 years ago
- Code for the TrackML competition on Kaggle☆16Jul 2, 2020Updated 5 years ago
- Useful decorators every Data Scientist should know☆29Nov 30, 2022Updated 3 years ago
- Cookiecutter for community-maintained Jupyter Docker images☆17Mar 2, 2026Updated 3 weeks ago
- A set of general tools.☆15Mar 6, 2026Updated 2 weeks ago
- Fast smoothing spline routine in Fortran 90 usable in python☆15Dec 9, 2015Updated 10 years ago
- Code for the Data Science Bowl 2018 competition on Kaggle☆11Apr 18, 2018Updated 7 years ago
- convert model from mxnet to caffe without losing precision☆19Jul 3, 2018Updated 7 years ago
- Simulations to illustrate what neural networks learn.☆13Jun 12, 2020Updated 5 years ago
- Part of our solution to PLAsTiCC Kaggle challenge☆18Dec 27, 2018Updated 7 years ago
- ☆11Aug 19, 2015Updated 10 years ago
- Predicts brain age, based on data from Freesurfer 5.3☆10Mar 16, 2026Updated last week
- 15th solution for Walmart Recruiting: Trip Type Classification☆12Jan 27, 2016Updated 10 years ago
- 8th place solution for SIIM-ACR Pneumothorax Segmentation competition on Kaggle☆22Dec 24, 2021Updated 4 years ago
- SIGMORPHON 2022 Shared Task on Morpheme Segmentation☆33Mar 26, 2023Updated 2 years ago