Code for "Training models when data doesn't fit in memory" post
☆13Jun 14, 2020Updated 5 years ago
Alternatives and similar repositories for big-data-ml-training
Users who are interested in big-data-ml-training are comparing it to the libraries listed below.
- Curated list of awesome open-source repositories for data pipelining and machine learning in production.☆17Dec 1, 2019Updated 6 years ago
- Toronto Machine Learning MLflow Workshop☆26Jun 7, 2021Updated 4 years ago
- Streamlit dashboard of StarTrek character interactions☆10Dec 4, 2022Updated 3 years ago
- An effort to make a grammar checker using rule-based approach.☆10Apr 21, 2021Updated 5 years ago
- A T5-based sequence generation model for the WikiSQL task, achieving 90.3% on the test data set.☆17Nov 11, 2020Updated 5 years ago
- ☆16Jun 11, 2024Updated last year
- Place for collecting great projects/capstones from previous springboard students.☆54Aug 19, 2021Updated 4 years ago
- A Data Blind Approach to the popular Semantic Parsing task NL2SQL☆17Dec 4, 2020Updated 5 years ago
- This is a repository for Job recommendation system☆12Mar 26, 2018Updated 8 years ago
- Learn Machine Learning using PySpark from scratch☆20Nov 27, 2018Updated 7 years ago
- An example project demonstrating how to perform OCR with multi-modal LLMs☆10Mar 14, 2024Updated 2 years ago
- ☆19Apr 2, 2020Updated 6 years ago
- ☆24Apr 4, 2025Updated last year
- ☆14May 25, 2022Updated 3 years ago
- Examples for using the dedupe library☆10Feb 22, 2016Updated 10 years ago
- ☆25Nov 21, 2022Updated 3 years ago
- ☆12Jul 28, 2020Updated 5 years ago
- Named-Entity Recognition for Norwegian Bokmål and Nynorsk☆12Aug 5, 2019Updated 6 years ago
- ☆19Aug 23, 2024Updated last year
- A Conversational Speech Generation Model☆14Mar 16, 2025Updated last year
- "Forrest Gump" data release IO package☆14May 6, 2021Updated 5 years ago
- A simple machine learning pipeline built using Apache Airflow☆15Nov 22, 2022Updated 3 years ago
- ☆23Feb 27, 2021Updated 5 years ago
- ☆14May 3, 2023Updated 3 years ago
- A simple and fast search engine☆70Jun 21, 2022Updated 3 years ago
- ☆35Oct 9, 2020Updated 5 years ago
- CorrectLy - Open Source Spelling & Grammar correction☆44Dec 7, 2022Updated 3 years ago
- Speech to Speech conversation using the OpenAI RealTime API in Python 🐍☆26Nov 18, 2024Updated last year
- Document Classification on COVID-19 Literature using the LitCovid collection and the Hedwig library.☆16Oct 26, 2024Updated last year
- babyLM WhisBERT code☆19May 27, 2024Updated last year
- ☆20Oct 15, 2024Updated last year
- GUI useful to manually annotate text for Named Entity Recognition purposes☆14Jun 22, 2023Updated 2 years ago
- Code for Stop&Hop, a method for learning to classify irregularly-sampled time series early☆19Oct 3, 2024Updated last year
- Building a multi-label classifier for toxic comment classification☆19Feb 16, 2018Updated 8 years ago
- Probabilistic Key Value pair extraction using word weights from Invoices - Non Searchable PDF☆16Jun 12, 2021Updated 4 years ago
- ☆20Mar 8, 2024Updated 2 years ago
- Transferability of cross-lingual and cross-age speech emotion recognition☆21Jun 30, 2023Updated 2 years ago
- Norwegian Speech Transformer Models☆19Mar 26, 2026Updated last month
- 56 language, 1 model Multilingual ASR☆24Jul 25, 2021Updated 4 years ago