In-depth code associated with my Medium blog post, "How to Load PyTorch Models 340 Times Faster with Ray"
☆28Sep 2, 2022Updated 3 years ago
Alternatives and similar repositories for zero-copy-model-loading
Users that are interested in zero-copy-model-loading are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Lua Toolbox website (deprecated)☆22Dec 22, 2016Updated 9 years ago
- Bringing flamegraphs into jupyter notebooks for performance diagnosis☆13Dec 16, 2019Updated 6 years ago
- Serverless Python with Ray☆59Oct 14, 2022Updated 3 years ago
- Rayvens makes it possible for data scientists to access hundreds of data services within Ray with little effort.☆50Nov 29, 2022Updated 3 years ago
- Triton Server Component for lightning.ai☆14Feb 15, 2023Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Solution Service Architecture☆25Jun 5, 2024Updated last year
- Unix stream tool using for Javascript and JSON☆16Feb 26, 2011Updated 15 years ago
- A convenient package for the lazy torch programmer to leave all your :cuda() calls as-is when running on CPU☆14Apr 27, 2015Updated 11 years ago
- ☆15Apr 9, 2022Updated 4 years ago
- Building a text generation web app using OpenAI's GPT-2 and Panel.☆10Nov 5, 2019Updated 6 years ago
- Quickest way to share everything about your research within a single app☆16Feb 1, 2024Updated 2 years ago
- ☆10Dec 12, 2023Updated 2 years ago
- Notebooks for the PyTorch course by @deeplizard.☆16Dec 3, 2019Updated 6 years ago
- Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling"☆16Nov 11, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- TopK Algorithms Benchmark☆10Jul 16, 2019Updated 6 years ago
- Finding Overlapping Communities in Social Networks☆10Feb 12, 2014Updated 12 years ago
- UCSanDiegoX: DSE220x : Machine Learning Fundamentals Course☆16Oct 11, 2018Updated 7 years ago
- C++ implementation of a poker equity calculator as montecarlo simulation☆18Dec 5, 2019Updated 6 years ago
- Entity Resolution☆16Mar 5, 2024Updated 2 years ago
- ☆13May 8, 2023Updated 2 years ago
- Some microbenchmarks and design docs before commencement☆11Feb 1, 2021Updated 5 years ago
- Implementation of layer diffuse inference using refiners☆25Apr 25, 2024Updated 2 years ago
- Accelerating SPARQL Queries by Exploiting Hash-based Locality and Adaptive Partitioning☆10Jan 21, 2016Updated 10 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Torch test module☆22Mar 8, 2016Updated 10 years ago
- Middlewares for chi that help you survive running out of memory☆11Dec 9, 2015Updated 10 years ago
- This project consists of implementations of several kNN algorithms for road networks (aka finding nearest points of interest) and the exp…☆11Apr 30, 2024Updated 2 years ago
- Bitonic sort using simd (avx/neon) instructions☆17Mar 14, 2022Updated 4 years ago
- Rust vector types without dependencies that enable quick lookups☆19Sep 25, 2020Updated 5 years ago
- Implementation of Parallel Breadth-First Search on Distributed Memory Systems☆10Dec 15, 2015Updated 10 years ago
- How to install CUDA & cuDNN for Machine Learning☆20Jul 1, 2024Updated last year
- Optimizing data-intensive systems in disaggregated data centers