BFloat16 Fused Adam Operator for PyTorch
☆19Nov 16, 2024Updated last year
Alternatives and similar repositories for bf16_fused_adam
Users that are interested in bf16_fused_adam are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- One RL Platform is all you need -- Event-driven fully distributed reinforcement learning framework☆21Oct 25, 2023Updated 2 years ago
- ☆20Apr 1, 2024Updated 2 years ago
- Code for paper Evolving Connectivity for Spiking Neural Networks☆22Oct 23, 2023Updated 2 years ago
- ☆12Dec 22, 2024Updated last year
- ☆16Feb 6, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆12Jun 21, 2024Updated last year
- Sapphire Yours - A 2D puzzle game☆10Nov 12, 2022Updated 3 years ago
- Two implementations of ZeRO-1 optimizer sharding in JAX☆14Jun 11, 2023Updated 2 years ago
- Pruning is all you need (hopefully)☆12Sep 7, 2022Updated 3 years ago
- ☆26Feb 20, 2026Updated last month
- ☆12Apr 26, 2024Updated last year
- Variational Autoencoder with non-euclidean (hyperbolic) latent space☆12Nov 25, 2022Updated 3 years ago
- An attempt at a SVD inpainting pipeline☆50Dec 24, 2023Updated 2 years ago
- A repo based on XiLin Li's PSGD repo that extends some of the experiments.☆14Oct 7, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆21Sep 3, 2024Updated last year
- Mixed precision training from scratch with Tensors and CUDA☆29May 14, 2024Updated last year
- Transformers components but in Triton☆34May 9, 2025Updated 11 months ago
- ☆16Dec 30, 2024Updated last year
- Multipack distributed sampler for fast padding-free training of LLMs☆207Aug 10, 2024Updated last year
- FFT for PyCuda and PyOpenCL. The package is deprecated and its functionality is merged into Reikna.☆37Feb 17, 2014Updated 12 years ago
- ☆17Aug 16, 2019Updated 6 years ago
- OLD REPOSITORY, new one at repo.rumpkernel.org/rumprun☆44Apr 13, 2015Updated 11 years ago
- Fork of HyenaDNA, a long-range genomic foundation model built with Hyena☆10Aug 14, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Self-contained Python lib with zero-dependencies that give you a unified device properties for gpu, cpu, and npu. No more calling separat…☆14Mar 30, 2026Updated 2 weeks ago
- Using FlexAttention to compute attention with different masking patterns☆47Sep 22, 2024Updated last year
- ☆16Jan 3, 2023Updated 3 years ago
- ☆18Aug 24, 2024Updated last year
- SDXL GPU cluster scripts☆16Oct 28, 2023Updated 2 years ago
- Model Predictive Controller for Autonomous Driving implemented using ROS and C++☆103Jun 14, 2020Updated 5 years ago
- A C++ fork/rewrite of the smhasher project to bring Murmurhash v.3 to the Linux shell and to the PHP scripting language.☆21Jul 25, 2011Updated 14 years ago
- JSON encoder and decoder for python written in C/C++☆10Jan 22, 2024Updated 2 years ago
- An introduction to LLM Sampling☆80Dec 15, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆10Mar 19, 2023Updated 3 years ago
- Visual Question Answering System☆11Nov 13, 2019Updated 6 years ago
- Inference code for LLaMA models☆41Mar 13, 2023Updated 3 years ago
- Source code for "SimCKP: Simple Contrastive Learning of Keyphrase Representations", Findings of EMNLP 2023☆12Jun 20, 2025Updated 9 months ago
- ☆14May 14, 2024Updated last year
- A device-independent random number generator☆18Apr 27, 2024Updated last year
- Minimilast Redis Client for Erlang☆19Jul 15, 2013Updated 12 years ago