IAMAl / PyTorch4M1Links
PyTorch's full-scratch build and install for Apple Silicon
☆29Updated last year
Alternatives and similar repositories for PyTorch4M1
Users that are interested in PyTorch4M1 are comparing it to the libraries listed below
Sorting:
- Implements MLP-Mixer (https://arxiv.org/abs/2105.01601) with the CIFAR-10 dataset.☆56Updated 3 years ago
- Unofficial PyTorch implementation of Fastformer based on paper "Fastformer: Additive Attention Can Be All You Need"."☆134Updated 3 years ago
- Plugin for deploying MLflow models to TorchServe☆111Updated 2 years ago
- Fourth place solution to the "OpenVaccine: COVID-19 mRNA Vaccine Degradation Prediction" organized by Stanford University and Kaggle☆20Updated 4 years ago
- ☆74Updated 2 years ago
- PyTorch implementation of GLOM☆22Updated 3 years ago
- Implementation of TableFormer, Robust Transformer Modeling for Table-Text Encoding, in Pytorch☆39Updated 3 years ago
- This repository contains notebooks showing how to perform mixed precision training in tf.keras 2.0☆12Updated 5 years ago
- Productionize machine learning predictions, with ONNX or without☆65Updated last year
- This project shows how to derive the total number of training tokens from a large text dataset from 🤗 datasets with Apache Beam and Data…☆27Updated 2 years ago
- A deep learning library based on Pytorch focussed on low resource language research and robustness☆70Updated 3 years ago
- A dashboard for exploring timm learning rate schedulers☆19Updated 7 months ago
- a lightweight transformer library for PyTorch☆72Updated 3 years ago
- Hierarchical Attention Transformers (HAT)☆56Updated last year
- ☆66Updated 3 months ago
- GPT, but made only out of MLPs☆89Updated 4 years ago
- Cyclemoid implementation for PyTorch☆90Updated 3 years ago
- Implementation of N-Grammer, augmenting Transformers with latent n-grams, in Pytorch☆76Updated 2 years ago
- Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012☆49Updated 3 years ago
- Code for scaling Transformers☆26Updated 4 years ago
- Implementation of Memory-Compressed Attention, from the paper "Generating Wikipedia By Summarizing Long Sequences"☆71Updated 2 years ago
- Experiment management with Hydra and MLflow☆13Updated 4 years ago
- Core Utilities for NVIDIA Merlin☆19Updated 11 months ago
- Code for "Re-evaluating Word Mover’s Distance" (ICML 2022)☆39Updated 3 years ago
- PyTorch implementation of FNet: Mixing Tokens with Fourier transforms☆28Updated 4 years ago
- Implementation of N-Grammer in Flax☆17Updated 2 years ago
- Examples of using PyTorch hooks, as covered in my YouTube tutorial video.☆34Updated last year
- Experiments for the blog post "No, We Don't Have to Choose Batch Sizes As Powers Of 2"☆20Updated 3 years ago
- Code for the anonymous submission "Cockpit: A Practical Debugging Tool for Training Deep Neural Networks"☆31Updated 4 years ago
- This repository hosts code for converting the original Vision Transformer models (JAX) to TensorFlow.☆33Updated 3 years ago