☆75Nov 19, 2022Updated 3 years ago
Alternatives and similar repositories for ZerO-initialization
Users that are interested in ZerO-initialization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 광운대학교 컴퓨터 비전 AI 경진대회 1등 솔루션입니다.☆15Oct 5, 2022Updated 3 years ago
- ☆16Nov 20, 2023Updated 2 years ago
- 🎖️ 5th place solution in the Google American Sign Language Fingerspelling Recognition Competition🎖️☆16Sep 19, 2023Updated 2 years ago
- Pytorch implementation of the invertible CQT based on Non-stationary Gabor filters☆36Jun 25, 2026Updated last week
- Binary Neural Network-based COVID-19 Face-Mask Wear and Positioning Predictor on Edge Devices☆12Jul 1, 2021Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A Continual Learning Library in PyTorch and JAX☆13Apr 18, 2023Updated 3 years ago
- Code accompanying our paper "Finding trainable sparse networks through Neural Tangent Transfer" to be published at ICML-2020.☆13Jun 14, 2020Updated 6 years ago
- Successfully training approximations to full-rank matrices for efficiency in deep learning.☆16Jan 5, 2021Updated 5 years ago
- Code for the paper: How Much Context Does My Attention-Based ASR System Need?☆11Updated this week
- VS Code Extension for Kaggle☆23Dec 9, 2024Updated last year
- Code for the PAPA paper☆27Nov 8, 2022Updated 3 years ago
- Codebase for adaptive continual memory☆15Aug 15, 2023Updated 2 years ago
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135☆19Mar 7, 2023Updated 3 years ago
- Triton Implementation of HyperAttention Algorithm☆48Dec 11, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A PyTorch wrapper of parallel exclusive scan in CUDA☆12May 25, 2023Updated 3 years ago
- ☆16Jun 13, 2022Updated 4 years ago
- ☆47Oct 11, 2023Updated 2 years ago
- ☆14Jul 28, 2023Updated 2 years ago
- Parallelizing non-linear sequential models over the sequence length☆57Jun 23, 2025Updated last year
- SKT'22 AI Fellowship, 딥러닝 기반 흑백 이미지 컬러화 기술 개발☆13Jun 7, 2023Updated 3 years ago
- [EMNLP 2023]Context Compression for Auto-regressive Transformers with Sentinel Tokens☆25Nov 6, 2023Updated 2 years ago
- TART: A plug-and-play Transformer module for task-agnostic reasoning☆202Jun 22, 2023Updated 3 years ago
- Sequence modeling with Mega.☆303Jan 28, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆13Jul 16, 2024Updated last year
- code for the paper "A Statistical Framework for Low-bitwidth Training of Deep Neural Networks"☆29Oct 31, 2020Updated 5 years ago
- ☆10Dec 17, 2019Updated 6 years ago
- Implementation of the Adan (ADAptive Nesterov momentum algorithm) Optimizer in Pytorch☆253Sep 1, 2022Updated 3 years ago
- Revisiting Efficient Training Algorithms For Transformer-based Language Models (NeurIPS 2023)☆81Aug 30, 2023Updated 2 years ago
- JAX/Flax implementation of the Hyena Hierarchy☆35Apr 27, 2023Updated 3 years ago
- Kernel k Nearest Neighbors in R☆17Updated this week
- Submission to the inverse scaling prize☆23Jul 23, 2023Updated 2 years ago
- Code base for SRSGD.☆27Mar 5, 2020Updated 6 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆69Nov 6, 2023Updated 2 years ago
- [COLM 2024] Early Weight Averaging meets High Learning Rates for LLM Pre-training☆19Oct 12, 2024Updated last year
- Implementation of Discrete Key / Value Bottleneck, in Pytorch☆88Jul 9, 2023Updated 2 years ago
- More than Just Words: Modeling Non-textual Characteristics of Podcasts☆26Nov 6, 2019Updated 6 years ago
- A library for unit scaling in PyTorch☆134Jul 11, 2025Updated 11 months ago
- ☆65Mar 22, 2023Updated 3 years ago
- Sancho McCann's PhD Thesis Research Code☆25Oct 12, 2017Updated 8 years ago