☆75Nov 19, 2022Updated 3 years ago
Alternatives and similar repositories for ZerO-initialization
Users that are interested in ZerO-initialization are comparing it to the libraries listed below.
- 1st place solution for the Kwangwoon University Computer Vision AI Competition.☆15Oct 5, 2022Updated 3 years ago
- 🥈12th place solution on G2Net Detecting Continuous Gravitational Waves🥈☆14Jan 4, 2023Updated 3 years ago
- (ICML 2023) Feature learning in deep classifiers through Intermediate Neural Collapse: Accompanying code☆16Jul 27, 2023Updated 2 years ago
- ☆15Nov 20, 2023Updated 2 years ago
- 🎖️ 5th place solution in the Google American Sign Language Fingerspelling Recognition Competition🎖️☆16Sep 19, 2023Updated 2 years ago
- Pytorch implementation of the invertible CQT based on Non-stationary Gabor filters☆36Jun 20, 2023Updated 2 years ago
- A Continual Learning Library in PyTorch and JAX☆13Apr 18, 2023Updated 2 years ago
- Code accompanying our paper "Finding trainable sparse networks through Neural Tangent Transfer" to be published at ICML-2020.☆13Jun 14, 2020Updated 5 years ago
- Successfully training approximations to full-rank matrices for efficiency in deep learning.☆16Jan 5, 2021Updated 5 years ago
- VS Code Extension for Kaggle☆22Dec 9, 2024Updated last year
- Code for the PAPA paper☆27Nov 8, 2022Updated 3 years ago
- ☆42Mar 23, 2023Updated 3 years ago
- Codebase for adaptive continual memory☆14Aug 15, 2023Updated 2 years ago
- A Jax/Flax implementation for Korean-language LLM inference on TPU.☆12Jun 12, 2023Updated 2 years ago
- Open-source docking pipeline leveraging pairwise statistics☆15Jul 26, 2024Updated last year
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135☆19Mar 7, 2023Updated 3 years ago
- RAnking Markers for CEll Segmentation☆12Feb 22, 2022Updated 4 years ago
- A PyTorch wrapper of parallel exclusive scan in CUDA☆12May 25, 2023Updated 2 years ago
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆41Dec 18, 2020Updated 5 years ago
- Serving large language models with transformers☆13Oct 18, 2022Updated 3 years ago
- Guiding Attention for Self-Supervised Learning with Transformers☆12Feb 8, 2023Updated 3 years ago
- ☆33Mar 1, 2023Updated 3 years ago
- ACL 2023☆39Jun 6, 2023Updated 2 years ago
- ☆14Jul 28, 2023Updated 2 years ago
- Code for "Beyond Regular Grids: Fourier-Based Neural Operators on Arbitrary Domains"☆24May 13, 2024Updated last year
- Jax/Flax inference code for StarCoder☆12Jun 12, 2023Updated 2 years ago
- SKT'22 AI Fellowship: development of deep-learning-based colorization for black-and-white images☆13Jun 7, 2023Updated 2 years ago
- [EMNLP 2023] Context Compression for Auto-regressive Transformers with Sentinel Tokens☆25Nov 6, 2023Updated 2 years ago
- TART: A plug-and-play Transformer module for task-agnostic reasoning☆202Jun 22, 2023Updated 2 years ago
- Sequence modeling with Mega.☆303Jan 28, 2023Updated 3 years ago
- ☆12Jul 16, 2024Updated last year
- ☆10Dec 17, 2019Updated 6 years ago
- KUIELAB-MDX-Net got the 2nd place on the Leaderboard A and the 3rd place on the Leaderboard B in the MDX-Challenge ISMIR 2021☆16Mar 13, 2023Updated 3 years ago
- Code for "Approaching Deep Learning through the Spectral Dynamics of Weights"☆13Oct 30, 2024Updated last year
- Implementation of the Adan (ADAptive Nesterov momentum algorithm) Optimizer in Pytorch☆253Sep 1, 2022Updated 3 years ago
- Revisiting Efficient Training Algorithms For Transformer-based Language Models (NeurIPS 2023)☆81Aug 30, 2023Updated 2 years ago
- Submission to the inverse scaling prize☆23Jul 23, 2023Updated 2 years ago
- [COLM 2024] Early Weight Averaging meets High Learning Rates for LLM Pre-training☆20Oct 12, 2024Updated last year
- Implementation of Discrete Key / Value Bottleneck, in Pytorch☆88Jul 9, 2023Updated 2 years ago