affjljoo3581 / deit3-jax
Jax/Flax implementation of DeiT and DeiT-III (ViT)
☆18 · Updated 9 months ago
Alternatives and similar repositories for deit3-jax
Users that are interested in deit3-jax are comparing it to the libraries listed below
- 🧪 Categorical TabNet research part 🧪 ☆13 · Updated last year
- 12th place solution on G2Net Detecting Continuous Gravitational Waves ☆14 · Updated 2 years ago
- Coverist - an AI service for generating book covers ☆12 · Updated 3 years ago
- 1st place solution for the Kwangwoon University Computer Vision AI Competition. ☆15 · Updated 3 years ago
- A Jax/Flax implementation for Korean LLM inference on TPU. ☆12 · Updated 2 years ago
- [2022.05.16 ~ 2022.06.10] Clear photos without fine dust - Boostcamp AI Tech 3rd cohort final project ☆14 · Updated 3 years ago
- Reproduction of Vision Transformer in TensorFlow 2. Train from scratch and fine-tune. ☆48 · Updated 3 years ago
- Evaluating Multimodal Generative AI with Korean Educational Standards, NAACL 2025. ☆24 · Updated 5 months ago
- Read one paper every day (weekdays only) ☆56 · Updated 4 years ago
- Model Stock: All we need is just a few fine-tuned models ☆125 · Updated 2 months ago
- Simple implementation of muP, based on Spectral Condition for Feature Learning. The implementation is SGD only; don't use it for Adam ☆85 · Updated last year
- 5th place solution in the Google American Sign Language Fingerspelling Recognition Competition ☆17 · Updated 2 years ago
- ☆60 · Updated last month
- Learning Features with Parameter-Free Layers, ICLR 2022 ☆84 · Updated 2 years ago
- A hackable, simple, and research-friendly GRPO Training Framework with high-speed weight synchronization in a multinode environment. ☆31 · Updated last month
- KAIST AI605 Deep Learning for NLP ☆31 · Updated 3 years ago
- SKT'22 AI Fellowship, development of deep-learning-based black-and-white image colorization technology ☆13 · Updated 2 years ago
- ☆12 · Updated 3 years ago
- Deploy KoGPT with Triton Inference Server ☆14 · Updated 2 years ago
- Serving Example of CodeGen-350M-Mono-GPTJ on Triton Inference Server with Docker and Kubernetes ☆20 · Updated 2 years ago
- Inverse DALL-E for Optical Character Recognition ☆38 · Updated 3 years ago
- LG-AI-Challenge 2022 1st place solution. ☆13 · Updated 2 years ago
- Official repository for Automated Learning Rate Scheduler for Large-Batch Training (8th ICML Workshop on AutoML) ☆40 · Updated 3 years ago
- [ICLR 2023] RC-MAE ☆53 · Updated last year
- Getting GPU Util 99% ☆33 · Updated 4 years ago
- Serving large language models with transformers ☆13 · Updated 3 years ago
- A PyTorch implementation of Luna: Linear Unified Nested Attention ☆41 · Updated 4 years ago
- [ICLR 2023] Rarity Score: A New Metric to Evaluate the Uncommonness of Synthesized Images ☆66 · Updated 3 years ago
- My collection of machine learning papers ☆291 · Updated 2 years ago
- Bilinear Attention Networks for Korean Visual Question Answering ☆24 · Updated last year