naver-ai / burn
Official PyTorch Implementation of Unsupervised Representation Learning for Binary Networks by Joint Classifier Training (CVPR 2022)
☆11Updated 3 years ago
Alternatives and similar repositories for burn
Users who are interested in burn are comparing it to the repositories listed below
- Official repository for Automated Learning Rate Scheduler for Large-Batch Training (8th ICML Workshop on AutoML)☆40Updated 4 years ago
- ☆18Updated 2 years ago
- Learning Features with Parameter-Free Layers, ICLR 2022☆84Updated 2 years ago
- ☆14Updated 3 years ago
- [ICLR 2023] RC-MAE☆53Updated last year
- Official repository for Fourier model that can generate periodic signals☆10Updated 3 years ago
- [ECCV2024][ICCV2023] Official PyTorch implementation of SeiT++ and SeiT☆56Updated last year
- Code for ACL 2023 Oral Paper: ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning☆12Updated 3 months ago
- ☆38Updated 2 years ago
- ☆19Updated 2 years ago
- A PyTorch Implementation of the Luna: Linear Unified Nested Attention☆41Updated 4 years ago
- ImageNet-12k subset of ImageNet-21k (fall11)☆21Updated 2 years ago
- Code for the paper "What Makes Better Augmentation Strategies? Augment Difficult but Not too Different" (ICLR 22)☆12Updated 2 years ago
- Inverse DALL-E for Optical Character Recognition☆38Updated 3 years ago
- [ACL 2023] Gradient Ascent Post-training Enhances Language Model Generalization☆29Updated last year
- ☆14Updated 4 years ago
- The official code repository for MetricMT - a reward optimization method for NMT with learned metrics☆25Updated 4 years ago
- Implementation of TableFormer, Robust Transformer Modeling for Table-Text Encoding, in Pytorch☆39Updated 3 years ago
- [ICLR 2025 Oral] Knowledge Entropy Decay during Language Model Pretraining Hinders New Knowledge Acquisition☆16Updated last year
- [ICLR 2021] "Long Live the Lottery: The Existence of Winning Tickets in Lifelong Learning" by Tianlong Chen*, Zhenyu Zhang*, Sijia Liu, S…☆25Updated 3 years ago
- Calculating Expected Time for training LLM.☆38Updated 2 years ago
- All-in-one repository for Fine-tuning & Pretraining (Large) Language Models☆15Updated 2 years ago
- [COLM 2024] Early Weight Averaging meets High Learning Rates for LLM Pre-training☆18Updated last year
- A hackable, simple, and research-friendly GRPO Training Framework with high speed weight synchronization in a multinode environment.☆35Updated 3 months ago
- codebase for the SIMAT dataset and evaluation☆38Updated 3 years ago
- Code for text augmentation method leveraging large-scale language models☆62Updated 3 years ago
- Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012☆49Updated 3 years ago
- Official repository of "Distort, Distract, Decode: Instruction-Tuned Model Can Refine its Response from Noisy Instructions", ICLR 2024 Sp…☆21Updated last year
- Code Implementation for "NASH: A Simple Unified Framework of Structured Pruning for Accelerating Encoder-Decoder Language Models" (EMNLP …☆16Updated 2 years ago
- Stochastic Optimization for Global Contrastive Learning without Large Mini-batches☆20Updated 2 years ago