apple / ml-ogen
☆13 Updated last year
Alternatives and similar repositories for ml-ogen
Users who are interested in ml-ogen are comparing it to the repositories listed below.
- Repository for the paper: "TiC-CLIP: Continual Training of CLIP Models" ICLR 2024 ☆106 Updated last year
- ☆14 Updated last year
- ☆29 Updated 2 years ago
- DUET: 2D Structured and Approximately Equivariant Representations, ICML 2023 ☆18 Updated 2 years ago
- Implementation of the paper: "BRAVE: Broadening the visual encoding of vision-language models" ☆25 Updated last week
- ☆59 Updated last year
- This repository contains the official implementation for the ECCV'22 paper, "SPIN: An Empirical Evaluation on Sharing Parameters of Isotr… ☆20 Updated 2 years ago
- Implementation of "the first large-scale multimodal mixture of experts models" from the paper: "Multimodal Contrastive Learning with… ☆29 Updated 2 weeks ago
- [WACV 2025] Official implementation of "Online-LoRA: Task-free Online Continual Learning via Low Rank Adaptation" by Xiwen Wei, Guihong L… ☆51 Updated 2 months ago
- ☆41 Updated last year
- [ACL 2025] Unsolvable Problem Detection: Robust Understanding Evaluation for Large Multimodal Models ☆78 Updated 5 months ago
- Official implementation for Sparse MetA-Tuning (SMAT) ☆18 Updated 3 months ago
- A lightweight implementation of the ICCV 2023 paper "Reinforce Data, Multiply Impact: Improved Model Accuracy and Robustness with Dataset Rei… ☆83 Updated 2 years ago
- PAL: Predictive Analysis & Laws of Large Language Models ☆38 Updated 9 months ago
- Official repo of Progressive Data Expansion: data, code and evaluation ☆29 Updated last year
- Code and benchmark for the paper: "A Practitioner's Guide to Continual Multimodal Pretraining" [NeurIPS'24] ☆60 Updated 10 months ago
- Code for Tangent Model Composition for Ensembling and Continual Fine-tuning (ICCV 2023) and Tangent Transformers for Composition, Privacy… ☆13 Updated last year
- [NeurIPS'24] Multilinear Mixture of Experts: Scalable Expert Specialization through Factorization ☆36 Updated last year
- Official code for "TOAST: Transfer Learning via Attention Steering" ☆186 Updated 2 years ago
- Official code for the ICML 2024 paper "The Entropy Enigma: Success and Failure of Entropy Minimization" ☆53 Updated last year
- Whispering Experts: Neural Interventions for Toxicity Mitigation in Language Models, ICML 2024 ☆22 Updated last year
- Symphony: Interactive Data Widgets (CHI 2022) ☆63 Updated 2 years ago
- Implementation of MambaFormer in PyTorch ++ Zeta from the paper: "Can Mamba Learn How to Learn? A Comparative Study on In-Context Learnin… ☆20 Updated 2 weeks ago
- OmegaViT (ΩViT) is a cutting-edge vision transformer architecture that combines multi-query attention, rotary embeddings, state space mod… ☆14 Updated last week
- ☆56 Updated last year
- [NeurIPS'24] Official PyTorch implementation for the paper "Knowledge Composition using Task Vectors with Learned Anisotropic Scaling" ☆25 Updated 8 months ago
- ☆19 Updated 7 months ago
- Tune-Mode ConvBN Blocks For Efficient Transfer Learning ☆17 Updated 2 years ago
- PyTorch implementation of Soft MoE by Google Brain in "From Sparse to Soft Mixtures of Experts" (https://arxiv.org/pdf/2308.00951.pdf) ☆78 Updated 2 years ago
- ☆19 Updated 8 months ago