One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation
☆53Oct 20, 2025Updated 7 months ago
Alternatives and similar repositories for EVA
Users that are interested in EVA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- SDLG is an efficient method to accurately estimate aleatoric semantic uncertainty in LLMs☆28Jun 7, 2024Updated 2 years ago
- ☆44Jul 22, 2024Updated last year
- [ICLR 2025] Official implementation of paper "Dynamic Low-Rank Sparse Adaptation for Large Language Models".☆25Mar 16, 2025Updated last year
- [NAACL 2025] MiLoRA: Harnessing Minor Singular Components for Parameter-Efficient LLM Finetuning☆20May 31, 2025Updated last year
- ☆36Aug 23, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Code for the ACL 2022 paper "Contextual Representation Learning beyond Masked Language Modeling"☆33Oct 23, 2022Updated 3 years ago
- [ICML'25] "Rethinking Addressing in Language Models via Contextualized Equivariant Positional Encoding" by Jiajun Zhu, Peihao Wang, Ruisi…☆15Jun 6, 2025Updated last year
- ✂️ Sentence segmentation with wtpsplit's state-of-the-art Segment any Text (SaT) models☆39May 2, 2026Updated last month
- ☆25May 6, 2021Updated 5 years ago
- Code for reproducing the results in "How Well do Sparse Imagenet Models Transfer?", presented at CVPR 2022☆10Jun 3, 2022Updated 4 years ago
- [ICLR 2025 Spotlight] Code release for "Sharpness-Aware Minimization Efficiently Selects Flatter Minima Late In Training"☆18Feb 20, 2025Updated last year
- The official repository for Toxic Commons and Celadon. Toxicity Classification for public domain data.☆22May 8, 2026Updated last month
- code for ACL24 "MELoRA: Mini-Ensemble Low-Rank Adapter for Parameter-Efficient Fine-Tuning"☆34Feb 19, 2025Updated last year
- ☆25Apr 3, 2024Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Official Pytorch implementation of Chromatic Graph Transformers☆10Jun 14, 2023Updated 2 years ago
- (ACL-IJCNLP 2021) Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models.☆21Jul 13, 2022Updated 3 years ago
- Python library to use Pleias-RAG models☆72May 8, 2026Updated last month
- Reinforcement Learning with Pong in the Browser via TensorFlow.js☆17Jan 4, 2023Updated 3 years ago
- (UNUSED) Early endpoint Void used to check for updates. Replaced by new build pipeline.☆12Dec 12, 2025Updated 6 months ago
- code for EMNLP 2024 paper: Interpreting Arithmetic Mechanism in Large Language Models through Comparative Neuron Analysis☆12Nov 17, 2024Updated last year
- This is the code of our work CISS Certified Robustness Against Natural Language Attacks by Causal Intervention published on ICML 2022☆11Dec 6, 2022Updated 3 years ago
- Code for paper: A Neural Span-Based Continual Named Entity Recognition Model☆18Dec 11, 2023Updated 2 years ago
- ☆126Jul 6, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…☆29Apr 17, 2024Updated 2 years ago
- ☆36Jun 29, 2022Updated 3 years ago
- ☆11Sep 9, 2024Updated last year
- Code repository of the paper "Alleviating Adversarial Attacks on Variational Autoencoders with MCMC" published at NeurIPS 2022. https://a…☆10Dec 14, 2022Updated 3 years ago
- [NAACL 24 Oral] LoRETTA: Low-Rank Economic Tensor-Train Adaptation for Ultra-Low-Parameter Fine-Tuning of Large Language Models☆39Jan 9, 2025Updated last year
- ☆15Apr 29, 2024Updated 2 years ago
- Holds docker images and run scripts for BobbleBot simulation environment.☆12May 18, 2019Updated 7 years ago
- Un template per il piano di lavoro utile per gli studenti della laurea triennale in Computer Science @Unipadova che devono iniziare lo st…☆10May 20, 2018Updated 8 years ago
- This repo is the official implementation of the ICLR'23 paper "Towards Robustness Certification Against Universal Perturbations." We calc…☆12Feb 14, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆19Nov 30, 2024Updated last year
- [NeurIPS 2024] Low rank memory efficient optimizer without SVD☆33Jul 1, 2025Updated 11 months ago
- Notes from the Computational Mathematics course held by professor Antonio Frangioni and professor Federico Poloni at University of Pisa☆12Aug 28, 2021Updated 4 years ago
- [NeurIPS'23 Spotlight] Learning Probabilistic Symmetrization for Architecture Agnostic Equivariance (LPS), in PyTorch☆30Apr 13, 2024Updated 2 years ago
- Tiled Flash Linear Attention library for fast and efficient mLSTM Kernels.☆89Mar 27, 2026Updated 2 months ago
- ☆14Jul 13, 2022Updated 3 years ago
- Official PyTorch implementation of our paper "Dispersing Prompt Expansion for Class-Agnostic Object Detection" (NeurIPS 2024)☆14Jan 19, 2025Updated last year