One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation
☆53Oct 20, 2025Updated 7 months ago
Alternatives and similar repositories for EVA
Users that are interested in EVA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- SDLG is an efficient method to accurately estimate aleatoric semantic uncertainty in LLMs☆28Jun 7, 2024Updated last year
- ☆44Jul 22, 2024Updated last year
- [ICLR 2025] Official implementation of paper "Dynamic Low-Rank Sparse Adaptation for Large Language Models".☆24Mar 16, 2025Updated last year
- [NAACL 2025] MiLoRA: Harnessing Minor Singular Components for Parameter-Efficient LLM Finetuning☆19May 31, 2025Updated 11 months ago
- Quantification of Uncertainty with Adversarial Models☆29Jul 11, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [ICLR 2025] RaSA: Rank-Sharing Low-Rank Adaptation☆10May 19, 2025Updated last year
- ☆36Aug 23, 2023Updated 2 years ago
- ICLR 2025☆30May 21, 2025Updated last year
- ☆11Jul 20, 2021Updated 4 years ago
- [ICML'25] "Rethinking Addressing in Language Models via Contextualized Equivariant Positional Encoding" by Jiajun Zhu, Peihao Wang, Ruisi…☆15Jun 6, 2025Updated 11 months ago
- ✂️ Sentence segmentation with wtpsplit's state-of-the-art Segment any Text (SaT) models☆39May 2, 2026Updated 3 weeks ago
- ☆25May 6, 2021Updated 5 years ago
- [ICLR 2025 Spotlight] Code release for "Sharpness-Aware Minimization Efficiently Selects Flatter Minima Late In Training"☆18Feb 20, 2025Updated last year
- ☆221Nov 25, 2025Updated 6 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- code for ACL24 "MELoRA: Mini-Ensemble Low-Rank Adapter for Parameter-Efficient Fine-Tuning"☆34Feb 19, 2025Updated last year
- ☆25Apr 3, 2024Updated 2 years ago
- Small python package to measure OCR quality and other related metrics.☆27Feb 19, 2024Updated 2 years ago
- (ACL-IJCNLP 2021) Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models.☆21Jul 13, 2022Updated 3 years ago
- Reinforcement Learning with Pong in the Browser via TensorFlow.js☆17Jan 4, 2023Updated 3 years ago
- Code for the paper "Reactive Exploration to Cope with Non-Stationarity in Lifelong Reinforcement Learning"☆16Jul 4, 2022Updated 3 years ago
- ☆11Jul 21, 2024Updated last year
- ☆11Jun 15, 2019Updated 6 years ago
- Code for paper: A Neural Span-Based Continual Named Entity Recognition Model☆18Dec 11, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆126Jul 6, 2024Updated last year
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…☆29Apr 17, 2024Updated 2 years ago
- ☆36Jun 29, 2022Updated 3 years ago
- PyTorch implementation for our paper "Improving GFlowNets for Text-to-Image Diffusion Alignment."☆31Sep 6, 2024Updated last year
- ☆41Dec 19, 2024Updated last year
- ☆19Apr 16, 2025Updated last year
- [ICML2024] "FedLMT: Tackling System Heterogeneity of Federated Learning via Low-Rank Model Training with Theoretical Guarantees" by Jiaha…☆14Sep 22, 2024Updated last year
- Implementations of different reinforcement learning algorithms☆10Aug 23, 2018Updated 7 years ago
- [NAACL 24 Oral] LoRETTA: Low-Rank Economic Tensor-Train Adaptation for Ultra-Low-Parameter Fine-Tuning of Large Language Models☆39Jan 9, 2025Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆15Apr 29, 2024Updated 2 years ago
- Un template per il piano di lavoro utile per gli studenti della laurea triennale in Computer Science @Unipadova che devono iniziare lo st…☆10May 20, 2018Updated 8 years ago
- Code release for "Generative Modeling of Weights: Generalization or Memorization?"☆21Apr 9, 2026Updated last month
- This repo is the official implementation of the ICLR'23 paper "Towards Robustness Certification Against Universal Perturbations." We calc…☆12Feb 14, 2023Updated 3 years ago
- [NeurIPS 2024] Low rank memory efficient optimizer without SVD☆33Jul 1, 2025Updated 10 months ago
- Notes from the Computational Mathematics course held by professor Antonio Frangioni and professor Federico Poloni at University of Pisa☆12Aug 28, 2021Updated 4 years ago
- [NeurIPS'23 Spotlight] Learning Probabilistic Symmetrization for Architecture Agnostic Equivariance (LPS), in PyTorch☆30Apr 13, 2024Updated 2 years ago