One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation
☆48Oct 20, 2025Updated 4 months ago
Alternatives and similar repositories for EVA
Users who are interested in EVA are comparing it to the libraries listed below:
- code for EMNLP 2024 paper: Interpreting Arithmetic Mechanism in Large Language Models through Comparative Neuron Analysis☆12Nov 17, 2024Updated last year
- [ICML'25] "Rethinking Addressing in Language Models via Contextualized Equivariant Positional Encoding" by Jiajun Zhu, Peihao Wang, Ruisi…☆14Jun 6, 2025Updated 8 months ago
- SDLG is an efficient method to accurately estimate aleatoric semantic uncertainty in LLMs☆29Jun 7, 2024Updated last year
- German Language Understanding Evaluation Benchmark @NAACL24☆22Dec 11, 2025Updated 2 months ago
- PathPiece tokenizer☆13Nov 10, 2024Updated last year
- Reinforcement Learning with Pong in the Browser via TensorFlow.js☆17Jan 4, 2023Updated 3 years ago
- Differentiable Euler Characteristic Transform☆17Jun 18, 2024Updated last year
- Code for the ACL 2022 paper "Contextual Representation Learning beyond Masked Language Modeling"☆33Oct 23, 2022Updated 3 years ago
- ☆43Jul 22, 2024Updated last year
- [NAACL 2025] MiLoRA: Harnessing Minor Singular Components for Parameter-Efficient LLM Finetuning☆19May 31, 2025Updated 9 months ago
- ✂️ Sentence segmentation with wtpsplit's state-of-the-art Segment any Text (SaT) models☆36Oct 1, 2025Updated 5 months ago
- [ICLR 2025] Official implementation of paper "Dynamic Low-Rank Sparse Adaptation for Large Language Models".☆23Mar 16, 2025Updated 11 months ago
- The official repository for Toxic Commons and Celadon. Toxicity Classification for public domain data.☆22Nov 10, 2024Updated last year
- (ACL-IJCNLP 2021) Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models.☆21Jul 13, 2022Updated 3 years ago
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…☆28Apr 17, 2024Updated last year
- ☆19Jan 3, 2025Updated last year
- Python library to use Pleias-RAG models☆68May 1, 2025Updated 10 months ago
- ☆25Apr 3, 2024Updated last year
- Implementation of the paper "Fine-Tuning Transformers: Vocabulary Transfer" https://arxiv.org/pdf/2112.14569.pdf☆20Dec 28, 2021Updated 4 years ago
- Probabilistic Type Inference using Graph Neural Networks☆50Dec 9, 2022Updated 3 years ago
- [NeurIPS 2024] Low rank memory efficient optimizer without SVD☆33Jul 1, 2025Updated 8 months ago
- ☆23Sep 20, 2024Updated last year
- Quantification of Uncertainty with Adversarial Models☆29Jul 11, 2023Updated 2 years ago
- ☆25May 6, 2021Updated 4 years ago
- ICLR 2025☆31May 21, 2025Updated 9 months ago
- Small python package to measure OCR quality and other related metrics.☆27Feb 19, 2024Updated 2 years ago
- MathFusion: Enhancing Mathematical Problem-solving of LLM through Instruction Fusion (ACL 2025)☆35Jul 16, 2025Updated 7 months ago
- ☆34Aug 23, 2023Updated 2 years ago
- code for ACL24 "MELoRA: Mini-Ensemble Low-Rank Adapter for Parameter-Efficient Fine-Tuning"☆33Feb 19, 2025Updated last year
- PyTorch implementation for our paper "Improving GFlowNets for Text-to-Image Diffusion Alignment."☆31Sep 6, 2024Updated last year
- Open source machine learning for graph-structured data☆30May 9, 2019Updated 6 years ago
- Addressing the problem of predicting crime occurrence based on historic records☆11Nov 27, 2019Updated 6 years ago
- ExDA☆13Sep 25, 2025Updated 5 months ago
- Are Intermediate Layers and Labels Really Necessary? A General Language Model Distillation Method ; GKD: A General Knowledge Distillation…☆33Aug 4, 2023Updated 2 years ago
- ☆40Nov 22, 2025Updated 3 months ago
- Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812)☆35Mar 7, 2025Updated 11 months ago
- [CVPR 2024] Improving language-visual pretraining efficiency by performing cluster-based masking on images.☆31May 16, 2024Updated last year
- Web queries dataset for code search☆32Jun 3, 2023Updated 2 years ago
- [NeurIPS 2025] What Makes a Reward Model a Good Teacher? An Optimization Perspective☆42Sep 18, 2025Updated 5 months ago