rjha18 / vec2vec
☆233 Updated 3 weeks ago
Alternatives and similar repositories for vec2vec
Users who are interested in vec2vec are comparing it to the repositories listed below.
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources ☆146 Updated 3 months ago
- ☆141 Updated last week
- PyTorch library for Active Fine-Tuning ☆91 Updated last week
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆60 Updated last year
- Getting crystal-like representations with harmonic loss ☆194 Updated 5 months ago
- [ICLR 2025] Monet: Mixture of Monosemantic Experts for Transformers ☆70 Updated 2 months ago
- Code for ExploreTom ☆86 Updated 2 months ago
- ICLR 2025 - official implementation for "I-Con: A Unifying Framework for Representation Learning" ☆111 Updated 2 months ago
- ☆82 Updated last year
- ☆107 Updated 7 months ago
- ☆80 Updated last week
- ☆53 Updated 9 months ago
- Official implementation of MAIA, A Multimodal Automated Interpretability Agent ☆88 Updated 2 months ago
- One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation ☆42 Updated 11 months ago
- Official implementation of "BERTs are Generative In-Context Learners" ☆32 Updated 6 months ago
- ☆40 Updated last year
- Implementation of the BatchTopK activation function for training sparse autoencoders (SAEs) ☆47 Updated last month
- Source code for the collaborative reasoner research project at Meta FAIR. ☆102 Updated 4 months ago
- Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google Deepmind ☆67 Updated last year
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training ☆132 Updated last year
- [NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs ☆92 Updated 10 months ago
- Open source interpretability artefacts for R1. ☆158 Updated 4 months ago
- ☆69 Updated last year
- Sparse and discrete interpretability tool for neural networks ☆63 Updated last year
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024) ☆192 Updated last year
- code for training & evaluating Contextual Document Embedding models ☆197 Updated 4 months ago
- [ICML 2025] Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction ☆67 Updated 3 months ago
- ReBase: Training Task Experts through Retrieval Based Distillation ☆29 Updated 7 months ago
- Code repository for Black Mamba ☆254 Updated last year
- ☆54 Updated 6 months ago