rjha18 / vec2vec
☆237 · Updated last month
Alternatives and similar repositories for vec2vec
Users that are interested in vec2vec are comparing it to the libraries listed below
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources ☆146 · Updated this week
- PyTorch library for Active Fine-Tuning ☆93 · Updated last week
- [NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs ☆92 · Updated 10 months ago
- ICLR 2025 - official implementation for "I-Con: A Unifying Framework for Representation Learning" ☆111 · Updated 3 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆60 · Updated last year
- ☆142 · Updated 3 weeks ago
- ☆82 · Updated last year
- [ICLR 2025] Monet: Mixture of Monosemantic Experts for Transformers ☆73 · Updated 3 months ago
- One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation ☆42 · Updated 11 months ago
- Official implementation of "BERTs are Generative In-Context Learners" ☆32 · Updated 6 months ago
- ☆109 · Updated 7 months ago
- Sparse and discrete interpretability tool for neural networks ☆63 · Updated last year
- ☆55 · Updated last year
- Code for ExploreTom ☆86 · Updated 3 months ago
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024) ☆193 · Updated last year
- Source code for the collaborative reasoner research project at Meta FAIR. ☆103 · Updated 5 months ago
- [ICML 2025] Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction ☆70 · Updated 4 months ago
- Code for training & evaluating Contextual Document Embedding models ☆197 · Updated 4 months ago
- ☆81 · Updated last week
- ☆69 · Updated last year
- Official repository for "Scaling Retrieval-Based Language Models with a Trillion-Token Datastore". ☆216 · Updated 2 months ago
- Getting crystal-like representations with harmonic loss ☆194 · Updated 6 months ago
- ☆69 · Updated last year
- Code for reproducing our paper "Not All Language Model Features Are Linear" ☆80 · Updated 10 months ago
- Discovering Data-driven Hypotheses in the Wild ☆112 · Updated 3 months ago
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training ☆132 · Updated last year
- Official implementation of MAIA, A Multimodal Automated Interpretability Agent ☆92 · Updated 3 months ago
- Functional Benchmarks and the Reasoning Gap ☆89 · Updated last year
- Code, results and other artifacts from the paper introducing the WildChat-50m dataset and the Re-Wild model family. ☆31 · Updated 6 months ago
- Official Code for Paper: Beyond Matryoshka: Revisiting Sparse Coding for Adaptive Representation ☆125 · Updated 3 months ago