schwartz-lab-NLP / Tokens2WordsLinks
☆13Updated 5 months ago
Alternatives and similar repositories for Tokens2Words
Users that are interested in Tokens2Words are comparing it to the libraries listed below
Sorting:
- ☆22Updated last month
- https://footprints.baulab.info☆17Updated 11 months ago
- ☆20Updated last year
- ☆19Updated 6 months ago
- Code for reproducing our paper "Not All Language Model Features Are Linear"☆80Updated 9 months ago
- Codebase for Instruction Following without Instruction Tuning☆35Updated last year
- Code for the EMNLP24 paper "A simple and effective L2 norm based method for KV Cache compression."☆16Updated 9 months ago
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"☆27Updated last year
- [ICLR 2025] Monet: Mixture of Monosemantic Experts for Transformers☆70Updated 3 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆60Updated last year
- This is official project in our paper: Is Bigger and Deeper Always Better? Probing LLaMA Across Scales and Layers☆31Updated last year
- Exploration of automated dataset selection approaches at large scales.☆47Updated 6 months ago
- [NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs☆92Updated 10 months ago
- Stanford NLP Python library for benchmarking the utility of LLM interpretability methods☆129Updated 3 months ago
- Efficient Scaling laws and collaborative pretraining.☆18Updated last week
- Codebase for Context-aware Meta-learned Loss Scaling (CaMeLS). https://arxiv.org/abs/2305.15076.☆25Updated last year
- Applies ROME and MEMIT on Mamba-S4 models☆14Updated last year
- ☆85Updated last year
- A repository for research on medium sized language models.☆77Updated last year
- ☆23Updated 7 months ago
- ☆27Updated last year
- [KDD24-ADS] R-Eval: A Unified Toolkit for Evaluating Domain Knowledge of Retrieval Augmented Large Language Models☆12Updated last year
- Effective Unsupervised Domain Adaptation of Neural Rankers by Diversifying Synthetic Query Generation☆15Updated 5 months ago
- ☆15Updated last year
- Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"☆83Updated 10 months ago
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆34Updated last month
- Multi-Layer Sparse Autoencoders (ICLR 2025)☆24Updated 7 months ago
- [NeurIPS 2025] MergeBench: A Benchmark for Merging Domain-Specialized LLMs☆22Updated 4 months ago
- Synthesizing realistic and diverse text-datasets from augmented LLMs☆15Updated 5 months ago
- Minimum Description Length probing for neural network representations☆18Updated 7 months ago