bminixhofer / zett
Code for Zero-Shot Tokenizer Transfer
☆127Updated 4 months ago
Alternatives and similar repositories for zett
Users that are interested in zett are comparing it to the libraries listed below
Sorting:
- ☆38Updated last year
- Code for the paper "The Impact of Positional Encoding on Length Generalization in Transformers", NeurIPS 2023☆135Updated last year
- [ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners☆115Updated 8 months ago
- ☆72Updated last year
- The HELMET Benchmark☆143Updated 3 weeks ago
- Minimum Bayes Risk Decoding for Hugging Face Transformers☆58Updated 11 months ago
- This is the official repository for Inheritune.☆111Updated 3 months ago
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆72Updated 8 months ago
- Language models scale reliably with over-training and on downstream tasks☆97Updated last year
- ☆120Updated 7 months ago
- EvaByte: Efficient Byte-level Language Models at Scale☆92Updated 3 weeks ago
- Implementation of 🥥 Coconut, Chain of Continuous Thought, in Pytorch☆166Updated 4 months ago
- [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision☆87Updated 6 months ago
- Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers".☆73Updated last year
- Code repository for the c-BTM paper☆106Updated last year
- code for training & evaluating Contextual Document Embedding models☆184Updated this week
- Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"☆72Updated 6 months ago
- Code for the paper "Fishing for Magikarp"☆155Updated last week
- This repository contains the code used for the experiments in the paper "Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity…☆25Updated last year
- ☆52Updated 11 months ago
- ☆73Updated 6 months ago
- Understand and test language model architectures on synthetic tasks.☆195Updated 2 months ago
- ☆97Updated 10 months ago
- ☆34Updated last month
- Stanford NLP Python library for benchmarking the utility of LLM interpretability methods☆77Updated last month
- ☆177Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆57Updated 8 months ago
- CausalGym: Benchmarking causal interpretability methods on linguistic tasks☆42Updated 5 months ago
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆44Updated last year
- Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs"☆23Updated last week