codelion / pts
Pivotal Token Search
☆23Updated this week
Alternatives and similar repositories for pts
Users that are interested in pts are comparing it to the libraries listed below
Sorting:
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated last year
- ☆16Updated 2 months ago
- Modified Beam Search with periodical restart☆12Updated 8 months ago
- ☆27Updated 2 months ago
- ☆48Updated 6 months ago
- The official code repo and data hub of top_nsigma sampling strategy for LLMs.☆24Updated 3 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆57Updated 8 months ago
- Official Repository for Task-Circuit Quantization☆20Updated 2 weeks ago
- ☆13Updated 5 months ago
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper☆33Updated 2 months ago
- ☆20Updated 5 months ago
- Code, results and other artifacts from the paper introducing the WildChat-50m dataset and the Re-Wild model family.☆29Updated last month
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆22Updated 5 months ago
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…☆25Updated 5 months ago
- A repository for research on medium sized language models.☆76Updated 11 months ago
- ☆33Updated 11 months ago
- Verifiers for LLM Reinforcement Learning☆50Updated last month
- Data preparation code for CrystalCoder 7B LLM☆44Updated last year
- Testing paligemma2 finetuning on reasoning dataset☆18Updated 4 months ago
- ☆64Updated last month
- LLMs as Collaboratively Edited Knowledge Bases☆45Updated last year
- ☆27Updated 2 weeks ago
- Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812)☆30Updated 2 months ago
- ☆25Updated 8 months ago
- Nexusflow function call, tool use, and agent benchmarks.☆19Updated 5 months ago
- Lottery Ticket Adaptation☆39Updated 5 months ago
- Lightweight toolkit package to train and fine-tune 1.58bit Language models☆44Updated this week
- ☆48Updated 3 months ago
- Codebase for Instruction Following without Instruction Tuning☆34Updated 7 months ago
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated 2 months ago