chenzhiling9954/Critical-Tokens-Matter

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/chenzhiling9954/Critical-Tokens-Matter)

chenzhiling9954 / Critical-Tokens-Matter

☆48

Alternatives and similar repositories for Critical-Tokens-Matter

Users that are interested in Critical-Tokens-Matter are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

EsYoon7 / RLHF-TLCR
View on GitHub
[ACL'24 Findings] Official code for "TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback"
☆12Dec 6, 2024Updated last year
choosewhatulike / case2code
View on GitHub
☆17Apr 7, 2025Updated last year
yifanycc / AdaZeta
View on GitHub
[EMNLP 24] Source code for paper 'AdaZeta: Adaptive Zeroth-Order Tensor-Train Adaption for Memory-Efficient Large Language Models Fine-Tu…
☆13Dec 15, 2024Updated last year
yiqingxyq / RepoST
View on GitHub
Code for "[COLM'25] RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing"
☆24Mar 18, 2025Updated last year
HumainLab / Personalized_RLHF
View on GitHub
☆28Dec 9, 2024Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
lemaoliu / retrieval-generation-tutorial
View on GitHub
☆11Jun 19, 2022Updated 4 years ago
akhilkedia / TranformersGetStable
View on GitHub
[ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"
☆11Jul 19, 2024Updated 2 years ago
ChartMimic / ChartMimic
View on GitHub
[ICLR 2025] ChartMimic: Evaluating LMM’s Cross-Modal Reasoning Capability via Chart-to-Code Generation
☆132Dec 19, 2025Updated 7 months ago
NineAbyss / S2R
View on GitHub
This is the official implementation of the paper "S²R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning"
☆77Apr 22, 2025Updated last year
zankner / CLoud
View on GitHub
Critique-out-Loud Reward Models
☆76Oct 18, 2024Updated last year
rhyang2021 / ARIA
View on GitHub
Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".
☆30Aug 9, 2025Updated 11 months ago
rhubarbwu / linguistic-collapse
View on GitHub
Codebase for Linguistic Collapse: Neural Collapse in (Large) Language Models [NeurIPS 2024] [arXiv:2405.17767]
☆19Apr 14, 2025Updated last year
bbartoldson / TBA
View on GitHub
Official implementation of TBA for async LLM post-training.
☆32Nov 5, 2025Updated 8 months ago
SihengLi99 / LLM-Honesty-Survey
View on GitHub
[2025-TMLR] A Survey on the Honesty of Large Language Models
☆66Dec 8, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
CriticBench / CriticBench
View on GitHub
[ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning
☆31Mar 5, 2024Updated 2 years ago
mzf666 / LORO-main
View on GitHub
Official implementation of ICLR 2025 'LORO: Parameter and Memory Efficient Pretraining via Low-rank Riemannian Optimization'
☆17Apr 24, 2025Updated last year
Liac-li / MM-self-improve-qwen2vl
View on GitHub
☆13Dec 9, 2024Updated last year
Hesse73 / RLVR-Directions
View on GitHub
Source Code for our ICLR'26 paper
☆17Feb 22, 2026Updated 5 months ago
iesl / protoqa-data
View on GitHub
Dataset for protoqa ("family feud") data
☆34Dec 2, 2021Updated 4 years ago
vitorpamplona / splitlearning
View on GitHub
Simple Python Socket-based Split Learning technique using PyTorch
☆14Mar 13, 2020Updated 6 years ago
peterbhase / SLAG-Belief-Updating
View on GitHub
Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs"
☆28May 2, 2022Updated 4 years ago
caiqizh / LUQ
View on GitHub
☆15Jan 14, 2026Updated 6 months ago
RUCKBReasoning / CoT-based-Synthesizer
View on GitHub
Official code implementation for the ACL 2025 paper: 'CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis'
☆32May 19, 2025Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
jczhang02 / MUSIC_dataset_script
View on GitHub
This repo contains script to download MUSIC dataset from youtube
☆12Jan 19, 2024Updated 2 years ago
TEAM-ARM / arm
View on GitHub
[NeurIPS'25 Spotlight] ARM: Adaptive Reasoning Model
☆68Apr 6, 2026Updated 3 months ago
NonvolatileMemory / GliDe_with_a_CaPE_ICML_24
View on GitHub
official code for GliDe with a CaPE
☆22Aug 13, 2024Updated last year
tanganke / subspace_fusion
View on GitHub
Code for paper "Concrete Subspace Learning based Interference Elimination for Multi-task Model Fusion"
☆14Mar 28, 2024Updated 2 years ago
Pickl-3 / hitbox-fightstick-game-device
View on GitHub
☆14Jul 23, 2023Updated 3 years ago
mingliangzhang2018 / PGPS
View on GitHub
The implement of geometric solver PGPSNet
☆30Jul 8, 2026Updated 3 weeks ago
ise-uiuc / xft
View on GitHub
XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts
☆36Jul 2, 2024Updated 2 years ago
kyrieLei / Critic-V
View on GitHub
☆18Apr 23, 2025Updated last year
fonsc / factorsum
View on GitHub
Code for the paper Factorizing Content and Budget Decisions in Abstractive Summarization of Long Documents: https://arxiv.org/abs/2205.12…
☆12Feb 10, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
kttian / llm_factuality_tuning
View on GitHub
☆41May 2, 2024Updated 2 years ago
URSA-MATH / URSA-MATH
View on GitHub
☆132Sep 20, 2025Updated 10 months ago
RenzeLou / Muffin
View on GitHub
MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following
☆16Oct 31, 2024Updated last year
gmongaras / Cottention_Transformer
View on GitHub
Code for the paper "Cottention: Linear Transformers With Cosine Attention"
☆20Nov 15, 2025Updated 8 months ago
PolarisRisingWar / Math_Word_Problem_Collection
View on GitHub
A collection for math word problem (MWP) works, including datasets, algorithms and so on.
☆47Jun 18, 2024Updated 2 years ago
zwhe99 / RaSA
View on GitHub
[ICLR 2025] RaSA: Rank-Sharing Low-Rank Adaptation
☆10May 19, 2025Updated last year
Jarviswang94 / Multilingual_safety_benchmark
View on GitHub
Multilingual safety benchmark for Large Language Models
☆53Sep 1, 2024Updated last year