safety-research/inverse-scaling-ttc

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/safety-research/inverse-scaling-ttc)

safety-research / inverse-scaling-ttc

Inverse Scaling in Test-Time Compute

☆26

Alternatives and similar repositories for inverse-scaling-ttc

Users that are interested in inverse-scaling-ttc are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

princeton-nlp / unintentional-unalignment
View on GitHub
[ICLR 2025] Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization
☆32Jan 7, 2026Updated 6 months ago
krishgoel / chronocept-baseline-models
View on GitHub
The official baseline implementations for Chronocept
☆10Mar 31, 2026Updated 3 months ago
cat-state / modded-nanogpt-moe
View on GitHub
☆17Sep 6, 2025Updated 10 months ago
formll / resolving-scaling-law-discrepancies
View on GitHub
☆19Nov 4, 2025Updated 8 months ago
Trustworthy-ML-Lab / ThinkEdit
View on GitHub
[EMNLP 25] An effective and interpretable weight-editing method for mitigating overly short reasoning in LLMs, and a mechanistic study un…
☆19Dec 17, 2025Updated 7 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
maxjeblick / llm-docstring-generator
View on GitHub
☆21Apr 13, 2024Updated 2 years ago
yuzhaouoe / pretraining-data-packing
View on GitHub
[ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Training
☆24Aug 18, 2024Updated last year
martin-marek / forgetting
View on GitHub
📄 Forgetting in Language Models: Capacity, Optimization, and Self-Generated Replay
☆24Jul 17, 2026Updated last week
Xalp / MARS
View on GitHub
Official Implementation of MARS
☆30Apr 21, 2026Updated 3 months ago
Leey21 / data-lineage
View on GitHub
Trace origins, shared sources, and contamination risk
☆25May 27, 2026Updated 2 months ago
Infini-AI-Lab / Kinetics
View on GitHub
Kinetics: Rethinking Test-Time Scaling Laws
☆87Jul 11, 2025Updated last year
declare-lab / resta
View on GitHub
Restore safety in fine-tuned language models through task arithmetic
☆33Mar 28, 2024Updated 2 years ago
VITA-Group / WeLore
View on GitHub
[ICML 2025] From Low Rank Gradient Subspace Stabilization to Low-Rank Weights: Observations, Theories and Applications
☆52Oct 30, 2025Updated 8 months ago
PositionalHidden / PositionalHidden
View on GitHub
To mitigate position bias in LLMs, especially in long-context scenarios, we scale only one dimension of LLMs, reducing position bias and …
☆12Jun 18, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
machilusZ / FastGen
View on GitHub
This repo contains the source code for: Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs
☆44Aug 14, 2024Updated last year
sanowl / CoRAG
View on GitHub
this is based on the paper Chain-of-Retrieval Augmented Generation
☆15Mar 29, 2025Updated last year
Shen-Lab / Bayesian-L2O
View on GitHub
[ICLR 2022] "Bayesian Modeling and Uncertainty Quantification for Learning to Optimize: What, Why, and How" by Yuning You, Yue Cao, Tianl…
☆14Aug 19, 2022Updated 3 years ago
GDPlumb / ExpO
View on GitHub
Explanation Optimization
☆13Oct 16, 2020Updated 5 years ago
rali-udem / gophi
View on GitHub
GOPHI: an AMR-to-English Verbalizer
☆11Feb 5, 2020Updated 6 years ago
jwkirchenbauer / mtp-lm
View on GitHub
Source code to accompany research paper on training multi token prediction language models using self-distillation.
☆39Feb 21, 2026Updated 5 months ago
declare-lab / safety-arithmetic
View on GitHub
☆13Jan 14, 2025Updated last year
Leooyii / LCEG
View on GitHub
[COLM'25] A Controlled Study on Long Context Extension and Generalization in LLMs
☆65Mar 9, 2026Updated 4 months ago
castorini / nuggetizer
View on GitHub
☆28Apr 19, 2026Updated 3 months ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
Buzz-Beater / EgoTaskQA
View on GitHub
Code for NeurIPS 2022 Datasets and Benchmarks paper - EgoTaskQA: Understanding Human Tasks in Egocentric Videos.
☆45Apr 17, 2023Updated 3 years ago
amansk / awsscripts
View on GitHub
Scripts for making Hadoop deployments in AWS easy
☆10Feb 26, 2014Updated 12 years ago
Raman1121 / FairTune
View on GitHub
A framework to optimize Parameter-Efficient Fine-Tuning for Fairness in Medical Image Analysis
☆12Feb 29, 2024Updated 2 years ago
TheMoskowitz / Sarcasm_Detector
View on GitHub
A neural network for sarcasm detection I trained on the reddit sarcasm database
☆13Jul 27, 2017Updated 9 years ago
IGITUGraz / SparseAdversarialTraining
View on GitHub
Code for "Training Adversarially Robust Sparse Networks via Bayesian Connectivity Sampling" [ICML 2021]
☆10Mar 14, 2022Updated 4 years ago
nikvaessen / Rethinking-Binarized-Neural-Network-Optimization
View on GitHub
Reproduction of "Latent Weights Do Not Exist: Rethinking Binarized Neural Network Optimization" for the Reproducibility challenge@NeurIPS…
☆11Jan 14, 2020Updated 6 years ago
cprakashagr / hog-svm-tf
View on GitHub
Implementing -- Histogram of oriented gradients / Support Vector Machine / TensorFlow
☆11Mar 15, 2017Updated 9 years ago
UNITES-Lab / HEXA-MoE
View on GitHub
Official code for the paper "HEXA-MoE: Efficient and Heterogeneous-Aware MoE Acceleration with Zero Computation Redundancy"
☆15Mar 6, 2025Updated last year
aviaefrat / lmentry
View on GitHub
☆15Nov 22, 2023Updated 2 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
e-spaulding / xpo
View on GitHub
☆12Jun 18, 2024Updated 2 years ago
xufangzhi / phi-Decoding
View on GitHub
[ACL 2025] An inference-time decoding strategy with adaptive foresight sampling
☆107May 18, 2025Updated last year
JierunChen / SFT-RL-SynergyDilemma
View on GitHub
☆15Jan 14, 2026Updated 6 months ago
VITA-Group / Robust_Weight_Signatures
View on GitHub
[ICML 2023] "Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?" by Ruisi Cai, Zhenyu Zhang, Zhangyang Wang
☆16May 4, 2023Updated 3 years ago
EleutherAI / improved-t5
View on GitHub
Experiments for efforts to train a new and improved t5
☆76Apr 15, 2024Updated 2 years ago
qcpolimi / SIGIR22_QuantumFeatureSelection
View on GitHub
This repository contains the source code for the article "Towards Feature Selection for Ranking and Classification Exploiting Quantum Ann…
☆10Jul 27, 2022Updated 4 years ago
archelogos / sequence-detector
View on GitHub
Identifying digits and sequences with CNNs using TensorFlow
☆17Aug 12, 2016Updated 9 years ago