fjzzq2002/random_transformers

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/fjzzq2002/random_transformers)

fjzzq2002 / random_transformers

Official code for "Algorithmic Capabilities of Random Transformers" (NeurIPS 2024)

☆15

Alternatives and similar repositories for random_transformers

Users that are interested in random_transformers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

apple / ml-np-rasp
View on GitHub
☆22Jan 19, 2024Updated 2 years ago
mghasemi / Irene
View on GitHub
Irene is a python package that aims to be a toolkit for global optimization problems that can be realized algebraically. It generalizes L…
☆15Jul 10, 2026Updated last week
liziniu / HyperDQN
View on GitHub
Code for ICLR 2022 Paper (HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning)
☆12Nov 28, 2023Updated 2 years ago
blei-lab / circuitry
View on GitHub
☆15Oct 30, 2024Updated last year
StevenHickson / CreateNormals
View on GitHub
☆11Nov 22, 2019Updated 6 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
pentagonalize / Transformer-Cookbook
View on GitHub
☆18Feb 4, 2025Updated last year
JasonGross / guarantees-based-mechanistic-interpretability
View on GitHub
☆18Updated this week
NickyFot / ACMMM22_LearningLabelRelationships
View on GitHub
☆11Jun 20, 2023Updated 3 years ago
FL33TW00D / wgpu-bench
View on GitHub
☆12Jun 27, 2024Updated 2 years ago
moelody / link-to-server
View on GitHub
☆13Nov 15, 2023Updated 2 years ago
grantwinney / generate-links-for-headers
View on GitHub
A browser extension that generates links for all headers on the page (when it can) and makes it easier to share specific sections.
☆16Mar 3, 2022Updated 4 years ago
SunTongtongtong / Benchmark-Robustness-Text-Image-Compose-Retrieval
View on GitHub
☆13Apr 12, 2026Updated 3 months ago
cssstats / wayback-css
View on GitHub
Get the css from a domain and timestamp via the wayback machine
☆18Aug 17, 2017Updated 8 years ago
peterljq / Parsimonious-Concept-Engineering
View on GitHub
PaCE: Parsimonious Concept Engineering for Large Language Models (NeurIPS 2024)
☆43Jan 18, 2026Updated 6 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
zfw-cv / MPFNet
View on GitHub
Self-supervised MPFNet for realistic bokeh effect rendering(JVCIR2022)
☆14Jul 5, 2022Updated 4 years ago
rmovva / wimhf
View on GitHub
What's In My Human Feedback? Explaining preferences in human feedback using interpretability + LLMs. https://arxiv.org/abs/2510.26202
☆26May 9, 2026Updated 2 months ago
nikitadurasov / torch-ttt
View on GitHub
A modular and easy-to-use framework for Test-Time Training (TTT) and Test-Time Adaptation (TTA) in Pytorch, making your networks more gen…
☆32Jul 13, 2026Updated last week
liziniu / KnapsackRL
View on GitHub
☆19Oct 30, 2025Updated 8 months ago
shauli-ravfogel / rlace-icml
View on GitHub
☆39Jul 14, 2022Updated 4 years ago
fiveai / understanding_safety_finetuning
View on GitHub
Official Code for What Makes and Breaks Safety Fine-tuning? A Mechanistic Study (NeurIPS 2024)
☆12Oct 31, 2024Updated last year
davisrbr / conjectures-arxiv
View on GitHub
OpenConjecture, a dataset of mathematics conjectures pulled from papers published to the ArXiv
☆15Jul 12, 2026Updated last week
vectozavr / llm-hessian
View on GitHub
Using PyTorch autograd to compute Hessian of Perplexity for Large Language Models
☆29Apr 17, 2025Updated last year
liziniu / policy_optimization
View on GitHub
Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data)
☆29Dec 19, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
caobokai / DeepMood
View on GitHub
☆15Feb 25, 2018Updated 8 years ago
JuniMay / llm.rs
View on GitHub
An attempt to migrate Karpathy's llm.c to safe rust.
☆13Jun 4, 2024Updated 2 years ago
KingJamesSong / HouseholderGAN
View on GitHub
ICCV23 "Householder Projector for Unsupervised Latent Semantics Discovery"
☆17Jul 7, 2026Updated 2 weeks ago
sagemathinc / cocalc-examples
View on GitHub
collection of example documents for use within cocalc's library
☆17Sep 11, 2025Updated 10 months ago
renatoberlinghieri / Helmholtz-GP
View on GitHub
☆11Mar 13, 2023Updated 3 years ago
RobertCsordas / onion_representations
View on GitHub
☆13Aug 19, 2024Updated last year
rhubarbwu / linguistic-collapse
View on GitHub
Codebase for Linguistic Collapse: Neural Collapse in (Large) Language Models [NeurIPS 2024] [arXiv:2405.17767]
☆19Apr 14, 2025Updated last year
zshipko / futhark-bindgen
View on GitHub
A Futhark binding generator for Rust and OCaml
☆32Feb 3, 2026Updated 5 months ago
security0528 / PublicArena
View on GitHub
Public Arena dataset
☆16Jul 20, 2022Updated 4 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
lawrennd / neurips2014
View on GitHub
Notebooks for managing NeurIPS 2014 and analysing the NeurIPS experiment.
☆13May 22, 2024Updated 2 years ago
jacobdunefsky / transcoder_circuits
View on GitHub
☆212Nov 17, 2024Updated last year
bgub / tokka-bench
View on GitHub
benchmarks for LLM tokenizers
☆20Mar 25, 2026Updated 3 months ago
FreedomIntelligence / TinyDeepSeek
View on GitHub
Reproduction of the complete process of DeepSeek-R1 on small-scale models, including Pre-training, SFT, and RL.
☆30Mar 11, 2025Updated last year
lacoco-lab / decompiling_transformers
View on GitHub
Repo for Paper: Discovering Interpretable Algorithms by Decompiling Transformers to RASP
☆15May 25, 2026Updated last month
Shen-Lab / Bayesian-L2O
View on GitHub
[ICLR 2022] "Bayesian Modeling and Uncertainty Quantification for Learning to Optimize: What, Why, and How" by Yuning You, Yue Cao, Tianl…
☆14Aug 19, 2022Updated 3 years ago
marketdesignresearch / NOMU
View on GitHub
NOMU: Neural Optimization-based Model Uncertainty
☆10Feb 17, 2023Updated 3 years ago