Official code for "Algorithmic Capabilities of Random Transformers" (NeurIPS 2024)
☆16Sep 28, 2024Updated last year
Alternatives and similar repositories for random_transformers
Users that are interested in random_transformers are comparing it to the libraries listed below
Sorting:
- ☆21Jan 19, 2024Updated 2 years ago
- ☆18Oct 30, 2025Updated 4 months ago
- Code for ICLR 2022 Paper (HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning)☆12Nov 28, 2023Updated 2 years ago
- Irene is a python package that aims to be a toolkit for global optimization problems that can be realized algebraically. It generalizes L…☆15Updated this week
- ☆14Oct 30, 2024Updated last year
- ☆11Nov 22, 2019Updated 6 years ago
- Train small sequence models in your browser with WebGPU.☆33Dec 3, 2025Updated 3 months ago
- ☆18Mar 13, 2026Updated last week
- ☆18Feb 4, 2025Updated last year
- ☆11Jun 20, 2023Updated 2 years ago
- ☆12Jun 27, 2024Updated last year
- A browser extension that generates links for all headers on the page (when it can) and makes it easier to share specific sections.☆16Mar 3, 2022Updated 4 years ago
- ☆13Nov 15, 2023Updated 2 years ago
- A modular and easy-to-use framework for Test-Time Training (TTT) and Test-Time Adaptation (TTA) in Pytorch, making your networks more gen…☆31Updated this week
- ☆12Jun 26, 2024Updated last year
- PaCE: Parsimonious Concept Engineering for Large Language Models (NeurIPS 2024)☆43Jan 18, 2026Updated 2 months ago
- Get the css from a domain and timestamp via the wayback machine☆18Aug 17, 2017Updated 8 years ago
- Using PyTorch autograd to compute Hessian of Perplexity for Large Language Models☆29Apr 17, 2025Updated 11 months ago
- Interlink macvim & skim for an integrated LaTeX DE☆17Mar 14, 2016Updated 10 years ago
- [CVPR'25] AIM-Fair: Advancing Algorithmic Fairness via Selectively Fine-Tuning Biased Models with Contextual Synthetic Data☆17Mar 27, 2025Updated 11 months ago
- ☆36Jul 14, 2022Updated 3 years ago
- This repo contains the code for the paper "Understanding and Mitigating Hallucinations in Large Vision-Language Models via Modular Attrib…☆35Jul 14, 2025Updated 8 months ago
- Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data)☆29Dec 19, 2023Updated 2 years ago
- collection of example documents for use within cocalc's library☆16Sep 11, 2025Updated 6 months ago
- ☆15Feb 25, 2018Updated 8 years ago
- ICCV23 "Householder Projector for Unsupervised Latent Semantics Discovery"☆17Jun 26, 2025Updated 8 months ago
- Official Code for What Makes and Breaks Safety Fine-tuning? A Mechanistic Study (NeurIPS 2024)☆12Oct 31, 2024Updated last year
- An attempt to migrate Karpathy's llm.c to safe rust.☆13Jun 4, 2024Updated last year
- [ICML 2025] Official repository for paper "OR-Bench: An Over-Refusal Benchmark for Large Language Models"☆25Mar 4, 2025Updated last year
- Self-supervised MPFNet for realistic bokeh effect rendering(JVCIR2022)☆14Jul 5, 2022Updated 3 years ago
- Codebase for Linguistic Collapse: Neural Collapse in (Large) Language Models [NeurIPS 2024] [arXiv:2405.17767]☆18Apr 14, 2025Updated 11 months ago
- Reproduction of the complete process of DeepSeek-R1 on small-scale models, including Pre-training, SFT, and RL.☆29Mar 11, 2025Updated last year
- Notebooks for managing NeurIPS 2014 and analysing the NeurIPS experiment.☆13May 22, 2024Updated last year
- ☆13Aug 19, 2024Updated last year
- Public Arena dataset☆14Jul 20, 2022Updated 3 years ago
- ☆201Nov 17, 2024Updated last year
- Code for "Kuramoto Orientation Diffusion"☆27Nov 7, 2025Updated 4 months ago
- [ICLR 2022] "Bayesian Modeling and Uncertainty Quantification for Learning to Optimize: What, Why, and How" by Yuning You, Yue Cao, Tianl…☆14Aug 19, 2022Updated 3 years ago
- Visual Object Tracking: The Initialisation Problem☆14Nov 2, 2022Updated 3 years ago