Official code for "Algorithmic Capabilities of Random Transformers" (NeurIPS 2024)
☆16Sep 28, 2024Updated last year
Alternatives and similar repositories for random_transformers
Users that are interested in random_transformers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆22Jan 19, 2024Updated 2 years ago
- ☆19Oct 30, 2025Updated 6 months ago
- Code for ICLR 2022 Paper (HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning)☆12Nov 28, 2023Updated 2 years ago
- Irene is a python package that aims to be a toolkit for global optimization problems that can be realized algebraically. It generalizes L…☆15Updated this week
- ☆14Oct 30, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆11Nov 22, 2019Updated 6 years ago
- Train small sequence models in your browser with WebGPU.☆34Dec 3, 2025Updated 4 months ago
- ☆18Updated this week
- ☆18Feb 4, 2025Updated last year
- ☆11Jun 20, 2023Updated 2 years ago
- ☆12Jun 27, 2024Updated last year
- A browser extension that generates links for all headers on the page (when it can) and makes it easier to share specific sections.☆16Mar 3, 2022Updated 4 years ago
- ☆13Nov 15, 2023Updated 2 years ago
- A modular and easy-to-use framework for Test-Time Training (TTT) and Test-Time Adaptation (TTA) in Pytorch, making your networks more gen…☆31Apr 20, 2026Updated last week
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆12Apr 12, 2026Updated 2 weeks ago
- PaCE: Parsimonious Concept Engineering for Large Language Models (NeurIPS 2024)☆43Jan 18, 2026Updated 3 months ago
- Get the css from a domain and timestamp via the wayback machine☆18Aug 17, 2017Updated 8 years ago
- Using PyTorch autograd to compute Hessian of Perplexity for Large Language Models☆29Apr 17, 2025Updated last year
- Interlink macvim & skim for an integrated LaTeX DE☆17Mar 14, 2016Updated 10 years ago
- [CVPR'25] AIM-Fair: Advancing Algorithmic Fairness via Selectively Fine-Tuning Biased Models with Contextual Synthetic Data☆17Mar 27, 2025Updated last year
- ☆37Jul 14, 2022Updated 3 years ago
- This repo contains the code for the paper "Understanding and Mitigating Hallucinations in Large Vision-Language Models via Modular Attrib…☆37Jul 14, 2025Updated 9 months ago
- Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data)☆29Dec 19, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- collection of example documents for use within cocalc's library☆17Sep 11, 2025Updated 7 months ago
- ☆15Feb 25, 2018Updated 8 years ago
- ICCV23 "Householder Projector for Unsupervised Latent Semantics Discovery"☆17Jun 26, 2025Updated 10 months ago
- Official Code for What Makes and Breaks Safety Fine-tuning? A Mechanistic Study (NeurIPS 2024)☆12Oct 31, 2024Updated last year
- An attempt to migrate Karpathy's llm.c to safe rust.☆13Jun 4, 2024Updated last year
- [ICML 2025] Official repository for paper "OR-Bench: An Over-Refusal Benchmark for Large Language Models"☆26Mar 4, 2025Updated last year
- Self-supervised MPFNet for realistic bokeh effect rendering(JVCIR2022)☆14Jul 5, 2022Updated 3 years ago
- Codebase for Linguistic Collapse: Neural Collapse in (Large) Language Models [NeurIPS 2024] [arXiv:2405.17767]☆18Apr 14, 2025Updated last year
- Reproduction of the complete process of DeepSeek-R1 on small-scale models, including Pre-training, SFT, and RL.☆29Mar 11, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆13Aug 19, 2024Updated last year
- Notebooks for managing NeurIPS 2014 and analysing the NeurIPS experiment.☆13May 22, 2024Updated last year
- Public Arena dataset☆15Jul 20, 2022Updated 3 years ago
- Code for "Kuramoto Orientation Diffusion"☆28Nov 7, 2025Updated 5 months ago
- ☆206Nov 17, 2024Updated last year
- [ICLR 2022] "Bayesian Modeling and Uncertainty Quantification for Learning to Optimize: What, Why, and How" by Yuning You, Yue Cao, Tianl…☆14Aug 19, 2022Updated 3 years ago
- 🚀 Sliding Window Attention Training for Efficient Large Language Models☆16Dec 8, 2025Updated 4 months ago