Official code for "Algorithmic Capabilities of Random Transformers" (NeurIPS 2024)
☆16Sep 28, 2024Updated last year
Alternatives and similar repositories for random_transformers
Users that are interested in random_transformers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆22Jan 19, 2024Updated 2 years ago
- ☆19Oct 30, 2025Updated 6 months ago
- Code for ICLR 2022 Paper (HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning)☆12Nov 28, 2023Updated 2 years ago
- Irene is a python package that aims to be a toolkit for global optimization problems that can be realized algebraically. It generalizes L…☆15May 1, 2026Updated 2 weeks ago
- ☆14Oct 30, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆11Nov 22, 2019Updated 6 years ago
- Train small sequence models in your browser with WebGPU.☆34Dec 3, 2025Updated 5 months ago
- ☆18Updated this week
- ☆18Feb 4, 2025Updated last year
- ☆11Jun 20, 2023Updated 2 years ago
- ☆12Jun 27, 2024Updated last year
- A browser extension that generates links for all headers on the page (when it can) and makes it easier to share specific sections.☆16Mar 3, 2022Updated 4 years ago
- ☆13Nov 15, 2023Updated 2 years ago
- A modular and easy-to-use framework for Test-Time Training (TTT) and Test-Time Adaptation (TTA) in Pytorch, making your networks more gen …☆31May 11, 2026Updated last week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆12Apr 12, 2026Updated last month
- PaCE: Parsimonious Concept Engineering for Large Language Models (NeurIPS 2024)☆43Jan 18, 2026Updated 4 months ago
- Get the css from a domain and timestamp via the wayback machine☆18Aug 17, 2017Updated 8 years ago
- Using PyTorch autograd to compute Hessian of Perplexity for Large Language Models☆29Apr 17, 2025Updated last year
- Interlink macvim & skim for an integrated LaTeX DE☆17Mar 14, 2016Updated 10 years ago
- [CVPR'25] AIM-Fair: Advancing Algorithmic Fairness via Selectively Fine-Tuning Biased Models with Contextual Synthetic Data☆17Mar 27, 2025Updated last year
- ☆38Jul 14, 2022Updated 3 years ago
- This repo contains the code for the paper "Understanding and Mitigating Hallucinations in Large Vision-Language Models via Modular Attrib…☆39Jul 14, 2025Updated 10 months ago
- Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data)☆29Dec 19, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- collection of example documents for use within cocalc's library☆17Sep 11, 2025Updated 8 months ago
- ☆15Feb 25, 2018Updated 8 years ago
- ICCV23 "Householder Projector for Unsupervised Latent Semantics Discovery"☆17Jun 26, 2025Updated 10 months ago
- Official Code for What Makes and Breaks Safety Fine-tuning? A Mechanistic Study (NeurIPS 2024)☆12Oct 31, 2024Updated last year
- An attempt to migrate Karpathy's llm.c to safe rust.☆13Jun 4, 2024Updated last year
- [ICML 2025] Official repository for paper "OR-Bench: An Over-Refusal Benchmark for Large Language Models"☆26Mar 4, 2025Updated last year
- Self-supervised MPFNet for realistic bokeh effect rendering(JVCIR2022)☆14Jul 5, 2022Updated 3 years ago
- Codebase for Linguistic Collapse: Neural Collapse in (Large) Language Models [NeurIPS 2024] [arXiv:2405.17767]☆18Apr 14, 2025Updated last year
- Reproduction of the complete process of DeepSeek-R1 on small-scale models, including Pre-training, SFT, and RL.☆29Mar 11, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆13Aug 19, 2024Updated last year
- Notebooks for managing NeurIPS 2014 and analysing the NeurIPS experiment.☆13May 22, 2024Updated last year
- Public Arena dataset☆15Jul 20, 2022Updated 3 years ago
- ☆208Nov 17, 2024Updated last year
- [ICLR 2022] "Bayesian Modeling and Uncertainty Quantification for Learning to Optimize: What, Why, and How" by Yuning You, Yue Cao, Tianl…☆14Aug 19, 2022Updated 3 years ago
- 🚀 Sliding Window Attention Training for Efficient Large Language Models☆16Dec 8, 2025Updated 5 months ago
- Visual Object Tracking: The Initialisation Problem☆14Nov 2, 2022Updated 3 years ago