JAX implementation of "Fine-Tuning Language Models with Just Forward Passes"
☆19Jun 10, 2023Updated 3 years ago
Alternatives and similar repositories for mezo-jax
Users that are interested in mezo-jax are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆10Jul 6, 2021Updated 4 years ago
- Optimizing Hyperparameters with Conformal Quantile Regression☆11May 22, 2023Updated 3 years ago
- Source code for "Taming GANs with Lookahead–Minmax", ICLR 2021.☆15Mar 28, 2021Updated 5 years ago
- ☆32Aug 28, 2020Updated 5 years ago
- ☆35Jul 5, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Implementations of the algorithms described in the paper: On the Convergence Theory for Hessian-Free Bilevel Algorithms.☆11Nov 1, 2024Updated last year
- Optimization algorithm which fits a ResNet to CIFAR-10 5x faster than SGD / Adam (with terrible generalization)☆14Oct 20, 2023Updated 2 years ago
- ☆19Jun 3, 2023Updated 3 years ago
- Experiments with Super-Universal Newton method.☆13Aug 12, 2022Updated 3 years ago
- Code for the paper "Bias-Reduced Uncertainty Estimation for Deep Neural Classifiers" published in ICLR 2019☆13Apr 25, 2019Updated 7 years ago
- [ICLR 2025] This repository contains the code to reproduce the results from our paper From Sparse Dependence to Sparse Attention: Unveili…☆12Mar 7, 2025Updated last year
- [FG 2019 Oral] Attribute-Guided Sketch Generation☆10Jul 25, 2021Updated 4 years ago
- Reimplementation of facebook's DinoV2 in JAX. Inference (with pretrained weights) only; training is unsupported.☆12Jun 25, 2024Updated 2 years ago
- Source codes of "Fast Continuous Subgraph Matching over Streaming Graphs via Backtracking Reduction", SIGMOD 2023☆14Sep 7, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆15Feb 12, 2025Updated last year
- ☆15Apr 14, 2025Updated last year
- ZOSVRG-BlackBox-Adv☆13Oct 30, 2018Updated 7 years ago
- Code for LaMPP: Language Models as Probabilistic Priors for Perception and Action☆37Apr 3, 2023Updated 3 years ago
- ☆16Nov 19, 2021Updated 4 years ago
- ☆17Apr 20, 2025Updated last year
- A domain-specific probabilistic programming language for modeling and inference with language models☆143Apr 29, 2025Updated last year
- A framework for Bayesian optimization of composite functions.☆15Dec 8, 2022Updated 3 years ago
- R-GAP: Recursive Gradient Attack on Privacy [Accepted at ICLR 2021]☆37Feb 20, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆12Apr 26, 2022Updated 4 years ago
- L4DC2021 code repository☆14Apr 14, 2021Updated 5 years ago
- PyTorch implementation of LARS (Layer-wise Adaptive Rate Scaling)☆20May 11, 2019Updated 7 years ago
- Hessian backpropagation (HBP): PyTorch extension of backpropagation for block-diagonal curvature matrix approximations☆22Mar 25, 2023Updated 3 years ago
- An implementation of the Semantic Style Transfer in PyTorch. Original paper: https://arxiv.org/abs/1603.01768.☆15Oct 7, 2018Updated 7 years ago
- mu4e thread folding☆12Mar 30, 2023Updated 3 years ago
- Zurich Morphological Lexicon for German: a tool to extract a morphological lexicon from Wiktionary☆12Aug 10, 2023Updated 2 years ago
- The source code of "A Comprehensive Survey and Experimental Study of Subgraph Matching: Trends, Unbiasedness, and Interaction"☆17Sep 6, 2024Updated last year
- Implementation of the "Deep Hierarchical Representation of Point Cloud Videos via Spatio-Temporal Decomposition" paper.☆21Apr 13, 2021Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆12Dec 23, 2022Updated 3 years ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Aug 12, 2023Updated 2 years ago
- ☆15Sep 10, 2023Updated 2 years ago
- Code for "What really matters in matrix-whitening optimizers?"☆24Oct 31, 2025Updated 7 months ago
- ☆19Dec 12, 2023Updated 2 years ago
- JAX implementation of the T5 model: Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer☆24Jun 10, 2023Updated 3 years ago
- ☆31Feb 7, 2025Updated last year