JAX implementation of "Fine-Tuning Language Models with Just Forward Passes"
☆19Jun 10, 2023Updated 2 years ago
Alternatives and similar repositories for mezo-jax
Users that are interested in mezo-jax are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- MoodCat😼 classifies the mood of English sentences.☆14Jun 19, 2022Updated 3 years ago
- Official PyTorch code for UAI 2023 paper "Concurrent Misclassification and Out-of-Distribution Detection for Semantic Segmentation via En…☆12Nov 10, 2023Updated 2 years ago
- ☆10Jul 6, 2021Updated 4 years ago
- Source code for "Taming GANs with Lookahead–Minmax", ICLR 2021.☆15Mar 28, 2021Updated 5 years ago
- ☆35Jul 5, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Greedy Bayesian Posterior Approximation with Deep Ensembles. A. Tiulpin and M. B. Blaschko. (2021)☆11Jul 18, 2022Updated 3 years ago
- Implementations of the algorithms described in the paper: On the Convergence Theory for Hessian-Free Bilevel Algorithms.☆11Nov 1, 2024Updated last year
- Optimization algorithm which fits a ResNet to CIFAR-10 5x faster than SGD / Adam (with terrible generalization)☆14Oct 20, 2023Updated 2 years ago
- ☆19Jun 3, 2023Updated 3 years ago
- This is the official implementation of the ICML 2023 paper - Can Forward Gradient Match Backpropagation ?☆13May 31, 2023Updated 3 years ago
- Code of the paper "Beyond calibration: estimating the grouping loss of modern neural networks" published in ICLR 2023.☆12Nov 21, 2023Updated 2 years ago
- Code for the paper "Bias-Reduced Uncertainty Estimation for Deep Neural Classifiers" published in ICLR 2019☆13Apr 25, 2019Updated 7 years ago
- [ICLR 2025] This repository contains the code to reproduce the results from our paper From Sparse Dependence to Sparse Attention: Unveili…☆12Mar 7, 2025Updated last year
- [FG 2019 Oral] Attribute-Guided Sketch Generation☆10Jul 25, 2021Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆15Feb 12, 2025Updated last year
- Code for LaMPP: Language Models as Probabilistic Priors for Perception and Action☆37Apr 3, 2023Updated 3 years ago
- ☆16Nov 19, 2021Updated 4 years ago
- Bayesian Optimization-Based Global Optimal Rank Selection for Compression of Convolutional Neural Networks, IEEE Access☆16Mar 21, 2021Updated 5 years ago
- Official repository for CVPR2023 publication, GEN: Pushing the Limits of Softmax-Based Out-of-Distribution Detection☆19Sep 25, 2024Updated last year
- A domain-specific probabilistic programming language for modeling and inference with language models☆142Apr 29, 2025Updated last year
- A framework for Bayesian optimization of composite functions.☆15Dec 8, 2022Updated 3 years ago
- R-GAP: Recursive Gradient Attack on Privacy [Accepted at ICLR 2021]☆37Feb 20, 2023Updated 3 years ago
- ☆12Apr 26, 2022Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Data Valuation on In-Context Examples (ACL23)☆24Jan 12, 2025Updated last year
- L4DC2021 code repository☆14Apr 14, 2021Updated 5 years ago
- PyTorch implementation of LARS (Layer-wise Adaptive Rate Scaling)☆20May 11, 2019Updated 7 years ago
- Tools for JAX☆50Updated this week
- mu4e thread folding☆12Mar 30, 2023Updated 3 years ago
- Zurich Morphological Lexicon for German: a tool to extract a morphological lexicon from Wiktionary☆12Aug 10, 2023Updated 2 years ago
- Implementation of the "Deep Hierarchical Representation of Point Cloud Videos via Spatio-Temporal Decomposition" paper.☆21Apr 13, 2021Updated 5 years ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Aug 12, 2023Updated 2 years ago
- ☆15Sep 10, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code for "What really matters in matrix-whitening optimizers?"☆24Oct 31, 2025Updated 7 months ago
- Python GUI for differential forms☆13Oct 14, 2023Updated 2 years ago
- ☆19Dec 12, 2023Updated 2 years ago
- ☆30Feb 7, 2025Updated last year
- Good Subnetworks Provably Exist: Pruning via Greedy Forward Selection☆23Jan 21, 2021Updated 5 years ago
- http://cdr.eurolisp.org/document/2/☆17Sep 14, 2025Updated 8 months ago
- Create a network of function dependencies between R packages☆15Jul 6, 2025Updated 11 months ago