JAX implementation of "Fine-Tuning Language Models with Just Forward Passes"
☆19Jun 10, 2023Updated 2 years ago
Alternatives and similar repositories for mezo-jax
Users that are interested in mezo-jax are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- MoodCat😼 classifies the mood of English sentences.☆14Jun 19, 2022Updated 3 years ago
- ☆10Jul 6, 2021Updated 4 years ago
- Optimizing Hyperparameters with Conformal Quantile Regression☆11May 22, 2023Updated 2 years ago
- Source code for "Taming GANs with Lookahead–Minmax", ICLR 2021.☆15Mar 28, 2021Updated 5 years ago
- ☆32Aug 28, 2020Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆35Jul 5, 2023Updated 2 years ago
- Greedy Bayesian Posterior Approximation with Deep Ensembles. A. Tiulpin and M. B. Blaschko. (2021)☆11Jul 18, 2022Updated 3 years ago
- Optimization algorithm which fits a ResNet to CIFAR-10 5x faster than SGD / Adam (with terrible generalization)☆14Oct 20, 2023Updated 2 years ago
- ☆19Jun 3, 2023Updated 2 years ago
- This is the official implementation of the ICML 2023 paper - Can Forward Gradient Match Backpropagation ?☆13May 31, 2023Updated 2 years ago
- Experiments with Super-Universal Newton method.☆13Aug 12, 2022Updated 3 years ago
- Code for the paper "Bias-Reduced Uncertainty Estimation for Deep Neural Classifiers" published in ICLR 2019☆13Apr 25, 2019Updated 7 years ago
- Code for LaMPP: Language Models as Probabilistic Priors for Perception and Action☆37Apr 3, 2023Updated 3 years ago
- Python implementation of DPP sampling☆14Nov 4, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- JAX bindings for the flash-attention3 kernels☆22Jan 2, 2026Updated 3 months ago
- A domain-specific probabilistic programming language for modeling and inference with language models☆142Apr 29, 2025Updated last year
- ☆13Updated this week
- A framework for Bayesian optimization of composite functions.☆15Dec 8, 2022Updated 3 years ago
- ☆12Apr 26, 2022Updated 4 years ago
- PyTorch implementation of LARS (Layer-wise Adaptive Rate Scaling)☆19May 11, 2019Updated 6 years ago
- Python package to sample from determinantal point processes☆18Jul 20, 2015Updated 10 years ago
- Tools for JAX☆51Apr 16, 2026Updated last week
- Hessian backpropagation (HBP): PyTorch extension of backpropagation for block-diagonal curvature matrix approximations☆22Mar 25, 2023Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- An implementation of the Semantic Style Transfer in PyTorch. Original paper: https://arxiv.org/abs/1603.01768.☆15Oct 7, 2018Updated 7 years ago
- mu4e thread folding☆12Mar 30, 2023Updated 3 years ago
- Zurich Morphological Lexicon for German: a tool to extract a morphological lexicon from Wiktionary☆12Aug 10, 2023Updated 2 years ago
- ☆15Sep 10, 2023Updated 2 years ago
- Code for "What really matters in matrix-whitening optimizers?"☆23Oct 31, 2025Updated 5 months ago
- ☆19Dec 12, 2023Updated 2 years ago
- JAX implementation of the T5 model: Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer☆24Jun 10, 2023Updated 2 years ago
- A passion project on my favorite e-commerce site that scrapes product data and builds a recommendation engine☆10May 2, 2023Updated 2 years ago
- Good Subnetworks Provably Exist: Pruning via Greedy Forward Selection☆22Jan 21, 2021Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- http://cdr.eurolisp.org/document/2/☆17Sep 14, 2025Updated 7 months ago
- Create a network of function dependencies between R packages☆14Jul 6, 2025Updated 9 months ago
- An implementation of the penalty-based bilevel gradient descent (PBGD) algorithm and the iterative differentiation (ITD/RHG) methods.☆19Feb 13, 2023Updated 3 years ago
- Examples of Prompt Engineering, Zero Shot Learning, Few Shot Learning and Retrieval Augmented Generation (RAG) using Hugging Face, Databr…☆16Sep 21, 2023Updated 2 years ago
- Comparison between GFlowNets & Maximum Entropy RL☆19Feb 19, 2024Updated 2 years ago
- ☆10Mar 13, 2023Updated 3 years ago
- Python library for classifier calibration☆20May 3, 2024Updated last year