PyTorch interface for TrueGrad Optimizers
☆43Aug 8, 2023Updated 2 years ago
Alternatives and similar repositories for TrueGrad
Users that are interested in TrueGrad are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Automatically take good care of your preemptible TPUs☆37May 15, 2023Updated 3 years ago
- ☆18Aug 24, 2024Updated last year
- Test pytorch code with minimal computational overhead☆26Jun 8, 2023Updated 2 years ago
- ☆33Nov 4, 2024Updated last year
- Focused on fast experimentation and simplicity☆80Dec 24, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- High performance pytorch modules☆18Jan 14, 2023Updated 3 years ago
- Parallel Associative Scan for Language Models☆18Jan 8, 2024Updated 2 years ago
- ☆13May 4, 2026Updated 2 weeks ago
- ARC Community Project☆22Aug 2, 2024Updated last year
- Doing style transfer with linguistic features using OpenAI's CLIP.☆14May 4, 2021Updated 5 years ago
- ☆19Dec 4, 2025Updated 5 months ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Jun 21, 2023Updated 2 years ago
- AGaLiTe: Approximate Gated Linear Transformers for Online Reinforcement Learning (Published in TMLR)☆23Oct 15, 2024Updated last year
- research impl of Native Sparse Attention (2502.11089)☆63Feb 19, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Implementation of a simple BPE tokenizer, but in Nim☆22Jul 2, 2023Updated 2 years ago
- AutoInit: Analytic Signal-Preserving Weight Initialization for Neural Networks☆29Oct 26, 2022Updated 3 years ago
- Rust bindings for CTranslate2☆14Jun 21, 2023Updated 2 years ago
- JAX implementation of Learning to learn by gradient descent by gradient descent☆29Aug 5, 2025Updated 9 months ago
- Code accompanying the paper "A Language Model's Guide Through Latent Space". It contains functionality for training and using concept vec…☆21Feb 23, 2024Updated 2 years ago
- ☆70Apr 14, 2026Updated last month
- This tool displays tflite signatures and rewrites the input/output OP name to the name of the signature. There is no need to install Tens…☆14Dec 13, 2023Updated 2 years ago
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆19Jul 24, 2025Updated 9 months ago
- Real-Time RTUs☆12Mar 20, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- LoRA for arbitrary JAX models and functions☆144Feb 26, 2024Updated 2 years ago
- Maximal Update Parametrization (μP) with Flax & Optax.☆16Dec 27, 2023Updated 2 years ago
- Implementation for ACProp ( Momentum centering and asynchronous update for adaptive gradient methdos, NeurIPS 2021)☆17Oct 11, 2021Updated 4 years ago
- Jaxplorer is a Jax reinforcement learning (RL) framework for exploring new ideas.☆13Jul 19, 2024Updated last year
- A case study of efficient training of large language models using commodity hardware.☆67Aug 4, 2022Updated 3 years ago
- ☆13Nov 27, 2025Updated 5 months ago
- minGPT in JAX☆49Jan 10, 2022Updated 4 years ago
- Continual Resilient (CoRe) Optimizer for PyTorch☆12Jun 10, 2024Updated last year
- ☆10Apr 5, 2024Updated 2 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Implementation of Diffusion Transformers and Rectified Flow in Jax☆27Jul 9, 2024Updated last year
- Bring ROS to any Linux distributions.☆15Jun 12, 2019Updated 6 years ago
- Awesome Triton Resources☆41Apr 27, 2025Updated last year
- Training hybrid models for dummies.☆30Nov 1, 2025Updated 6 months ago
- Another attempt at a long-context / efficient transformer by me☆38Apr 11, 2022Updated 4 years ago
- WIP☆95Aug 13, 2024Updated last year
- Image2StyleGAN and Image2StyleGAN++ implementation☆28Jul 15, 2021Updated 4 years ago