AdamW optimizer for bfloat16 models in pytorch 🔥.
☆40 · Jun 16, 2024 · Updated 2 years ago
Alternatives and similar repositories for adamw_bfloat16
Users that are interested in adamw_bfloat16 are comparing it to the libraries listed below.
- ☆12 · Apr 26, 2024 · Updated 2 years ago
- Proxy server for quota, usage monitoring and tracking of OpenAI requests ☆16 · Sep 21, 2023 · Updated 2 years ago
- ☆16 · Dec 30, 2024 · Updated last year
- 4G GPU & 10 Minutes for train ☆12 · Aug 9, 2023 · Updated 2 years ago
- ☆18 · Aug 24, 2024 · Updated last year
- Exercises Galois theory D. Cox ☆13 · Jun 29, 2023 · Updated 2 years ago
- zero-vocab or low-vocab embeddings ☆18 · Jul 17, 2022 · Updated 3 years ago
- [Poster; ICLR 2026] [Oral; Neurips OPT2024] μLO: Compute-Efficient Meta-Generalization of Learned Optimizers ☆16 · Apr 15, 2026 · Updated 2 months ago
- A fusion of a linear layer and a cross entropy loss, written for pytorch in triton. ☆75 · Aug 2, 2024 · Updated last year
- Code associated to papers on superposition (in ML interpretability) ☆40 · Sep 13, 2022 · Updated 3 years ago
- Simple Tensorflow implementation of "SDIT: Scalable and Diverse Cross-domain Image Translation" (ACM-MM 2019) ☆16 · Oct 14, 2019 · Updated 6 years ago
- A spoken version of the textual story cloze benchmark ☆22 · Aug 6, 2023 · Updated 2 years ago
- Update: Ignore this repo, check out @lucidrains' implementation https://github.com/lucidrains/musiclm-pytorch ☆15 · Jan 27, 2023 · Updated 3 years ago
- Code for the note "NF4 Isn't Information Theoretically Optimal (and that's Good)" ☆21 · Jun 22, 2023 · Updated 2 years ago
- ☆14 · Apr 7, 2022 · Updated 4 years ago
- LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM ☆18 · May 17, 2024 · Updated 2 years ago
- Variational Autoencoder with non-euclidean (hyperbolic) latent space ☆13 · Nov 25, 2022 · Updated 3 years ago
- ☆19 · Feb 2, 2023 · Updated 3 years ago
- Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8. ☆46 · Jul 17, 2024 · Updated last year
- A repo based on XiLin Li's PSGD repo that extends some of the experiments. ☆14 · Oct 7, 2024 · Updated last year
- Explore semantic caching to reduce your OpenAI/LLM API bill ☆11 · Jul 21, 2023 · Updated 2 years ago
- Utilities for PyTorch distributed ☆25 · Feb 27, 2025 · Updated last year
- A Chinese version of A Neural Parametric Singing Synthesizer ☆13 · Feb 12, 2022 · Updated 4 years ago
- Optimized Utilities for PyTorch ☆26 · Jan 19, 2020 · Updated 6 years ago
- A Prompt Expander OpenAI-Based. ☆14 · Nov 15, 2023 · Updated 2 years ago
- ☆125 · May 28, 2024 · Updated 2 years ago
- hllama is a library which aims to provide a set of utility tools for large language models. ☆10 · Apr 16, 2024 · Updated 2 years ago
- Calculating Expected Time for training LLM. ☆39 · Apr 17, 2023 · Updated 3 years ago
- Generic build server ☆65 · May 25, 2014 · Updated 12 years ago
- Audio Demo for "FastSVC: Fast Cross-Domain Singing Voice Conversion with Feature-wise Linear Modulation" ☆21 · Apr 7, 2021 · Updated 5 years ago
- Detect nearby AirTags in disconnected or lost modes. ☆10 · Feb 3, 2022 · Updated 4 years ago
- JAX translation of boltz ☆28 · Aug 4, 2025 · Updated 10 months ago
- High performance pytorch modules ☆18 · Jan 14, 2023 · Updated 3 years ago
- Garbage collector implementation in Rust for Rust ☆13 · Aug 30, 2020 · Updated 5 years ago
- source code of EfficientTTS 2 ☆21 · Feb 18, 2024 · Updated 2 years ago
- A Jax/Flax implementation for Korean-language LLM inference on TPU. ☆12 · Jun 12, 2023 · Updated 3 years ago
- ☆69 · Mar 21, 2025 · Updated last year
- Landing Page for Divide and Remaster v3 ☆26 · Jul 29, 2025 · Updated 10 months ago
- A toolkit for finding and analysing the grammars of emergent languages. ☆11 · Nov 16, 2020 · Updated 5 years ago