AdamW optimizer for bfloat16 models in pytorch 🔥.
★40 · Jun 16, 2024 · Updated last year
Alternatives and similar repositories for adamw_bfloat16
Users that are interested in adamw_bfloat16 are comparing it to the libraries listed below.
- ★12 · Apr 26, 2024 · Updated 2 years ago
- Just another FastSpeech 2 but cleaner code :) ★29 · Jun 28, 2024 · Updated last year
- Sentence segmentation with wtpsplit's state-of-the-art Segment any Text (SaT) models ★39 · May 2, 2026 · Updated 3 weeks ago
- Proxy server for quota, usage monitoring and tracking of OpenAI requests ★16 · Sep 21, 2023 · Updated 2 years ago
- ★16 · Dec 30, 2024 · Updated last year
- 4G GPU & 10 Minutes for train ★12 · Aug 9, 2023 · Updated 2 years ago
- ★18 · Aug 24, 2024 · Updated last year
- Notes on some important deep learning topics and paper summaries ★13 · Dec 16, 2020 · Updated 5 years ago
- Exercises Galois theory D. Cox ★13 · Jun 29, 2023 · Updated 2 years ago
- zero-vocab or low-vocab embeddings ★18 · Jul 17, 2022 · Updated 3 years ago
- [Poster; ICLR 2026] [Oral; Neurips OPT2024] μLO: Compute-Efficient Meta-Generalization of Learned Optimizers ★16 · Apr 15, 2026 · Updated last month
- A fusion of a linear layer and a cross entropy loss, written for pytorch in triton. ★75 · Aug 2, 2024 · Updated last year
- BFloat16 Fused Adam Operator for PyTorch ★19 · Nov 16, 2024 · Updated last year
- Simple Tensorflow implementation of "SDIT: Scalable and Diverse Cross-domain Image Translation" (ACM-MM 2019) ★16 · Oct 14, 2019 · Updated 6 years ago
- A spoken version of the textual story cloze benchmark ★22 · Aug 6, 2023 · Updated 2 years ago
- Code for the note "NF4 Isn't Information Theoretically Optimal (and that's Good)" ★21 · Jun 22, 2023 · Updated 2 years ago
- ★17 · Dec 12, 2023 · Updated 2 years ago
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/… ★29 · Apr 17, 2024 · Updated 2 years ago
- ★19 · Feb 2, 2023 · Updated 3 years ago
- Unsupervised Rhythm Modeling for Voice Conversion ★85 · Aug 3, 2023 · Updated 2 years ago
- Explore semantic caching to reduce your OpenAI/LLM API bill ★11 · Jul 21, 2023 · Updated 2 years ago
- Utilities for PyTorch distributed ★25 · Feb 27, 2025 · Updated last year
- A Chinese version of A Neural Parametric Singing Synthesizer ★13 · Feb 12, 2022 · Updated 4 years ago
- The fastest Tropical number matrix multiplication on GPU ★10 · Aug 23, 2025 · Updated 9 months ago
- A Prompt Expander OpenAI-Based. ★14 · Nov 15, 2023 · Updated 2 years ago
- ★124 · May 28, 2024 · Updated 2 years ago
- ★14 · Jun 9, 2023 · Updated 2 years ago
- Calculating Expected Time for training LLM. ★39 · Apr 17, 2023 · Updated 3 years ago
- Generic build server ★65 · May 25, 2014 · Updated 12 years ago
- Audio Demo for "FastSVC: Fast Cross-Domain Singing Voice Conversion with Feature-wise Linear Modulation" ★21 · Apr 7, 2021 · Updated 5 years ago
- High performance pytorch modules ★18 · Jan 14, 2023 · Updated 3 years ago
- Comparing sequential forecasters via confidence sequences & e-processes ★10 · Oct 24, 2023 · Updated 2 years ago
- A Jax/Flax implementation for Korean-language LLM inference on TPU. ★12 · Jun 12, 2023 · Updated 2 years ago
- ★69 · Mar 21, 2025 · Updated last year
- Landing Page for Divide and Remaster v3 ★26 · Jul 29, 2025 · Updated 10 months ago
- A toolkit for finding and analysing the grammars of emergent languages. ★11 · Nov 16, 2020 · Updated 5 years ago
- Instruction Following Eval ★17 · Jan 16, 2025 · Updated last year
- Please visit https://github.com/HKUSTDial/NL2SQL360 to get the official code! ★10 · Sep 1, 2024 · Updated last year
- A sample app to debug and validate cellular modems on balena devices ★13 · Jun 5, 2019 · Updated 6 years ago