graphcore-research / jax-scalifyView external linksLinks
JAX Scalify: end-to-end scaled arithmetics
☆18Oct 30, 2024Updated last year
Alternatives and similar repositories for jax-scalify
Users that are interested in jax-scalify are comparing it to the libraries listed below
Sorting:
- nanoGPT using Equinox☆15Mar 3, 2023Updated 2 years ago
- Schedule free optimiser implemented in JAX using Optimistix☆15May 29, 2024Updated last year
- Official repo of dataset-decomposition paper [NeurIPS 2024]☆21Jan 8, 2025Updated last year
- A scalable implementation of diffusion and flow-matching with XGBoost models, applied to calorimeter data.☆19Nov 3, 2024Updated last year
- ☆18Aug 24, 2024Updated last year
- Repo for the paper: PerAda: Parameter-Efficient Federated Learning Personalization with Generalization Guarantees (CVPR 2024)☆23Aug 14, 2024Updated last year
- Code for the paper "Function-Space Learning Rates"☆25Jun 3, 2025Updated 8 months ago
- A port of muP to JAX/Haiku☆25Oct 23, 2022Updated 3 years ago
- ☆23Jun 18, 2024Updated last year
- ☆30Jul 5, 2023Updated 2 years ago
- Official repository of Sparse ISO-FLOP Transformations for Maximizing Training Efficiency☆25Jul 31, 2024Updated last year
- Checkpointable dataset utilities for foundation model training☆32Jan 29, 2024Updated 2 years ago
- A collection of niche / personally useful PyTorch optimizers with modified code.☆27Oct 25, 2025Updated 3 months ago
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods.☆32Jun 5, 2025Updated 8 months ago
- [ICLR 2025] SDTT: a simple and effective distillation method for discrete diffusion models☆47Sep 10, 2025Updated 5 months ago
- ☆31Jan 23, 2026Updated 3 weeks ago
- // clone this repo with --depth=1 to save disk size // toolchain compatible with Ubuntu 20.04+ //☆15Apr 28, 2022Updated 3 years ago
- [EMNLP'24] LongHeads: Multi-Head Attention is Secretly a Long Context Processor☆31Apr 8, 2024Updated last year
- Use Grounding DINO, Segment Anything, and CLIP to label objects in images.☆35Dec 27, 2023Updated 2 years ago
- Official implementation of ViewFusion: Learning Composable Diffusion Models for Novel View Synthesis☆36May 30, 2025Updated 8 months ago
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…☆35Aug 15, 2023Updated 2 years ago
- Towards Systematic Measurement for Long Text Quality☆37Sep 5, 2024Updated last year
- Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"☆89Oct 30, 2024Updated last year
- ☆34May 14, 2025Updated 9 months ago
- Repository of IPBench☆19Jan 4, 2026Updated last month
- A python algorithm to change the pitch of the voice in real time☆13Dec 13, 2020Updated 5 years ago
- Pure Java Llama2 inference with optional multi-GPU CUDA implementation☆13Sep 2, 2023Updated 2 years ago
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model☆42Aug 4, 2024Updated last year
- An awesome list that curates the best Flet tools, tutorials, blogs and more.☆10Jan 8, 2023Updated 3 years ago
- GBM implementation on Legate☆14Jan 28, 2026Updated 2 weeks ago
- (READ ONLY MIRROR) The ProB Model Checker and Animator Plugin for Rodin☆19Jan 24, 2026Updated 2 weeks ago
- ☆16Jul 23, 2023Updated 2 years ago
- CVPR 2023: PAniC-3D, Vtubers dataset downloader☆13Apr 22, 2023Updated 2 years ago
- ☆11Jul 17, 2023Updated 2 years ago
- A fine-mapping method integrating GWAS summary statistics and functional annotation data☆11Dec 28, 2023Updated 2 years ago
- [NeurIPS 2025] Official code for "Tropical Attention: Neural Algorithmic Reasoning for Combinatorial Algorithms"☆23Oct 23, 2025Updated 3 months ago
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"☆103Dec 22, 2024Updated last year
- The source code and the data for ACL 2022 paper "Show Me More Details: Discovering Hierarchies of Procedures from Semi-structured Web Dat…☆14Apr 21, 2023Updated 2 years ago