Efficient kernel for RMS normalization with fused operations, includes both forward and backward passes, compatibility with PyTorch.
☆12Jun 5, 2024Updated last year
Alternatives and similar repositories for Tri-RMSNorm
Users that are interested in Tri-RMSNorm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Jax implementation of x-LSTM: Extended Long Short-Term Memory by Beck et al. (2024)☆17Aug 6, 2024Updated last year
- Metric Learning Library for Keras☆10Apr 24, 2019Updated 6 years ago
- 🚀 C header-only formattable assert macros library☆11Nov 5, 2021Updated 4 years ago
- A PyTorch native platform for training generative AI models☆16Nov 18, 2025Updated 4 months ago
- A novel variant of sliced Wasserstein based on a new slicing technique that utilizes the convolution operator.☆12Jan 14, 2023Updated 3 years ago
- Port of the fantastic Iconoir Icon Pack to Rust embedded devices, with a focus on speed, usability, and completeness.☆16Jan 15, 2024Updated 2 years ago
- Reference implementation of the BLZZRD variant of the BLISS Ring-LWE Signature Scheme☆17Oct 11, 2016Updated 9 years ago
- ☆10Jun 21, 2021Updated 4 years ago
- ☆26May 24, 2023Updated 2 years ago
- ☆20Jun 13, 2025Updated 9 months ago
- Rust derive macros for automating the boring stuff.☆14Aug 3, 2025Updated 7 months ago
- ☆17Jul 25, 2023Updated 2 years ago
- A small header-only C++17 metaprogramming library☆21Aug 10, 2021Updated 4 years ago
- multi-master-paxos with 3 nodes☆14Apr 11, 2022Updated 3 years ago
- Bleeding edge low level Rust binding for GGML☆16Jun 26, 2024Updated last year
- [NeurIPS 2024] BLAST: Block Level Adaptive Structured Matrix for Efficient Deep Neural Network Inference☆17Nov 6, 2024Updated last year
- ☆15Oct 31, 2023Updated 2 years ago
- ☆11Dec 26, 2018Updated 7 years ago
- Implementation of Sparse Regression Codes (SPARCs)/Sparse Superposition Codes for communications over the AWGN channel.☆13Nov 23, 2021Updated 4 years ago
- Developing, training, and assessing the performance of a Proximal Policy Optimization (PPO) Stock Trading Agent.☆14Aug 20, 2025Updated 7 months ago
- Almost SOTA LLM architecture, with O(n) time complexity☆11Jan 19, 2025Updated last year
- Pandoc filter to execute python code blocks from a markdown and place print output and figures to a converted markdown file☆22Jun 21, 2022Updated 3 years ago
- Life before `main()`☆19Feb 2, 2021Updated 5 years ago
- Parsing and serialization support for PSSH boxes used in DRM systems☆15Mar 7, 2026Updated 2 weeks ago
- Bounded Balance Trees for Nim☆17Jun 10, 2019Updated 6 years ago
- Compressible Dynamics in Deep Overparameterized Low-Rank Learning & Adaptation (ICML'24 Oral)☆12Jul 22, 2024Updated last year
- ☆19Aug 24, 2022Updated 3 years ago
- Log-structured merge-tree implementation in Rust☆19Nov 6, 2018Updated 7 years ago
- Unofficial fork. No active support.☆10Jun 22, 2016Updated 9 years ago
- RE2 is a fast, safe, thread-friendly alternative to backtracking regular expression engines like those used in PCRE, Perl, and Python. It…☆10Nov 30, 2021Updated 4 years ago
- Attention-based end-to-end ASR on TIMIT in PyTorch☆18Nov 9, 2021Updated 4 years ago
- CasHMC: A Cycle-accurate Simulator for Hybrid Memory Cube☆23Aug 10, 2018Updated 7 years ago
- Official Repository for Efficient Linear-Time Attention Transformers.☆18Jun 2, 2024Updated last year
- ☆30Mar 13, 2026Updated last week
- An official code for "MIMO is all you need"☆22Jan 24, 2024Updated 2 years ago
- ☆10Mar 29, 2022Updated 3 years ago
- WheelNext Website☆50Dec 19, 2025Updated 3 months ago
- Implementation of the dilated self attention as described in "LongNet: Scaling Transformers to 1,000,000,000 Tokens"☆13Jul 23, 2023Updated 2 years ago
- Reproducing RigL (ICML 2020) as a part of ML Reproducibility Challenge 2020☆29Jan 6, 2022Updated 4 years ago