☆26May 24, 2023Updated 2 years ago
Alternatives and similar repositories for pre-rmsnorm-transformer
Users that are interested in pre-rmsnorm-transformer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆15Sep 6, 2025Updated 7 months ago
- A Neural Operator-based Integrated Photonic Device Simulation Framework, NeurOLight NeurIPS 2022☆55Sep 14, 2023Updated 2 years ago
- ☆20Jan 4, 2024Updated 2 years ago
- Artifact evaluation for HPCA'24 paper Lightening-Transformer: A Dynamically-operated Optically-interconnected Photonic Transformer Accele…☆11Mar 3, 2024Updated 2 years ago
- Python-based Electronic-Photonic Integrated System Architecture Modeling and Evaluation Framework (DAC 2025)☆138Jan 18, 2026Updated 3 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆25Nov 10, 2021Updated 4 years ago
- Basics for Machine Learning☆12Apr 26, 2020Updated 5 years ago
- PACE: Pacing Operator Learning to Accurate Optical Field Simulation for Complicated Photonic Devices, NeurIPs 2024☆17Dec 13, 2024Updated last year
- Silicon Photonics measurement data on manufacturing variability☆16Jul 28, 2020Updated 5 years ago
- Lightening-Transformer: A Dynamically-operated Optically-interconnected Photonic Transformer Accelerator, HPCA'24☆41Feb 5, 2025Updated last year
- Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity☆22Aug 28, 2025Updated 7 months ago
- This repository provides open-source code for sparse continuous distributions and corresponding Fenchel-Young losses.☆15May 10, 2023Updated 2 years ago
- [ICLRW'26] EoRA: Fine-tuning-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation☆33Mar 24, 2026Updated 3 weeks ago
- Official implementation for Text Generation Beyond Discrete Token Sampling☆25Aug 11, 2025Updated 8 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Automated Large-Scale PIC Routing (accepted at ISPD 2025)☆36Jan 13, 2026Updated 3 months ago
- An Ultra-Long Output Reinforcement Learning Approach☆23Jul 31, 2025Updated 8 months ago
- Spatial Spectral Machine Learning☆14Oct 15, 2025Updated 6 months ago
- ☆13Apr 1, 2026Updated 2 weeks ago
- High Performance FP8 GEMM Kernels for SM89 and later GPUs.☆21Jan 24, 2025Updated last year
- Python binding for the G'MIC Image Processing Framework☆11Nov 14, 2025Updated 5 months ago
- ☆27Jul 18, 2025Updated 9 months ago
- [ICLR24] Better Neural PDE Solvers Through Data-Free Mesh Movers☆17Mar 20, 2024Updated 2 years ago
- A conda-smithy repository for nvcc.☆13Jan 23, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Github Repo for OATS: Outlier-Aware Pruning through Sparse and Low Rank Decomposition☆20Apr 16, 2025Updated last year
- Porting Postgres Server to WASM [WIP]☆16Mar 6, 2021Updated 5 years ago
- The project for speech translation☆12Sep 28, 2023Updated 2 years ago
- ☆10Apr 22, 2022Updated 3 years ago
- This repository contains code for the MicroAdam paper.☆21Dec 14, 2024Updated last year
- ☆20May 14, 2025Updated 11 months ago
- [EMNLP 25] An effective and interpretable weight-editing method for mitigating overly short reasoning in LLMs, and a mechanistic study un…☆18Dec 17, 2025Updated 4 months ago
- To mitigate position bias in LLMs, especially in long-context scenarios, we scale only one dimension of LLMs, reducing position bias and …☆11Jun 18, 2024Updated last year
- This repository provides a framework to serve LLM(Large Language Model) based applications such as Chatbot.☆18Apr 20, 2023Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Normalize CJK characters in text☆14Sep 30, 2025Updated 6 months ago
- a compact audio-to-phoneme aligner for singing voice☆12Jan 17, 2024Updated 2 years ago
- Evaluation results for Machine Translation within the BigScience project☆11May 15, 2023Updated 2 years ago
- QLoRA for Masked Language Modeling☆23Sep 11, 2023Updated 2 years ago
- Official PyTorch implementation of CD-MOE☆12Mar 18, 2026Updated last month
- [NeurIPS 2025] The implementation of paper "On Reasoning Strength Planning in Large Reasoning Models"☆32Jul 6, 2025Updated 9 months ago
- RWKV v5,v6 LoRA Trainer on Cuda and Rocm Platform. RWKV is a RNN with transformer-level LLM performance. It can be directly trained like …☆13Mar 24, 2024Updated 2 years ago