☆26 · May 24, 2023 · Updated 2 years ago
Alternatives and similar repositories for pre-rmsnorm-transformer
Users that are interested in pre-rmsnorm-transformer are comparing it to the libraries listed below.
- ☆15 · Sep 6, 2025 · Updated 6 months ago
- Machine Learning-Enabled Compact Photonic Tensor Core based on Programmable Multi-Operand Multimode Interference ☆13 · Sep 23, 2024 · Updated last year
- Artifact evaluation for HPCA'24 paper Lightening-Transformer: A Dynamically-operated Optically-interconnected Photonic Transformer Accele… ☆11 · Mar 3, 2024 · Updated 2 years ago
- ☆23 · Jun 24, 2024 · Updated last year
- Python-based Electronic-Photonic Integrated System Architecture Modeling and Evaluation Framework (DAC 2025) ☆137 · Jan 18, 2026 · Updated 2 months ago
- ☆25 · Nov 10, 2021 · Updated 4 years ago
- Basics for Machine Learning ☆12 · Apr 26, 2020 · Updated 5 years ago
- A Python wrapper for the PARDISO solver ☆16 · Oct 15, 2025 · Updated 5 months ago
- A simple torch implementation of high-performance Multi-Query Attention ☆16 · Aug 23, 2023 · Updated 2 years ago
- This repository provides open-source code for sparse continuous distributions and corresponding Fenchel-Young losses. ☆15 · May 10, 2023 · Updated 2 years ago
- [ICLRW'26] EoRA: Fine-tuning-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation ☆30 · Updated this week
- Official implementation for Text Generation Beyond Discrete Token Sampling ☆24 · Aug 11, 2025 · Updated 7 months ago
- An Ultra-Long Output Reinforcement Learning Approach ☆23 · Jul 31, 2025 · Updated 7 months ago
- Spatial Spectral Machine Learning ☆14 · Oct 15, 2025 · Updated 5 months ago
- A platform-independent open-source tool for opening OCT images and creating labels on them ☆13 · Sep 27, 2018 · Updated 7 years ago
- High Performance FP8 GEMM Kernels for SM89 and later GPUs. ☆20 · Jan 24, 2025 · Updated last year
- Ph.D. thesis template based on the design guidelines of New York University - Tandon School of Engineering ☆12 · Aug 21, 2018 · Updated 7 years ago
- [ICLR24] Better Neural PDE Solvers Through Data-Free Mesh Movers ☆17 · Mar 20, 2024 · Updated 2 years ago
- ☆15 · Nov 7, 2024 · Updated last year
- Porting Postgres Server to WASM [WIP] ☆16 · Mar 6, 2021 · Updated 5 years ago
- The project for speech translation ☆12 · Sep 28, 2023 · Updated 2 years ago
- This repository contains code for the MicroAdam paper. ☆21 · Dec 14, 2024 · Updated last year
- ☆20 · May 14, 2025 · Updated 10 months ago
- [EMNLP 25] An effective and interpretable weight-editing method for mitigating overly short reasoning in LLMs, and a mechanistic study un… ☆17 · Dec 17, 2025 · Updated 3 months ago
- To mitigate position bias in LLMs, especially in long-context scenarios, we scale only one dimension of LLMs, reducing position bias and … ☆11 · Jun 18, 2024 · Updated last year
- This repository provides a framework to serve LLM (Large Language Model) based applications such as chatbots. ☆18 · Apr 20, 2023 · Updated 2 years ago
- Normalize CJK characters in text ☆14 · Sep 30, 2025 · Updated 6 months ago
- A compact audio-to-phoneme aligner for singing voice ☆12 · Jan 17, 2024 · Updated 2 years ago
- Multiresolution Graph Transformers and Wavelet Positional Encoding for Learning Long-Range and Hierarchical Structures ☆24 · Oct 27, 2023 · Updated 2 years ago
- Evaluation results for Machine Translation within the BigScience project ☆11 · May 15, 2023 · Updated 2 years ago
- MLIR-based partitioning system ☆174 · Updated this week
- ☆11 · Nov 2, 2024 · Updated last year
- RIBES is an automatic evaluation metric for machine translation. ☆11 · Sep 7, 2017 · Updated 8 years ago
- ☆12 · Sep 25, 2024 · Updated last year
- Dynamic YouTube graphs ☆27 · Dec 1, 2019 · Updated 6 years ago
- The official evaluation suite and dynamic data release for MixEval. ☆11 · Sep 23, 2024 · Updated last year
- A PyTorch Library for Photonic AI Computing Model Training and Co-Design (NeurIPS'21) ☆317 · Jan 13, 2026 · Updated 2 months ago
- RWKV v5, v6 LoRA Trainer on CUDA and ROCm platforms. RWKV is an RNN with transformer-level LLM performance. It can be directly trained like … ☆13 · Mar 24, 2024 · Updated 2 years ago
- [NeurIPS 2025] Implementation of the paper "On Reasoning Strength Planning in Large Reasoning Models" ☆32 · Jul 6, 2025 · Updated 8 months ago