☆26May 24, 2023Updated 2 years ago
Alternatives and similar repositories for pre-rmsnorm-transformer
Users that are interested in pre-rmsnorm-transformer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆15Sep 6, 2025Updated 8 months ago
- ☆13Aug 23, 2024Updated last year
- Artifact evaluation for HPCA'24 paper Lightening-Transformer: A Dynamically-operated Optically-interconnected Photonic Transformer Accele…☆11Mar 3, 2024Updated 2 years ago
- ☆23Jun 24, 2024Updated last year
- Testing DeepSpeed integration in 🤗 Accelerate☆11Jun 28, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆25Nov 10, 2021Updated 4 years ago
- Efficient kernel for RMS normalization with fused operations, includes both forward and backward passes, compatibility with PyTorch.☆13Jun 5, 2024Updated last year
- This is a simple torch implementation of the high performance Multi-Query Attention☆16Aug 23, 2023Updated 2 years ago
- ☆12Mar 7, 2022Updated 4 years ago
- Silicon Photonics measurement data on manufacturing variability☆16Jul 28, 2020Updated 5 years ago
- Lightening-Transformer: A Dynamically-operated Optically-interconnected Photonic Transformer Accelerator, HPCA'24☆41Feb 5, 2025Updated last year
- 基於 “正方软件股份有限公司” 的教務管理平台提供驗證碼 識別服務(公開版)☆10Oct 4, 2018Updated 7 years ago
- This repository provides open-source code for sparse continuous distributions and corresponding Fenchel-Young losses.☆15May 10, 2023Updated 3 years ago
- An Ultra-Long Output Reinforcement Learning Approach☆23Jul 31, 2025Updated 9 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Spatial Spectral Machine Learning☆14Oct 15, 2025Updated 6 months ago
- High Performance FP8 GEMM Kernels for SM89 and later GPUs.☆21Jan 24, 2025Updated last year
- Ph.D. thesis template based on the design guidelines of New York University - Tandon School of Engineering☆12Aug 21, 2018Updated 7 years ago
- ☆28Jul 18, 2025Updated 9 months ago
- [ICLR24] Better Neural PDE Solvers Through Data-Free Mesh Movers☆17Mar 20, 2024Updated 2 years ago
- [ICML 2021] "Auto-NBA: Efficient and Effective Search Over the Joint Space of Networks, Bitwidths, and Accelerators" by Yonggan Fu, Yonga…☆16Jan 3, 2022Updated 4 years ago
- a platform independent open source tool for open OCT images and create different labels on it☆14Sep 27, 2018Updated 7 years ago
- ☆15Nov 7, 2024Updated last year
- Converts GDSII (IC layout database) files to SVG (Vector graphics) files.☆14Feb 10, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Inducing Point Operator Transformer: A Flexible and Scalable Architecture for Solving PDEs (AAAI 2024)☆15Jul 30, 2024Updated last year
- A conda-smithy repository for nvcc.☆13Jan 23, 2025Updated last year
- Porting Postgres Server to WASM [WIP]☆16Mar 6, 2021Updated 5 years ago
- Ibis is a Hands-Free Interactive Web Page. Using the latest generative AI, it can be Any Page.☆21Oct 30, 2024Updated last year
- ☆34Oct 13, 2025Updated 6 months ago
- [EMNLP 25] An effective and interpretable weight-editing method for mitigating overly short reasoning in LLMs, and a mechanistic study un…☆18Dec 17, 2025Updated 4 months ago
- To mitigate position bias in LLMs, especially in long-context scenarios, we scale only one dimension of LLMs, reducing position bias and …☆11Jun 18, 2024Updated last year
- This repository provides a framework to serve LLM(Large Language Model) based applications such as Chatbot.☆18Apr 20, 2023Updated 3 years ago
- Normalize CJK characters in text☆14Sep 30, 2025Updated 7 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- a compact audio-to-phoneme aligner for singing voice☆12Jan 17, 2024Updated 2 years ago
- Multiresolution Graph Transformers and Wavelet Positional Encoding for Learning Long-Range and Hierarchical Structures☆25Oct 27, 2023Updated 2 years ago
- Evaluation results for Machine Translation within the BigScience project☆11May 15, 2023Updated 2 years ago
- RIBES is an automatic evaluation metric for machine translation.☆13Sep 7, 2017Updated 8 years ago
- Dynamic Youtube graphs☆27Dec 1, 2019Updated 6 years ago
- Official PyTorch implementation of CD-MOE☆12Mar 18, 2026Updated last month
- RWKV v5,v6 LoRA Trainer on Cuda and Rocm Platform. RWKV is a RNN with transformer-level LLM performance. It can be directly trained like …☆13Mar 24, 2024Updated 2 years ago