☆26May 24, 2023Updated 3 years ago
Alternatives and similar repositories for pre-rmsnorm-transformer
Users that are interested in pre-rmsnorm-transformer are comparing it to the libraries listed below.
- ☆15Sep 6, 2025Updated 8 months ago
- Machine Learning-Enabled Compact Photonic Tensor Core based on Programmable Multi-Operand Multimode Interference☆14Sep 23, 2024Updated last year
- ☆20Jan 4, 2024Updated 2 years ago
- Artifact evaluation for HPCA'24 paper Lightening-Transformer: A Dynamically-operated Optically-interconnected Photonic Transformer Accele…☆11Mar 3, 2024Updated 2 years ago
- ☆23Jun 24, 2024Updated last year
- Python-based Electronic-Photonic Integrated System Architecture Modeling and Evaluation Framework (DAC 2025)☆127Jan 18, 2026Updated 4 months ago
- Basics for Machine Learning☆12Apr 26, 2020Updated 6 years ago
- PACE: Pacing Operator Learning to Accurate Optical Field Simulation for Complicated Photonic Devices, NeurIPS 2024☆18Dec 13, 2024Updated last year
- Implementation of a compact optical neural network SqueezeLight based on multi-operand micro-rings, DATE 2021☆15Oct 26, 2022Updated 3 years ago
- This is a simple torch implementation of the high performance Multi-Query Attention☆16Aug 23, 2023Updated 2 years ago
- ☆12Mar 7, 2022Updated 4 years ago
- [3D Vision 2024] A new cascade framework named Cas6D for few-shot 6DoF pose estimation that is generalizable and uses only RGB images.☆23Jul 26, 2024Updated last year
- Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity☆22Aug 28, 2025Updated 9 months ago
- This repository provides open-source code for sparse continuous distributions and corresponding Fenchel-Young losses.☆15May 10, 2023Updated 3 years ago
- Official implementation for Text Generation Beyond Discrete Token Sampling☆25Aug 11, 2025Updated 9 months ago
- [NeurIPS'21] "Chasing Sparsity in Vision Transformers: An End-to-End Exploration" by Tianlong Chen, Yu Cheng, Zhe Gan, Lu Yuan, Lei Zhang…☆87Dec 1, 2023Updated 2 years ago
- Automated Large-Scale PIC Routing (accepted at ISPD 2025)☆36Jan 13, 2026Updated 4 months ago
- An Ultra-Long Output Reinforcement Learning Approach☆23Jul 31, 2025Updated 10 months ago
- [ICLRW'26] EoRA: Fine-tuning-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation☆46Apr 21, 2026Updated last month
- High Performance FP8 GEMM Kernels for SM89 and later GPUs.☆21Jan 24, 2025Updated last year
- Python binding for the G'MIC Image Processing Framework☆11Nov 14, 2025Updated 6 months ago
- Deep Learning Homework 1☆20Oct 25, 2018Updated 7 years ago
- ☆28Jul 18, 2025Updated 10 months ago
- A platform-independent, open-source tool for opening OCT images and creating labels on them☆14Sep 27, 2018Updated 7 years ago
- ☆15Nov 7, 2024Updated last year
- Inducing Point Operator Transformer: A Flexible and Scalable Architecture for Solving PDEs (AAAI 2024)☆15Jul 30, 2024Updated last year
- A conda-smithy repository for nvcc.☆13Jan 23, 2025Updated last year
- Official PyTorch implementation of "StegaNeRF: Embedding Invisible Information within Neural Radiance Fields", ICCV 2023☆47Nov 23, 2024Updated last year
- The project for speech translation☆12Sep 28, 2023Updated 2 years ago
- ☆10Apr 22, 2022Updated 4 years ago
- Github Repo for OATS: Outlier-Aware Pruning through Sparse and Low Rank Decomposition☆20Apr 16, 2025Updated last year
- ☆20May 14, 2025Updated last year
- [EMNLP 25] An effective and interpretable weight-editing method for mitigating overly short reasoning in LLMs, and a mechanistic study un…☆18Dec 17, 2025Updated 5 months ago
- To mitigate position bias in LLMs, especially in long-context scenarios, we scale only one dimension of LLMs, reducing position bias and …☆11Jun 18, 2024Updated last year
- This repository provides a framework to serve LLM (Large Language Model) based applications such as chatbots.☆18Apr 20, 2023Updated 3 years ago
- Normalize CJK characters in text☆14Sep 30, 2025Updated 8 months ago
- a compact audio-to-phoneme aligner for singing voice☆12Jan 17, 2024Updated 2 years ago
- Evaluation results for Machine Translation within the BigScience project☆11May 15, 2023Updated 3 years ago
- ☆11Nov 2, 2024Updated last year