facebookresearch / NasRec
NASRec Weight Sharing Neural Architecture Search for Recommender Systems
☆29Updated last year
Alternatives and similar repositories for NasRec:
Users that are interested in NasRec are comparing it to the libraries listed below
- JORA: JAX Tensor-Parallel LoRA Library (ACL 2024)☆32Updated 9 months ago
- A dashboard for exploring timm learning rate schedulers☆19Updated 2 months ago
- Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012☆49Updated 2 years ago
- [ICML 2024] When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models☆28Updated 8 months ago
- Experimental scripts for researching data adaptive learning rate scheduling.☆23Updated last year
- Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers"☆36Updated last year
- Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8.☆43Updated 7 months ago
- Implementation of a Light Recurrent Unit in Pytorch☆48Updated 4 months ago
- Core Utilities for NVIDIA Merlin☆19Updated 6 months ago
- Triton kernels for Flux☆20Updated last month
- A block oriented training approach for inference time optimization.☆32Updated 6 months ago
- Linear Attention Sequence Parallelism (LASP)☆77Updated 8 months ago
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆21Updated 6 months ago
- The Efficiency Spectrum of LLM☆53Updated last year
- Repository for CPU Kernel Generation for LLM Inference☆25Updated last year
- 32 times longer context window than vanilla Transformers and up to 4 times longer than memory efficient Transformers.☆45Updated last year
- This library empowers users to seamlessly port pretrained models and checkpoints on the HuggingFace (HF) hub (developed using HF transfor…☆59Updated this week
- Official code for "Binary embedding based retrieval at Tencent"☆42Updated 11 months ago
- Learning Compiler Pass Orders using Coreset and Normalized Value Prediction. (ICML 2023)☆18Updated last year
- FBI-LLM: Scaling Up Fully Binarized LLMs from Scratch via Autoregressive Distillation☆47Updated 7 months ago
- ACL 2023☆38Updated last year
- ☆31Updated 8 months ago
- Make triton easier☆44Updated 8 months ago
- [ACL 2024] RelayAttention for Efficient Large Language Model Serving with Long System Prompts☆38Updated 11 months ago
- Repository for Sparse Finetuning of LLMs via modified version of the MosaicML llmfoundry☆40Updated last year
- Model compression for ONNX☆86Updated 3 months ago
- [NeurIPS 2022] DreamShard: Generalizable Embedding Table Placement for Recommender Systems☆29Updated last year
- Open Source Projects from Pallas Lab☆20Updated 3 years ago
- Official PyTorch implementation of "LayerMerge: Neural Network Depth Compression through Layer Pruning and Merging" (ICML'24)☆29Updated 6 months ago