facebookresearch / NasRecLinks
NASRec Weight Sharing Neural Architecture Search for Recommender Systems
☆31Updated 2 years ago
Alternatives and similar repositories for NasRec
Users that are interested in NasRec are comparing it to the libraries listed below
Sorting:
- Experimental scripts for researching data adaptive learning rate scheduling.☆22Updated 2 years ago
- Official code for "Binary embedding based retrieval at Tencent"☆44Updated last year
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆23Updated last year
- A dashboard for exploring timm learning rate schedulers☆19Updated last year
- ☆34Updated 5 months ago
- Model compression for ONNX☆99Updated last year
- Official repository for the paper "SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention"☆102Updated last year
- Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.☆12Updated last year
- Utilities for Training Very Large Models☆58Updated last year
- Official implementation of "Active Image Indexing"☆59Updated 2 years ago
- The Triton backend for the PyTorch TorchScript models.☆165Updated last week
- ResiDual: Transformer with Dual Residual Connections, https://arxiv.org/abs/2304.14802☆96Updated 2 years ago
- Implementation of a Light Recurrent Unit in Pytorch☆49Updated last year
- Implementation of TableFormer, Robust Transformer Modeling for Table-Text Encoding, in Pytorch☆39Updated 3 years ago
- Large Scale Distributed Model Training strategy with Colossal AI and Lightning AI☆56Updated 2 years ago
- PostText is a QA system for querying your text data. When appropriate structured views are in place, PostText is good at answering querie…☆31Updated 2 years ago
- 32 times longer context window than vanilla Transformers and up to 4 times longer than memory efficient Transformers.☆49Updated 2 years ago
- Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012☆49Updated 3 years ago
- imagetokenizer is a python package, helps you encoder visuals and generate visuals token ids from codebook, supports both image and video…☆37Updated last year
- Some personal experiments around routing tokens to different autoregressive attention, akin to mixture-of-experts☆119Updated last year
- Exploration into the Firefly algorithm in Pytorch☆41Updated 9 months ago
- Exploration into the proposed "Self Reasoning Tokens" by Felipe Bonetto☆57Updated last year
- Implementation of Infini-Transformer in Pytorch☆113Updated 10 months ago
- Implementation of the Kalman Filtering Attention proposed in "Kalman Filtering Attention for User Behavior Modeling in CTR Prediction"☆59Updated 2 years ago
- This code repository contains the code used for my "Optimizing Memory Usage for Training LLMs and Vision Transformers in PyTorch" blog po…☆92Updated 2 years ago
- Timm model explorer☆42Updated last year
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Updated 2 years ago
- DPO, but faster 🚀☆46Updated 11 months ago
- Linear Attention Sequence Parallelism (LASP)☆87Updated last year
- Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO…☆57Updated this week