microsoft / TextNAS
This is the implementation of the TextNAS algorithm proposed in the paper TextNAS: A Neural Architecture Search Space tailored for Text Representation.
☆15Updated last year
Related projects ⓘ
Alternatives and complementary repositories for TextNAS
- Factorized Neural Layers☆27Updated last year
- ☆31Updated 2 years ago
- Benchmark tools for LightGBM☆14Updated last year
- ☆55Updated 6 months ago
- This package implements THOR: Transformer with Stochastic Experts.☆61Updated 3 years ago
- Official Pytorch Implementation of Length-Adaptive Transformer (ACL 2021)☆100Updated 4 years ago
- Lightweight Deep Learning Model Training library based on PyTorch☆32Updated 2 years ago
- The code of EMNLP 2019 paper "A Split-and-Recombine Approach for Follow-up Query Analysis"☆17Updated last year
- AutoMoE: Neural Architecture Search for Efficient Sparsely Activated Transformers☆42Updated 2 years ago
- An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.☆19Updated last year
- BANG is a new pretraining model to Bridge the gap between Autoregressive (AR) and Non-autoregressive (NAR) Generation. AR and NAR generat…☆28Updated 2 years ago
- NAACL '24 (Best Demo Paper RunnerUp) / MlSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference☆61Updated last month
- [KDD'22] Learned Token Pruning for Transformers☆93Updated last year
- Research and development for optimizing transformers☆125Updated 3 years ago
- A curated list of awesome resources combining Transformers with Neural Architecture Search☆260Updated last year
- Official code for "Distributed Deep Learning in Open Collaborations" (NeurIPS 2021)☆116Updated 2 years ago
- State-of-the-art pretrained vision model from Bing Multimedia☆18Updated last year
- pytorch-profiler☆50Updated last year
- ☆237Updated 3 months ago
- Dataset from Tip of the Tongue Known-Item Retrieval (2021) paper.☆11Updated 3 years ago
- Scripts to parse arxiv documents for NLP tasks☆17Updated last year
- Renee: End-to-end training of extreme classification models☆21Updated last year
- ☆12Updated 3 years ago
- A tracing JIT for PyTorch☆17Updated 2 years ago
- Generative Retrieval Transformer☆29Updated last year
- Training material for IPU users: tutorials, feature examples, simple applications☆87Updated last year
- AutoPEFT: Automatic Configuration Search for Parameter-Efficient Fine-Tuning (Zhou et al.; TACL)☆44Updated 8 months ago
- Asynchronous Stochastic Gradient Descent with Delay Compensation☆21Updated 7 years ago
- ☆12Updated 2 years ago
- Learning Accurate Decision Trees with Bandit Feedback via Quantized Gradient Descent☆14Updated 2 years ago