Easy and Efficient Transformer : Scalable Inference Solution For Large NLP model
☆265Nov 30, 2024Updated last year
Alternatives and similar repositories for EET
Users that are interested in EET are comparing it to the libraries listed below
Sorting:
- Easy and Efficient Quantization for Transformers☆206Jan 28, 2026Updated last month
- a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.☆1,543Jul 18, 2025Updated 7 months ago
- LightSeq: A High Performance Library for Sequence Processing and Generation☆3,303May 16, 2023Updated 2 years ago
- ACL 2021: HiTransformer☆13May 29, 2021Updated 4 years ago
- Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀☆1,687Oct 23, 2024Updated last year
- Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo☆3,106May 9, 2024Updated last year
- Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.☆3,156Jan 22, 2024Updated 2 years ago
- Large-scale model inference.☆627Sep 12, 2023Updated 2 years ago
- A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models☆1,936Jun 12, 2023Updated 2 years ago
- realize the reinforcement learning training for gpt2 llama bloom and so on llm model☆27Sep 19, 2023Updated 2 years ago
- Efficient Inference for Big Models☆587Jan 24, 2023Updated 3 years ago
- Parallelformers: An Efficient Model Parallelization Toolkit for Deployment☆791Apr 24, 2023Updated 2 years ago
- SwissArmyTransformer is a flexible and powerful library to develop your own Transformer variants.☆1,112Dec 26, 2024Updated last year
- Neural end-to-end Speech Translation Toolkit☆307Jun 28, 2022Updated 3 years ago
- [NeurIPS'22 Spotlight] A Contrastive Framework for Neural Text Generation☆475Mar 7, 2024Updated last year
- [ICLR 2020] Lite Transformer with Long-Short Range Attention☆610Jul 11, 2024Updated last year
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.☆147Jul 26, 2021Updated 4 years ago
- IntLLaMA: A fast and light quantization solution for LLaMA☆18Jul 21, 2023Updated 2 years ago
- The codebase for "Group-wise Contrastive Learning for Neural Dialogue Generation" (Cai et al., Findings of EMNLP 2020)☆55Feb 24, 2021Updated 5 years ago
- A fast MoE impl for PyTorch☆1,840Feb 10, 2025Updated last year
- A Multi-Format Transfer Learning Model for Event Argument Extraction via Variational Information Bottleneck☆10Sep 9, 2022Updated 3 years ago
- Transformer related optimization, including BERT, GPT☆6,398Mar 27, 2024Updated last year
- ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab☆2,051Mar 19, 2024Updated last year
- Fast Inference Solutions for BLOOM☆566Oct 9, 2024Updated last year
- A plug-and-play library for parameter-efficient-tuning (Delta Tuning)☆1,039Sep 19, 2024Updated last year
- Ongoing research training transformer models at scale☆15,461Updated this week
- JsonTuning: Towards Generalizable, Robust, and Controllable Instruction Tuning☆10Nov 3, 2024Updated last year
- Repository for Skill Set Optimization☆14Jul 26, 2024Updated last year
- ☆413Nov 11, 2023Updated 2 years ago
- Boosting your Web Services of Deep Learning Applications.☆1,244May 13, 2021Updated 4 years ago
- Inference framework for MoE layers based on TensorRT with Python binding☆41May 31, 2021Updated 4 years ago
- State of the art faster Transformer with Tensorflow 2.0 ( NLP, Computer Vision, Audio ).☆85Mar 16, 2023Updated 2 years ago
- MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.☆2,097Jun 30, 2025Updated 8 months ago
- Code for our ACL 2021 paper - ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer☆541Dec 10, 2021Updated 4 years ago
- [ACL'20] HAT: Hardware-Aware Transformers for Efficient Natural Language Processing☆336Jul 14, 2024Updated last year
- FastFormers - highly efficient transformer models for NLU☆709Mar 21, 2025Updated 11 months ago
- The score code of FastBERT (ACL2020)☆609Oct 29, 2021Updated 4 years ago
- MetricEval: A framework that conceptualizes and operationalizes four main components of metric evaluation, in terms of reliability and va…☆12Nov 6, 2023Updated 2 years ago
- Official Pytorch Implementation for the paper 'SUPER-ADAM: Faster and Universal Framework of Adaptive Gradients'☆17Jan 12, 2022Updated 4 years ago