Easy and Efficient Transformer: Scalable Inference Solution for Large NLP Models
☆264Nov 30, 2024Updated last year
Alternatives and similar repositories for EET
Users that are interested in EET are comparing it to the libraries listed below.
- Easy and Efficient Quantization for Transformers☆206Mar 25, 2026Updated 2 weeks ago
- ☆13Aug 23, 2024Updated last year
- A fast and user-friendly runtime for transformer inference (BERT, ALBERT, GPT-2, decoders, etc.) on CPU and GPU.☆1,545Jul 18, 2025Updated 8 months ago
- LightSeq: A High Performance Library for Sequence Processing and Generation☆3,300May 16, 2023Updated 2 years ago
- ACL 2021: HiTransformer☆13May 29, 2021Updated 4 years ago
- Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀☆1,688Oct 23, 2024Updated last year
- Efficient Inference for Big Models☆586Jan 24, 2023Updated 3 years ago
- The codebase for "Group-wise Contrastive Learning for Neural Dialogue Generation" (Cai et al., Findings of EMNLP 2020)☆55Feb 24, 2021Updated 5 years ago
- Inference framework for MoE layers based on TensorRT with Python binding☆41May 31, 2021Updated 4 years ago
- kaldi cnn-tdnnf baseline☆13Aug 31, 2021Updated 4 years ago
- ☆15Nov 3, 2022Updated 3 years ago
- Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo☆3,106May 9, 2024Updated last year
- ACL 2020: ScriptWriter: Narrative-Guided Script Generation☆33May 24, 2022Updated 3 years ago
- Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.☆3,156Jan 22, 2024Updated 2 years ago
- A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models☆1,941Jun 12, 2023Updated 2 years ago
- A plug-in of Microsoft DeepSpeed to fix the bug of DeepSpeed pipeline☆25Apr 16, 2021Updated 4 years ago
- Neural end-to-end Speech Translation Toolkit☆307Jun 28, 2022Updated 3 years ago
- A fast MoE impl for PyTorch☆1,847Feb 10, 2025Updated last year
- Reinforcement learning training for LLMs such as GPT-2, LLaMA, and BLOOM☆27Sep 19, 2023Updated 2 years ago
- SwissArmyTransformer is a flexible and powerful library to develop your own Transformer variants.☆1,116Dec 26, 2024Updated last year
- Parallelformers: An Efficient Model Parallelization Toolkit for Deployment☆789Apr 24, 2023Updated 2 years ago
- Boosting your Web Services of Deep Learning Applications.☆1,244May 13, 2021Updated 4 years ago
- A code library for training acoustic models☆27May 20, 2020Updated 5 years ago
- [ICLR 2020] Lite Transformer with Long-Short Range Attention☆611Jul 11, 2024Updated last year
- ICASSP 2021 accepted papers on voice conversion (VC)☆18Apr 11, 2021Updated 5 years ago
- Official repository for "PAIR: Planning and Iterative Refinement in Pre-trained Transformers for Long Text Generation"☆32Apr 17, 2021Updated 4 years ago
- Rethinking Perturbations in Encoder-Decoders for Fast Training☆18Nov 25, 2021Updated 4 years ago
- Opencpop: A High-Quality Open Source Chinese Popular Song Database for Singing Voice Synthesis☆232Dec 10, 2025Updated 4 months ago
- [ASRU 2021] Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition☆219Jun 22, 2023Updated 2 years ago
- [NeurIPS'22 Spotlight] A Contrastive Framework for Neural Text Generation☆476Mar 7, 2024Updated 2 years ago
- A plug-and-play library for parameter-efficient-tuning (Delta Tuning)☆1,041Sep 19, 2024Updated last year
- Transformer related optimization, including BERT, GPT☆6,410Mar 27, 2024Updated 2 years ago
- GLM (General Language Model)☆24Mar 7, 2022Updated 4 years ago
- Fast Inference Solutions for BLOOM☆566Oct 9, 2024Updated last year
- Large-scale model inference.☆628Sep 12, 2023Updated 2 years ago
- A knowledge-graph-based system for visualizing and answering questions about relationships between people☆10Aug 24, 2018Updated 7 years ago
- mWER loss implementation in TensorFlow☆31Sep 7, 2020Updated 5 years ago
- Implementation of AAAI2021 paper "Writing Polishment with Simile: Task, Dataset and A Neural Approach"☆21Dec 25, 2020Updated 5 years ago
- Unifew: Unified Fewshot Learning Model☆18Sep 10, 2021Updated 4 years ago