Easy and Efficient Transformer : Scalable Inference Solution For Large NLP model
☆263Nov 30, 2024Updated last year
Alternatives and similar repositories for EET
Users that are interested in EET are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Easy and Efficient Quantization for Transformers☆205Mar 25, 2026Updated last month
- ☆13Aug 23, 2024Updated last year
- a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.☆1,548Jul 18, 2025Updated 10 months ago
- LightSeq: A High Performance Library for Sequence Processing and Generation☆3,301May 16, 2023Updated 3 years ago
- Serving Inside Pytorch☆170Updated this week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀☆1,686Oct 23, 2024Updated last year
- Efficient Inference for Big Models☆584Jan 24, 2023Updated 3 years ago
- The codebase for "Group-wise Contrastive Learning for Neural Dialogue Generation" (Cai et al., Findings of EMNLP 2020)☆55Feb 24, 2021Updated 5 years ago
- Inference framework for MoE layers based on TensorRT with Python binding☆41May 31, 2021Updated 4 years ago
- kaldi cnn-tdnnf baseline☆13Aug 31, 2021Updated 4 years ago
- ☆15Nov 3, 2022Updated 3 years ago
- Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo☆3,110May 9, 2024Updated 2 years ago
- ACL 2020: ScriptWriter: Narrative-Guided Script Generation☆33May 24, 2022Updated 3 years ago
- Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.☆3,162Jan 22, 2024Updated 2 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models☆1,948Jun 12, 2023Updated 2 years ago
- Neural end-to-end Speech Translation Toolkit☆307Jun 28, 2022Updated 3 years ago
- A plug-in of Microsoft DeepSpeed to fix the bug of DeepSpeed pipeline☆25Apr 16, 2021Updated 5 years ago
- A fast MoE impl for PyTorch☆1,850Feb 10, 2025Updated last year
- realize the reinforcement learning training for gpt2 llama bloom and so on llm model☆27Sep 19, 2023Updated 2 years ago
- SwissArmyTransformer is a flexible and powerful library to develop your own Transformer variants.☆1,117Dec 26, 2024Updated last year
- Parallelformers: An Efficient Model Parallelization Toolkit for Deployment☆788Apr 24, 2023Updated 3 years ago
- Boosting your Web Services of Deep Learning Applications.☆1,243May 13, 2021Updated 5 years ago
- it's a train acoustics model code lib☆27May 20, 2020Updated 6 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [ICLR 2020] Lite Transformer with Long-Short Range Attention☆610Jul 11, 2024Updated last year
- ICASSP 2021 accepted papers in term of voice conversion (VC)☆18Apr 11, 2021Updated 5 years ago
- Official repository for "PAIR: Planning and Iterative Refinement in Pre-trained Transformers for Long Text Generation"☆31Apr 17, 2021Updated 5 years ago
- Rethinking Perturbations in Encoder-Decoders for Fast Training☆18Nov 25, 2021Updated 4 years ago
- Opencpop: A High-Quality Open Source Chinese Popular Song Database for Singing Voice Synthesis☆235Dec 10, 2025Updated 5 months ago
- [ASRU 2021] Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition☆220Jun 22, 2023Updated 2 years ago
- [NeurIPS'22 Spotlight] A Contrastive Framework for Neural Text Generation☆476Mar 7, 2024Updated 2 years ago
- A plug-and-play library for parameter-efficient-tuning (Delta Tuning)☆1,044Sep 19, 2024Updated last year
- Transformer related optimization, including BERT, GPT☆6,416Mar 27, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- GLM (General Language Model)☆24Mar 7, 2022Updated 4 years ago
- Fast Inference Solutions for BLOOM☆566Oct 9, 2024Updated last year
- Large-scale model inference.☆629Sep 12, 2023Updated 2 years ago
- 基于知识图谱的人物关系可视化及问答系统☆10Aug 24, 2018Updated 7 years ago
- mWER loss implementation in tensorflow☆31Sep 7, 2020Updated 5 years ago
- Unifew: Unified Fewshot Learning Model☆18Sep 10, 2021Updated 4 years ago
- ☆19Feb 25, 2023Updated 3 years ago