Convert BART models to ONNX with quantization. 3X reduction in size and up to 3X boost in inference speed
☆33Dec 11, 2024Updated last year
Alternatives and similar repositories for fast-Bart
Users that are interested in fast-Bart are comparing it to the libraries listed below.
- Using TensorRT and Triton Server to build BERT model as a service☆13Jan 10, 2022Updated 4 years ago
- ⚡ boost inference speed of T5 models by 5x & reduce the model size by 3x.☆588Apr 24, 2023Updated 3 years ago
- Multi-task modelling extensions for huggingface transformers☆21Mar 3, 2023Updated 3 years ago
- Simplified version of bert-flat with many added comments☆15Nov 25, 2021Updated 4 years ago
- ☆13Jun 2, 2022Updated 3 years ago
- Code for EMNLP 2021 paper: Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting☆17Nov 30, 2021Updated 4 years ago
- Convenient Text-to-Text Training for Transformers☆18Dec 10, 2021Updated 4 years ago
- ☆46Apr 13, 2022Updated 4 years ago
- Experimental code used in pre-training the KBIR and KeyBART models☆27Jul 8, 2022Updated 3 years ago
- Adding new tasks to T0 without catastrophic forgetting☆33Oct 20, 2022Updated 3 years ago
- Official code of our work, Representation Learning for Resource-Constrained Keyphrase Generation.☆11May 26, 2022Updated 3 years ago
- Tensorflow implementation of SNAIL and RL2☆11Aug 17, 2019Updated 6 years ago
- Code release for Type-Aware Bi-Encoders for Open-Domain Entity Retrieval☆19Sep 24, 2022Updated 3 years ago
- Internet sensitive words; a sensitive-word detection system☆11Oct 12, 2025Updated 6 months ago
- Winning solution for the CCKS2021 "irrelevant answer" (答非所问) competition☆27Oct 8, 2021Updated 4 years ago
- the datasets of our paper☆11Feb 26, 2024Updated 2 years ago
- ☆11Oct 9, 2022Updated 3 years ago
- DialCoT Meets PPO: Decomposing and Exploring Reasoning Paths in Smaller Language Models☆13Nov 2, 2023Updated 2 years ago
- A simple RNN meta-learner☆10Dec 17, 2018Updated 7 years ago
- Python port of extractcontent.rb☆24Apr 10, 2017Updated 9 years ago
- Non Metric Space ( Approximate ) Library in R☆12Feb 2, 2023Updated 3 years ago
- [KBS] PCAE: A Framework of Plug-in Conditional Auto-Encoder for Controllable Text Generation PyTorch Implementation☆26Apr 10, 2023Updated 3 years ago
- an External Function Auto-Completion Tool to Strengthen the Static Binary Lifting☆13May 13, 2024Updated last year
- ☆11May 23, 2023Updated 2 years ago
- This repository contains the data and code for the paper "Diverse Text Generation via Variational Encoder-Decoder Models with Gaussian Pr…☆26Jun 27, 2022Updated 3 years ago
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆32Aug 5, 2023Updated 2 years ago
- Check out the new version at the link!☆22Dec 11, 2020Updated 5 years ago
- Pytorch Implementation of Value Retrieval with Arbitrary Queries for Form-like Documents.☆16May 1, 2025Updated last year
- Evaluating BERT for the Answer Selection Task.☆12Dec 8, 2022Updated 3 years ago
- Code for ACL 2024 findings paper "wav2vec-S: Adapting Pre-trained Speech Models for Streaming"☆12Apr 21, 2026Updated 2 weeks ago
- A pytorch implementation of "Dynamic Points Agglomeration for Hierarchical Point Sets Learning" (DPAM) (ICCV2019)☆13Nov 15, 2019Updated 6 years ago
- TensorRT☆11Sep 22, 2020Updated 5 years ago
- Code for the paper "XAI Beyond Classification: Interpretable Neural Clustering" (JMLR 2022)☆12Mar 12, 2022Updated 4 years ago
- [COLING'22] Code for our paper: "COLO: A Contrastive Learning based Re-ranking Framework for One-Stage Summarization"☆22Oct 21, 2022Updated 3 years ago
- Sara - the Rasa Demo Bot: An example of a contextual AI assistant built with the open source Rasa Stack☆11Jan 14, 2021Updated 5 years ago
- Diff English sentences via Levenshtein distance, calculate word error rate, and list word-by-word differences☆10Apr 21, 2020Updated 6 years ago
- A test website created using Django Python for a university project.☆10Jan 3, 2023Updated 3 years ago
- ☆44Jul 31, 2025Updated 9 months ago
- This demo showcases the use of onnxruntime-rs with a GPU on CUDA 11 to run BERT in a data pipeline with Rust.☆16Feb 7, 2022Updated 4 years ago