Convert BART models to ONNX with quantization. 3X reduction in size and up to 3X boost in inference speed.
☆33Dec 11, 2024Updated last year
Alternatives and similar repositories for fast-Bart
Users that are interested in fast-Bart are comparing it to the libraries listed below.
- Using TensorRT and Triton Server to build BERT model as a service☆13Jan 10, 2022Updated 4 years ago
- ⚡ boost inference speed of T5 models by 5x & reduce the model size by 3x.☆588Apr 24, 2023Updated 3 years ago
- Multi-task modelling extensions for huggingface transformers☆21Mar 3, 2023Updated 3 years ago
- Simplified version of bert-flat with many added comments☆15Nov 25, 2021Updated 4 years ago
- ☆12Oct 10, 2021Updated 4 years ago
- ☆13Jun 2, 2022Updated 4 years ago
- Code for EMNLP 2021 paper: Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting☆17Nov 30, 2021Updated 4 years ago
- ☆46Apr 13, 2022Updated 4 years ago
- Experimental code used in pre-training the KBIR and KeyBART models☆27Jul 8, 2022Updated 3 years ago
- Official code of our work, Representation Learning for Resource-Constrained Keyphrase Generation.☆11May 26, 2022Updated 4 years ago
- TensorFlow implementation of SNAIL and RL2☆11Aug 17, 2019Updated 6 years ago
- Code release for Type-Aware Bi-Encoders for Open-Domain Entity Retrieval☆19Sep 24, 2022Updated 3 years ago
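- Internet sensitive words; a sensitive-word detection system☆11Oct 12, 2025Updated 8 months ago
- Winning solution for the CCKS2021 irrelevant-answer detection competition☆27Oct 8, 2021Updated 4 years ago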
- ☆10Apr 6, 2022Updated 4 years ago
- ☆11Oct 9, 2022Updated 3 years ago
- DialCoT Meets PPO: Decomposing and Exploring Reasoning Paths in Smaller Language Models☆13Nov 2, 2023Updated 2 years ago
- A simple RNN meta-learner☆10Dec 17, 2018Updated 7 years ago
- Python version of extractcontent.rb☆24Apr 10, 2017Updated 9 years ago
- Non Metric Space ( Approximate ) Library in R☆12Feb 2, 2023Updated 3 years ago
- ☆13Apr 27, 2022Updated 4 years ago
- This repository contains the data and code for the paper "Diverse Text Generation via Variational Encoder-Decoder Models with Gaussian Pr…☆26Jun 27, 2022Updated 3 years ago
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆32Aug 5, 2023Updated 2 years ago
- Check out the new version at the link!☆22Dec 11, 2020Updated 5 years ago
- PyTorch implementation of Value Retrieval with Arbitrary Queries for Form-like Documents.☆16May 1, 2025Updated last year
- A PyTorch implementation of "Dynamic Points Agglomeration for Hierarchical Point Sets Learning" (DPAM) (ICCV2019)☆13Nov 15, 2019Updated 6 years ago
- TensorRT☆11Sep 22, 2020Updated 5 years ago
- Code for the paper "XAI Beyond Classification: Interpretable Neural Clustering" (JMLR 2022)☆12Mar 12, 2022Updated 4 years ago
- [COLING'22] Code for our paper: "COLO: A Contrastive Learning based Re-ranking Framework for One-Stage Summarization"☆22Oct 21, 2022Updated 3 years ago
- Sara - the Rasa Demo Bot: An example of a contextual AI assistant built with the open source Rasa Stack☆11Jan 14, 2021Updated 5 years ago
- Diff English sentences via Levenshtein distance, calculate word error rate, and list word-by-word differences☆10Apr 21, 2020Updated 6 years ago
- A test website created using Django Python for a university project.☆10Jan 3, 2023Updated 3 years ago
- This demo showcases the use of onnxruntime-rs with a GPU on CUDA 11 to run BERT in a data pipeline with Rust.☆16Feb 7, 2022Updated 4 years ago
- End-to-end neural table-text understanding models.☆10Nov 11, 2020Updated 5 years ago
- Hybrid List Aware Transformer Reranking☆20Oct 25, 2022Updated 3 years ago
- WordPress plugin that adds Japanese proofreading☆11Jul 31, 2021Updated 4 years ago
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models☆25Aug 24, 2024Updated last year
- The code for Template-GPT-2 Generation Model for Logic2Text Dataset☆18Jun 1, 2020Updated 6 years ago
- A Japanese language stemming algorithm☆11Feb 3, 2019Updated 7 years ago