Convert BART models to ONNX with quantization: a 3x reduction in size and up to a 3x boost in inference speed.
☆33Dec 11, 2024Updated last year
Alternatives and similar repositories for fast-Bart
Users who are interested in fast-Bart are comparing it to the libraries listed below.
- Using TensorRT and Triton Server to build a BERT model as a service☆13Jan 10, 2022Updated 4 years ago
- ⚡ boost inference speed of T5 models by 5x & reduce the model size by 3x.☆588Apr 24, 2023Updated 3 years ago
- Multi-task modelling extensions for huggingface transformers☆21Mar 3, 2023Updated 3 years ago
- Simplified version of bert-flat, with many comments added☆15Nov 25, 2021Updated 4 years ago
- ☆12Oct 10, 2021Updated 4 years ago
- ☆13Jun 2, 2022Updated 3 years ago
- Code for EMNLP 2021 paper: Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting☆17Nov 30, 2021Updated 4 years ago
- ☆46Apr 13, 2022Updated 4 years ago
- Experimental code used in pre-training the KBIR and KeyBART models☆27Jul 8, 2022Updated 3 years ago
- ☆30May 30, 2022Updated 3 years ago
- Adding new tasks to T0 without catastrophic forgetting☆33Oct 20, 2022Updated 3 years ago
- Official code of our work, Representation Learning for Resource-Constrained Keyphrase Generation.☆11May 26, 2022Updated 4 years ago
- Tensorflow implementation of SNAIL and RL2☆11Aug 17, 2019Updated 6 years ago
- Code release for Type-Aware Bi-Encoders for Open-Domain Entity Retrieval☆19Sep 24, 2022Updated 3 years ago
- First-place solution for the CCKS2021 irrelevant-answer detection competition☆27Oct 8, 2021Updated 4 years ago
- DialCoT Meets PPO: Decomposing and Exploring Reasoning Paths in Smaller Language Models☆13Nov 2, 2023Updated 2 years ago
- A simple RNN meta-learner☆10Dec 17, 2018Updated 7 years ago
- Non Metric Space ( Approximate ) Library in R☆12Feb 2, 2023Updated 3 years ago
- ☆13Apr 27, 2022Updated 4 years ago
- [KBS] PCAE: A Framework of Plug-in Conditional Auto-Encoder for Controllable Text Generation PyTorch Implementation☆26Apr 10, 2023Updated 3 years ago
- an External Function Auto-Completion Tool to Strengthen the Static Binary Lifting☆13May 13, 2024Updated 2 years ago
- This repository contains the data and code for the paper "Diverse Text Generation via Variational Encoder-Decoder Models with Gaussian Pr…☆26Jun 27, 2022Updated 3 years ago
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆32Aug 5, 2023Updated 2 years ago
- Check out the new version at the link!☆22Dec 11, 2020Updated 5 years ago
- Evaluating BERT for the Answer Selection Task.☆12Dec 8, 2022Updated 3 years ago
- TensorRT☆11Sep 22, 2020Updated 5 years ago
- Code for the paper "XAI Beyond Classification: Interpretable Neural Clustering" (JMLR 2022)☆12Mar 12, 2022Updated 4 years ago
- [COLING'22] Code for our paper: "COLO: A Contrastive Learning based Re-ranking Framework for One-Stage Summarization"☆22Oct 21, 2022Updated 3 years ago
- A mod that injects MGL and patches Minecraft to work with it.☆12Apr 10, 2024Updated 2 years ago
- Sara - the Rasa Demo Bot: An example of a contextual AI assistant built with the open source Rasa Stack☆11Jan 14, 2021Updated 5 years ago
- This demo showcases the use of onnxruntime-rs with a GPU on CUDA 11 to run BERT in a data pipeline with Rust.☆16Feb 7, 2022Updated 4 years ago
- WordPress plugin that adds a Japanese proofreading feature☆11Jul 31, 2021Updated 4 years ago
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models☆25Aug 24, 2024Updated last year
- A Cantonese-English translator based on prompt engineering☆12Sep 19, 2023Updated 2 years ago
- The code for Template-GPT-2 Generation Model for Logic2Text Dataset☆18Jun 1, 2020Updated 5 years ago
- ☆59Apr 24, 2021Updated 5 years ago
- A Japanese language stemming algorithm☆11Feb 3, 2019Updated 7 years ago
- Chinese LTP lexical analysis based on the onnxruntime inference engine☆14Oct 4, 2022Updated 3 years ago
- Arabic News Stance Corpus☆11Feb 5, 2021Updated 5 years ago