Convert BART models to ONNX with quantization. 3X reduction in size, and up to 3X boost in inference speed
☆33Dec 11, 2024Updated last year
Alternatives and similar repositories for fast-Bart
Users that are interested in fast-Bart are comparing it to the libraries listed below.
- ⚡ boost inference speed of T5 models by 5x & reduce the model size by 3x.☆589Apr 24, 2023Updated 2 years ago
- Multi-task modelling extensions for huggingface transformers☆21Mar 3, 2023Updated 3 years ago
- Simplified version of bert-flat with many added comments☆15Nov 25, 2021Updated 4 years ago
- ☆12Oct 10, 2021Updated 4 years ago
- ☆13Jun 2, 2022Updated 3 years ago
- Code for EMNLP 2021 paper: Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting☆17Nov 30, 2021Updated 4 years ago
- Convenient Text-to-Text Training for Transformers☆19Dec 10, 2021Updated 4 years ago
- ☆46Apr 13, 2022Updated 4 years ago
- Experimental code used in pre-training the KBIR and KeyBART models☆27Jul 8, 2022Updated 3 years ago
- Adding new tasks to T0 without catastrophic forgetting☆33Oct 20, 2022Updated 3 years ago
- Official code of our work, Representation Learning for Resource-Constrained Keyphrase Generation.☆11May 26, 2022Updated 3 years ago
- Tensorflow implementation of SNAIL and RL2☆11Aug 17, 2019Updated 6 years ago
- ☆10Apr 6, 2022Updated 4 years ago
- ☆11Oct 9, 2022Updated 3 years ago
- DialCoT Meets PPO: Decomposing and Exploring Reasoning Paths in Smaller Language Models☆13Nov 2, 2023Updated 2 years ago
- Python version of extractcontent.rb☆24Apr 10, 2017Updated 9 years ago
- Non Metric Space ( Approximate ) Library in R☆12Feb 2, 2023Updated 3 years ago
- ☆13Apr 27, 2022Updated 3 years ago
- [KBS] PCAE: A Framework of Plug-in Conditional Auto-Encoder for Controllable Text Generation PyTorch Implementation☆26Apr 10, 2023Updated 3 years ago
- ☆11May 23, 2023Updated 2 years ago
- This repository contains the data and code for the paper "Diverse Text Generation via Variational Encoder-Decoder Models with Gaussian Pr…☆26Jun 27, 2022Updated 3 years ago
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆32Aug 5, 2023Updated 2 years ago
- Check out the new version at the link!☆22Dec 11, 2020Updated 5 years ago
- Pytorch Implementation of Value Retrieval with Arbitrary Queries for Form-like Documents.☆16May 1, 2025Updated 11 months ago
- Code for ACL 2024 findings paper "wav2vec-S: Adapting Pre-trained Speech Models for Streaming"☆12Apr 20, 2025Updated 11 months ago
- A pytorch implementation of "Dynamic Points Agglomeration for Hierarchical Point Sets Learning" (DPAM) (ICCV2019)☆13Nov 15, 2019Updated 6 years ago
- Sara - the Rasa Demo Bot: An example of a contextual AI assistant built with the open source Rasa Stack☆11Jan 14, 2021Updated 5 years ago
- Diff English sentences via Levenshtein distance, calculate word error rate, and list word-by-word differences☆10Apr 21, 2020Updated 5 years ago
- This demo showcases the use of onnxruntime-rs with a GPU on CUDA 11 to run Bert in a data pipeline with Rust.☆16Feb 7, 2022Updated 4 years ago
- Hybrid List Aware Transformer Reranking☆20Oct 25, 2022Updated 3 years ago
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models☆25Aug 24, 2024Updated last year
- A Cantonese-English translator based on prompt engineering☆12Sep 19, 2023Updated 2 years ago
- The code for Template-GPT-2 Generation Model for Logic2Text Dataset☆18Jun 1, 2020Updated 5 years ago
- ☆59Apr 24, 2021Updated 4 years ago
- A Japanese language stemming algorithm☆11Feb 3, 2019Updated 7 years ago
- ☆32Oct 30, 2023Updated 2 years ago
- Lightweight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by…☆34Aug 24, 2024Updated last year
- Chinese LTP lexical analysis based on the onnxruntime inference engine☆14Oct 4, 2022Updated 3 years ago
- Arabic News Stance Corpus☆11Feb 5, 2021Updated 5 years ago