jarobyte91 / pytorch_beam_search

A lightweight implementation of Beam Search for sequence models in PyTorch.

☆55

Alternatives and similar repositories for pytorch_beam_search

Users who are interested in pytorch_beam_search are comparing it to the libraries listed below.

NAR-tutorial / acl2022
☆99 Updated 3 years ago
LitterBrother-Xiao / Overview-of-Non-autoregressive-Applications
☆176 Updated 10 months ago
huanghonggit / Mask-Language-Model
PyTorch; mask language model; BERT
☆72 Updated 5 years ago
yxuansu / Contrastive_Search_Is_What_You_Need
[TMLR'23] Contrastive Search Is What You Need For Neural Text Generation
☆119 Updated 2 years ago
alex-matton / causal-transformer-decoder
☆72 Updated 4 years ago
formiel / speech-translation
Multilingual speech translation
☆41 Updated 4 years ago
john-hewitt / backpacks-flash-attn
The original Backpack Language Model implementation, a fork of FlashAttention
☆67 Updated 2 years ago
mt-upc / iwslt-2021
Systems submitted to IWSLT 2021 by the MT-UPC group.
☆14 Updated 2 years ago
IamAdiSri / hf-trim
Reduce the size of pretrained Hugging Face models via vocabulary trimming.
☆44 Updated 2 years ago
Glaciohound / Chimera-ST
A PyTorch implementation of paper "Learning Shared Semantic Space for Speech-to-Text Translation", ACL (Findings) 2021
☆48 Updated 3 years ago
rosinality / imputer-pytorch
Implementation of Imputer: Sequence Modelling via Imputation and Dynamic Programming in PyTorch
☆58 Updated 5 years ago
lucidrains / charformer-pytorch
Implementation of the GBST block from the Charformer paper, in PyTorch
☆117 Updated 3 years ago
luyug / GC-DPR
Train Dense Passage Retriever (DPR) with a single GPU
☆131 Updated 3 years ago
nlpapereading / nlpapereading
☆59 Updated 2 years ago
asappresearch / slue-toolkit
A toolkit for Spoken Language Understanding Evaluation (SLUE) benchmark. Refer paper https://arxiv.org/abs/2111.10367 for more details. O…
☆65 Updated last year
PiotrNawrot / dynamic-pooling
Efficient Transformers with Dynamic Token Pooling
☆61 Updated 2 years ago
cimeister / typical-sampling
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
☆82 Updated 3 years ago
hqsiswiliam / persona-adaptive-attention
☆25 Updated last year
machelreid / diffuser
DiffusER: Discrete Diffusion via Edit-based Reconstruction (Reid, Hellendoorn & Neubig, 2022)
☆54 Updated 2 years ago
microsoft / AdaMix
This is the implementation of the paper AdaMix: Mixture-of-Adaptations for Parameter-efficient Model Tuning (https://arxiv.org/abs/2205.1…
☆131 Updated last year
xuchenneu / SATE
End-to-end Speech Translation with Stacked Acoustic-and-Textual Encoding
☆26 Updated 3 years ago
qqaatw / pytorch-realm-orqa
PyTorch reimplementation of REALM and ORQA
☆22 Updated 3 years ago
VProv / BPE-Dropout
An official implementation of "BPE-Dropout: Simple and Effective Subword Regularization" algorithm.
☆52 Updated 4 years ago
amazon-science / dse
☆43 Updated last year
voidism / DiffCSE
Code for the NAACL 2022 long paper "DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings"
☆294 Updated 2 years ago
kahne / NonAutoregGenProgress
Tracking the progress in non-autoregressive generation (translation, transcription, etc.)
☆306 Updated 2 years ago
DevSinghSachan / emdr2
Code and Models for the paper "End-to-End Training of Multi-Document Reader and Retriever for Open-Domain Question Answering" (NeurIPS 20…
☆109 Updated 3 years ago
andrewpeng02 / transformer-translation
Using PyTorch's nn.Transformer module to create an English-to-French neural machine translation model.
☆78 Updated 4 years ago
facebookresearch / mega
Sequence modeling with Mega.
☆295 Updated 2 years ago
thu-coai / DA-Transformer
Official Implementation for the ICML2022 paper "Directed Acyclic Transformer for Non-Autoregressive Machine Translation"
☆125 Updated last year