Re-implementation of Bi-Directional Block Self-Attention for Fast and Memory-Efficient Sequence Modeling (T. Shen et al., ICLR 2018) on Pytorch.
☆42Feb 22, 2018Updated 8 years ago
Alternatives and similar repositories for BiBloSA-pytorch
Users that are interested in BiBloSA-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Bi-Directional Block Self-Attention☆122May 8, 2018Updated 8 years ago
- Implementation for ACL 2024 paper "Meta-Task Prompting Elicits Embeddings from Large Language Models"☆12Jul 25, 2024Updated last year
- Implementation of the Character-level Intra Attention Network (CIAN) for Natural Language Inference (NLI) upon SNLI and MultiNLI corpus☆17Nov 24, 2017Updated 8 years ago
- Source code for "A Lightweight Recurrent Network for Sequence Modeling"☆26Dec 7, 2022Updated 3 years ago
- [ICML2022] "Identity-Disentangled Adversarial Augmentation for Self-Supervised Learning"☆10Jul 24, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [ICCV 2025] Diffusion Curriculum (DisCL)☆18Sep 26, 2025Updated 7 months ago
- This data release is meant to accompany and document the paper: https://arxiv.org/abs/2004.11997 Collecting Entailment Data for Pretrain…☆14Sep 29, 2020Updated 5 years ago
- Dynamic Spear Model☆12Jul 24, 2019Updated 6 years ago
- A study of the downstream instability of word embeddings☆12Aug 23, 2022Updated 3 years ago
- The implementation of "Neural Machine Translation without Embeddings", NAACL 2021☆33Jun 9, 2021Updated 4 years ago
- Code and Models for paper "AutoSeM: Automatic Task Selection and Mixing in Multi-Task Learning. Han Guo, Ramakanth Pasunuru, and Mohit Ba…☆24Apr 15, 2019Updated 7 years ago
- ☆16Mar 24, 2023Updated 3 years ago
- Code for paper: Optimizing Length Compression in Large Reasoning Models☆28Oct 20, 2025Updated 6 months ago
- Code of Directional Self-Attention Network (DiSAN)☆311May 8, 2018Updated 8 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆19Mar 10, 2025Updated last year
- Effective Cascade Dual-Decoder Model for Joint Entity and Relation Extraction.☆18Jan 12, 2022Updated 4 years ago
- Sparse and structured neural attention mechanisms☆224Aug 31, 2020Updated 5 years ago
- PyTorch C++ Extension Example☆15Mar 4, 2018Updated 8 years ago
- Codebase for the paper Aerial Diffusion: Text Guided Ground-to-Aerial View Translation from a Single Image using Diffusion Models☆13Oct 3, 2023Updated 2 years ago
- Aspect-augmented Adversarial Networks for Domain Adaptation☆49Feb 16, 2017Updated 9 years ago
- Phrase-Indexed Question Answering (PIQA)☆93Apr 27, 2019Updated 7 years ago
- [COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"☆20Apr 9, 2025Updated last year
- ☆18May 21, 2018Updated 7 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Software for the paper "Gender and Lexical Variation in Social Media" with David Bamman and Tyler Schnoebelen☆17Nov 10, 2015Updated 10 years ago
- The official code repository for MetricMT - a reward optimization method for NMT with learned metrics☆25Apr 24, 2021Updated 5 years ago
- [ACL'25] Mosaic-IT: Cost-Free Compositional Data Synthesis for Instruction Tuning☆20Sep 27, 2025Updated 7 months ago
- AutoHallusion Codebase (EMNLP 2024)☆22Dec 6, 2024Updated last year
- Code for the paper "Neural Question Generation from Text: A Preliminary Study"☆144Jan 9, 2019Updated 7 years ago
- Cost-Sensitive Toolpath Agent for Multi-turn Image Editing☆31Mar 26, 2025Updated last year
- python metric functions, such as MAP, NDCG, AUC...☆10Jul 25, 2014Updated 11 years ago
- Build kaldi inside docker containers with option for CUDA support☆12Feb 6, 2017Updated 9 years ago
- Tool for Evaluating Adversarial Perturbations on Text☆61Feb 27, 2022Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Align, a general text alignment function☆15Dec 7, 2023Updated 2 years ago
- Re-implementation of BIMPM (Bilateral Multi-Perspective Matching for Natural Language Sentences, Zhiguo Wang et al.) on Pytorch.☆103Oct 30, 2019Updated 6 years ago
- Source code for our journal submission : ELD-Net: An efficient deep learning architecture for accurate saliency detection☆10Nov 27, 2017Updated 8 years ago
- [ICLR 2026] Fast-Slow Toolpath Agent with Subroutine Mining for Efficient Multi-turn Image Editing☆33Feb 6, 2026Updated 3 months ago
- BiSET: Bi-directional Selective Encoding with Template for Abstractive Summarization (ACL 2019)☆41Sep 30, 2022Updated 3 years ago
- Neural discourse structure for text categorization☆12Aug 27, 2017Updated 8 years ago
- Code corresponding to our paper "A Graph-to-Sequence Model for AMR-to-Text Generation"☆137Mar 17, 2021Updated 5 years ago