Re-implementation of Bi-Directional Block Self-Attention for Fast and Memory-Efficient Sequence Modeling (T. Shen et al., ICLR 2018) on Pytorch.
☆42Feb 22, 2018Updated 8 years ago
Alternatives and similar repositories for BiBloSA-pytorch
Users that are interested in BiBloSA-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Bi-Directional Block Self-Attention☆122May 8, 2018Updated 7 years ago
- Implementation for ACL 2024 paper "Meta-Task Prompting Elicits Embeddings from Large Language Models"☆12Jul 25, 2024Updated last year
- Implementation of the Character-level Intra Attention Network (CIAN) for Natural Language Inference (NLI) upon SNLI and MultiNLI corpus☆17Nov 24, 2017Updated 8 years ago
- Source code for "A Lightweight Recurrent Network for Sequence Modeling"☆26Dec 7, 2022Updated 3 years ago
- A summary of my recently surveyed papers. Some papers on Arxiv with unimpressive results are not included.☆25Apr 18, 2018Updated 7 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- ☆20Apr 12, 2017Updated 8 years ago
- ☆18Nov 25, 2022Updated 3 years ago
- NLPBench: Evaluating NLP-Related Problem-solving Ability in Large Language Models☆10Oct 27, 2023Updated 2 years ago
- [ICML2022] "Identity-Disentangled Adversarial Augmentation for Self-Supervised Learning"☆10Jul 24, 2022Updated 3 years ago
- This data release is meant to accompany and document the paper: https://arxiv.org/abs/2004.11997 Collecting Entailment Data for Pretrain…☆14Sep 29, 2020Updated 5 years ago
- [ICCV 2025] Diffusion Curriculum (DisCL)☆18Sep 26, 2025Updated 6 months ago
- [NAACL'25] RuleR: Improving LLM Controllability by Rule-based Data Recycling☆14Sep 27, 2025Updated 6 months ago
- Dynamic Spear Model☆12Jul 24, 2019Updated 6 years ago
- A study of the downstream instability of word embeddings☆12Aug 23, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- The implementation of "Neural Machine Translation without Embeddings", NAACL 2021☆33Jun 9, 2021Updated 4 years ago
- Alex Graves' Adaptive Computation Time in PyTorch☆14Jan 9, 2018Updated 8 years ago
- Code and Models for paper "AutoSeM: Automatic Task Selection and Mixing in Multi-Task Learning. Han Guo, Ramakanth Pasunuru, and Mohit Ba…☆24Apr 15, 2019Updated 6 years ago
- ☆16Mar 24, 2023Updated 3 years ago
- Code of Directional Self-Attention Network (DiSAN)☆311May 8, 2018Updated 7 years ago
- Effective Cascade Dual-Decoder Model for Joint Entity and Relation Extraction.☆18Jan 12, 2022Updated 4 years ago
- toy ccg parser☆14Apr 14, 2016Updated 9 years ago
- Codebase for the paper Aerial Diffusion: Text Guided Ground-to-Aerial View Translation from a Single Image using Diffusion Models☆13Oct 3, 2023Updated 2 years ago
- Aspect-augmented Adversarial Networks for Domain Adaptation☆49Feb 16, 2017Updated 9 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Phrase-Indexed Question Answering (PIQA)☆93Apr 27, 2019Updated 6 years ago
- [COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"☆20Apr 9, 2025Updated 11 months ago
- ☆18May 21, 2018Updated 7 years ago
- PyTorch Implementation of NeurIPS 2020 paper "Learning Sparse Prototypes for Text Generation"☆22Jul 8, 2021Updated 4 years ago
- An very convenient Audio Recorder For ASR Projects. It can recording 16K 16Bit Wav files for ASR projects for the next recognizing. it u…☆13Jun 25, 2019Updated 6 years ago
- Repository for NLI models (EMNLP 2018)☆61Nov 14, 2018Updated 7 years ago
- Self-Paced Multi-view Co-training for person re-id experiment☆30Jun 9, 2021Updated 4 years ago
- Software for the paper "Gender and Lexical Variation in Social Media" with David Bamman and Tyler Schnoebelen☆17Nov 10, 2015Updated 10 years ago
- An implementation for MLLM oversensitivity evaluation☆18Nov 16, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- The official code repository for MetricMT - a reward optimization method for NMT with learned metrics☆25Apr 24, 2021Updated 4 years ago
- [ACL'25] Mosaic-IT: Cost-Free Compositional Data Synthesis for Instruction Tuning☆20Sep 27, 2025Updated 6 months ago
- Pytorch implementation for ICLR24:"Online GNN Evaluation Under Test-Time Graph Distribution Shifts"☆16Mar 23, 2024Updated 2 years ago
- AutoHallusion Codebase (EMNLP 2024)☆22Dec 6, 2024Updated last year
- Pytorch implementation of WWW'23:"Auto-HeG: Automated Graph Neural Network on Heterophilic Graphs"☆16Jul 2, 2023Updated 2 years ago
- Code for the paper "Neural Question Generation from Text: A Preliminary Study"☆144Jan 9, 2019Updated 7 years ago
- Cost-Sensitive Toolpath Agent for Multi-turn Image Editing☆28Mar 26, 2025Updated last year