Re-implementation of Bi-Directional Block Self-Attention for Fast and Memory-Efficient Sequence Modeling (T. Shen et al., ICLR 2018) on Pytorch.
☆42Feb 22, 2018Updated 8 years ago
Alternatives and similar repositories for BiBloSA-pytorch
Users that are interested in BiBloSA-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Bi-Directional Block Self-Attention☆122May 8, 2018Updated 8 years ago
- Implementation for ACL 2024 paper "Meta-Task Prompting Elicits Embeddings from Large Language Models"☆12Jul 25, 2024Updated last year
- Implementation of the Character-level Intra Attention Network (CIAN) for Natural Language Inference (NLI) upon SNLI and MultiNLI corpus☆17Nov 24, 2017Updated 8 years ago
- A summary of my recently surveyed papers. Some papers on Arxiv with unimpressive results are not included.☆25Apr 18, 2018Updated 8 years ago
- ☆20Apr 12, 2017Updated 9 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆18Nov 25, 2022Updated 3 years ago
- NLPBench: Evaluating NLP-Related Problem-solving Ability in Large Language Models☆10Oct 27, 2023Updated 2 years ago
- [ICCV 2025] Diffusion Curriculum (DisCL)☆18Sep 26, 2025Updated 8 months ago
- This data release is meant to accompany and document the paper: https://arxiv.org/abs/2004.11997 Collecting Entailment Data for Pretrain…☆14Sep 29, 2020Updated 5 years ago
- [NAACL'25] RuleR: Improving LLM Controllability by Rule-based Data Recycling☆14Sep 27, 2025Updated 8 months ago
- A study of the downstream instability of word embeddings☆12Aug 23, 2022Updated 3 years ago
- The implementation of "Neural Machine Translation without Embeddings", NAACL 2021☆33Jun 9, 2021Updated 5 years ago
- Alex Graves' Adaptive Computation Time in PyTorch☆14Jan 9, 2018Updated 8 years ago
- Code and Models for paper "AutoSeM: Automatic Task Selection and Mixing in Multi-Task Learning. Han Guo, Ramakanth Pasunuru, and Mohit Ba…☆24Apr 15, 2019Updated 7 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆16Mar 24, 2023Updated 3 years ago
- Code for paper: Optimizing Length Compression in Large Reasoning Models☆28Oct 20, 2025Updated 7 months ago
- Code of Directional Self-Attention Network (DiSAN)☆311May 8, 2018Updated 8 years ago
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆19Mar 10, 2025Updated last year
- Sparse and structured neural attention mechanisms☆224Aug 31, 2020Updated 5 years ago
- PyTorch C++ Extension Example☆15Mar 4, 2018Updated 8 years ago
- toy ccg parser☆14Apr 14, 2016Updated 10 years ago
- Codebase for the paper Aerial Diffusion: Text Guided Ground-to-Aerial View Translation from a Single Image using Diffusion Models☆13Oct 3, 2023Updated 2 years ago
- Aspect-augmented Adversarial Networks for Domain Adaptation☆49Feb 16, 2017Updated 9 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Phrase-Indexed Question Answering (PIQA)☆93Apr 27, 2019Updated 7 years ago
- ☆18May 21, 2018Updated 8 years ago
- PyTorch Implementation of NeurIPS 2020 paper "Learning Sparse Prototypes for Text Generation"☆22Jul 8, 2021Updated 4 years ago
- Bi-directional streaming speech-to-text service using Cloud ASRs☆15Aug 23, 2017Updated 8 years ago
- Repository for NLI models (EMNLP 2018)☆61Nov 14, 2018Updated 7 years ago
- Self-Paced Multi-view Co-training for person re-id experiment☆30Jun 9, 2021Updated 5 years ago
- Software for the paper "Gender and Lexical Variation in Social Media" with David Bamman and Tyler Schnoebelen☆17Nov 10, 2015Updated 10 years ago
- An implementation for MLLM oversensitivity evaluation☆18Nov 16, 2024Updated last year
- [ACL'25] Mosaic-IT: Cost-Free Compositional Data Synthesis for Instruction Tuning☆20Sep 27, 2025Updated 8 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Pytorch implementation for ICLR24:"Online GNN Evaluation Under Test-Time Graph Distribution Shifts"☆16Mar 23, 2024Updated 2 years ago
- AutoHallusion Codebase (EMNLP 2024)☆23Dec 6, 2024Updated last year
- Pytorch implementation of WWW'23:"Auto-HeG: Automated Graph Neural Network on Heterophilic Graphs"☆16Jul 2, 2023Updated 2 years ago
- Code for the paper "Neural Question Generation from Text: A Preliminary Study"☆144Jan 9, 2019Updated 7 years ago
- Cost-Sensitive Toolpath Agent for Multi-turn Image Editing☆31Mar 26, 2025Updated last year
- ☆13May 16, 2016Updated 10 years ago
- python metric functions, such as MAP, NDCG, AUC...☆10Jul 25, 2014Updated 11 years ago