Code for the paper "Adaptive Transformers for Learning Multimodal Representations" (ACL SRW 2020)
☆43Oct 20, 2022Updated 3 years ago
Alternatives and similar repositories for adaptive_transformer
Users that are interested in adaptive_transformer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆19Apr 7, 2020Updated 6 years ago
- ☆23Oct 20, 2020Updated 5 years ago
- This repository contains the code for "BERTRAM: Improved Word Embeddings Have Big Impact on Contextualized Representations".☆64Aug 13, 2020Updated 5 years ago
- This repo contains code to reproduce some of the results presented in the paper "SentenceMIM: A Latent Variable Language Model"☆28Jun 22, 2022Updated 3 years ago
- Code for scaling Transformers☆26Dec 2, 2020Updated 5 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Sparse Transformer with limited attention span in PyTorch☆15Apr 4, 2021Updated 5 years ago
- Sparse Attention with Linear Units☆20Apr 21, 2021Updated 5 years ago
- Continual Memorization of Factoids in Large Language Models☆12Nov 20, 2024Updated last year
- Source code for our AAAI 2020 paper P-SIF: Document Embeddings using Partition Averaging☆35May 2, 2020Updated 6 years ago
- Attention-based multimodal fusion for sentiment analysis☆13Aug 14, 2018Updated 7 years ago
- kaggle情感分析rnn+attention解法☆12Nov 17, 2017Updated 8 years ago
- Paper and code for Gradient Descent: The Ultimate Optimizer☆24Oct 3, 2023Updated 2 years ago
- Implementation of the paper "Emotion Identification from raw speech signals using DNNs"☆14Jun 11, 2020Updated 5 years ago
- Code for the paper PermuteFormer☆42Oct 10, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- TGLS: Unsupervised Text Generation by Learning from Search☆25Jan 5, 2021Updated 5 years ago
- Easily serialize dataclasses to and from tensors (PyTorch, NumPy)☆18Apr 10, 2021Updated 5 years ago
- Code for Generalized Entropy Regularization paper☆14May 2, 2020Updated 6 years ago
- 📒Record some paper read notes☆21Jan 1, 2022Updated 4 years ago
- High performance pytorch modules☆18Jan 14, 2023Updated 3 years ago
- Multimodal preprocessing on IEMOCAP dataset☆13Jun 8, 2018Updated 7 years ago
- Multistage Fusion with Forget Gate for Multimodal Summarization in Open-Domain Videos☆12Oct 8, 2020Updated 5 years ago
- TBC☆28Nov 2, 2022Updated 3 years ago
- The good practice in the VQA system such as pos-tag attention, structed triplet learning and triplet attention is very general and can be…☆19Jan 23, 2018Updated 8 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ACL'2020: Contextualized Sparse Representations for Real-Time Open-Domain Question Answering☆49Oct 6, 2020Updated 5 years ago
- Code to run the TILT transfer learning experiments☆33Feb 13, 2021Updated 5 years ago
- The Transformer in PyTorch☆13Aug 7, 2024Updated last year
- XL-AMR is a sequence-to-graph cross-lingual AMR parser that exploits transfer learning (EMNLP2020).☆17Jul 25, 2024Updated last year
- ☆24Jan 20, 2021Updated 5 years ago
- A simple Transformer where the softmax has been replaced with normalization☆20Sep 11, 2020Updated 5 years ago
- ☆11Jun 6, 2023Updated 2 years ago
- [ACL 2020] DeFormer: Decomposing Pre-trained Transformers for Faster Question Answering☆121May 22, 2023Updated 3 years ago
- Implementation of HITS algorithm for finding hub and authority scores for twitter users☆12Dec 8, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆217Dec 5, 2021Updated 4 years ago
- The appendix and core code of model CauSTG, for accepted paper in KDD 2023.☆12Jun 15, 2023Updated 2 years ago
- PyTorch implementation of the End-to-End Memory Network with attention layer vizualisation support.☆12Jun 30, 2018Updated 7 years ago
- ACM MULTIMEDIA CONFERENCE 2020☆11Jul 28, 2020Updated 5 years ago
- [ACL'19] [PyTorch] Multimodal Transformer☆988Sep 12, 2022Updated 3 years ago
- A Pytorch implementation of Attention on Attention module (both self and guided variants), for Visual Question Answering☆43Nov 8, 2020Updated 5 years ago
- [ICML 2020] code for "PowerNorm: Rethinking Batch Normalization in Transformers" https://arxiv.org/abs/2003.07845☆120Jun 20, 2021Updated 4 years ago