Compares the theoretical attention gradient with the attention gradient computed by PyTorch
☆16 Apr 1, 2024 Updated last year
Alternatives and similar repositories for Transformer-attention
Users who are interested in Transformer-attention are comparing it with the repositories listed below
- Tests the GSM8K score of https://huggingface.co/OFA-Sys/gsm8k-rft-llama7b-u13b ☆15 Aug 10, 2023 Updated 2 years ago
- [ICLR 2024 Spotlight] Code for ICLR 2024 paper "Towards Robust Offline Reinforcement Learning under Diverse Data Corruption" ☆20 Nov 25, 2024 Updated last year
- ☆31 Jul 4, 2024 Updated last year
- Code for COLING paper "A Hybrid Model of Classification and Generation for Spatial Relation Extraction" ☆10 Oct 20, 2022 Updated 3 years ago
- ☆12 Feb 23, 2022 Updated 4 years ago
- OpenAI Gym Environment for the Dobot Magician Robotic Arm ☆12 Jul 9, 2018 Updated 7 years ago
- ☆11 Jul 4, 2022 Updated 3 years ago
- Salient Objects in Clutter, arXiv, 2021 (ECCV 2018 extension). ☆11 Jun 17, 2021 Updated 4 years ago
- ☆13 Oct 7, 2024 Updated last year
- ☆12 Dec 20, 2020 Updated 5 years ago
- The database contains synthesized inverse synthetic aperture radar images of seven aircraft models. ☆16 Mar 21, 2016 Updated 9 years ago
- Re-implementation of Exploiting Edge Features in Graph Neural Networks ☆11 Apr 7, 2022 Updated 3 years ago
- 😎 Awesome papers on token redundancy reduction ☆11 Mar 12, 2025 Updated 11 months ago
- Attention-based multimodal fusion for sentiment analysis ☆13 Aug 14, 2018 Updated 7 years ago
- ☆13 Dec 2, 2025 Updated 3 months ago
- ☆10 Nov 22, 2022 Updated 3 years ago
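- Python web crawler examples; the scraped sites include Douban, Meituan, Bilibili, image resources, classical Chinese poetry, the Guangdong University of Technology official website, and others. ☆12 Apr 30, 2021 Updated 4 years ago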
- ☆10 Feb 1, 2022 Updated 4 years ago
- Small examples to test shared layer ☆11 Dec 31, 2020 Updated 5 years ago
- Code for EMNLP 2022 Paper DANLI: Deliberative Agent for Following Natural Language Instructions ☆18 May 1, 2025 Updated 10 months ago
- [ACL2023] Official code repository for VLN-Trans ☆14 Sep 10, 2023 Updated 2 years ago
- ☆14 Feb 7, 2020 Updated 6 years ago
- A Tensorflow SqueezeNet implementation ☆14 Oct 1, 2018 Updated 7 years ago
- Official PyTorch implementation of 'Facing the Elephant in the Room: Visual Prompt Tuning or Full Finetuning?' (ICLR 2024) ☆13 Mar 8, 2024 Updated last year
- Official PyTorch implementation for Label-Noise Robust Diffusion Models (TDSM) in ICLR 2024. ☆14 Apr 29, 2024 Updated last year
- A Spatial-Temporal Recurrent Neural Network for Video Saliency Prediction (TIP2021) ☆12 Jul 7, 2022 Updated 3 years ago
- Higher Order SVD implementation in PyTorch ☆13 Nov 14, 2022 Updated 3 years ago
- PANDA: Prompt Transfer Meets Knowledge Distillation for Efficient Model Adaptation ☆16 Mar 28, 2023 Updated 2 years ago
- Texture and Structure Awareness Network ☆12 Jul 25, 2019 Updated 6 years ago
- Code for "Disentangling Factors of Variation by Mixing Them" ☆16 Mar 13, 2019 Updated 6 years ago
- ☆17 Nov 22, 2022 Updated 3 years ago
- Code for paper "FuSeConv: Fully Separable Convolutions for Fast Inference on Systolic Arrays" published at DATE 2021 ☆18 Aug 23, 2021 Updated 4 years ago
- ☆15 Feb 27, 2024 Updated 2 years ago
- Uses TensorFlow to classify frames from YouTube videos. ☆14 Oct 18, 2016 Updated 9 years ago
- Some methods for sampling data points from a given distribution. ☆17 Jul 16, 2018 Updated 7 years ago
- Official PyTorch implementation for Make Prompts Adaptable: Bayesian Modeling for Vision-Language Prompt Learning with Data-Dependent Pri … ☆19 Apr 30, 2024 Updated last year
- Learning the PyTorch framework for deep learning ☆14 Jan 2, 2024 Updated 2 years ago
- [NeurIPS 2021 | AIJ 2024] Multi-Objective Meta Learning ☆17 Jul 31, 2024 Updated last year
- An IPython Notebook-based tutorial on Boost.Python ☆11 Jan 9, 2015 Updated 11 years ago