compare the theory attention gradient with PyTorch attention gradient
☆16Apr 1, 2024Updated last year
Alternatives and similar repositories for Transformer-attention
Users that are interested in Transformer-attention are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLR 2024 Spotlight] Code for ICLR 2024 paper "Towards Robust Offline Reinforcement Learning under Diverse Data Corruption"☆20Nov 25, 2024Updated last year
- ☆11Jul 4, 2022Updated 3 years ago
- 测试 https://huggingface.co/OFA-Sys/gsm8k-rft-llama7b-u13b 的 GSM8K 分数☆15Aug 10, 2023Updated 2 years ago
- The database contains synthesized inverse synthetic aperture radar images of seven aircraft models.☆16Mar 21, 2016Updated 10 years ago
- ☆13Dec 2, 2025Updated 3 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- The interface between probabilistic model checking and data-driven policy learning.☆17Updated this week
- Official PyTorch implementation for Label-Noise Robust Diffusion Models (TDSM) in ICLR 2024.☆15Apr 29, 2024Updated last year
- Official PyTorch implementation for Make Prompts Adaptable: Bayesian Modeling for Vision-Language Prompt Learning with Data-Dependent Pri…☆19Apr 30, 2024Updated last year
- Open AI Gym Environment for the Dobot Magician Robotic Arm☆12Jul 9, 2018Updated 7 years ago
- Official PyTorch code for CVPR 2021 paper "AutoDO: Robust AutoAugment for Biased Data with Label Noise via Scalable Probabilistic Implici…☆24Oct 26, 2022Updated 3 years ago
- Salient Objects in Clutter, arXiv, 2021 (ECCV2018 extenstion).☆11Jun 17, 2021Updated 4 years ago
- ☆23Mar 17, 2026Updated last week
- ☆18May 23, 2021Updated 4 years ago
- Official PyTorch implementation of "Loss-Curvature Matching for Dataset Selection and Condensation" (AISTATS 2023)☆22Mar 14, 2023Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Official implementation for KDD'22 paper "Learning Fair Representation via Distributional Contrastive Disentanglement"☆23Jun 25, 2022Updated 3 years ago
- ☆31Jul 4, 2024Updated last year
- ☆14Feb 7, 2020Updated 6 years ago
- code for COLING paper "A Hybrid Model of Classification and Generation for Spatial Relation Extraction"☆10Oct 20, 2022Updated 3 years ago
- ☆24Apr 29, 2024Updated last year
- Official Pytorch implementation of 'Facing the Elephant in the Room: Visual Prompt Tuning or Full Finetuning'? (ICLR2024)☆13Mar 8, 2024Updated 2 years ago
- CLEVER: Code Lean Evaluation for Verified End-to-end Reasoning☆37Dec 18, 2025Updated 3 months ago
- Official PyTorch implementation for Frequency Domain-based Dataset Distillation [NeurIPS 2023]☆31May 7, 2024Updated last year
- A Gateway for connecting application services in different domains, networks, and cloud infrastructures☆23Feb 1, 2026Updated last month
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- 😎 Awesome papers on token redundancy reduction☆11Mar 12, 2025Updated last year
- Code for paper "FuSeConv Fully Separable Convolutions for Fast Inference on Systolic Arrays" published at DATE 2021☆18Aug 23, 2021Updated 4 years ago
- Re-implementation of Exploiting Edge Features in Graph Neural Networks☆11Apr 7, 2022Updated 3 years ago
- Sparsity-Driven ISAR Imaging Based on Two-Dimensional ADMM☆32Jun 7, 2025Updated 9 months ago
- A Spatial-Temporal Recurrent Neural Network for Video Saliency Prediction (TIP2021)☆13Jul 7, 2022Updated 3 years ago
- PANDA: Prompt Transfer Meets Knowledge Distillation for Efficient Model Adaptation☆16Mar 28, 2023Updated 2 years ago
- ☆28Oct 21, 2020Updated 5 years ago
- ☆10Nov 22, 2022Updated 3 years ago
- pytorch implementation of ABC : Auxiliary Balanced Classifier for Class-imbalanced Semi-supervised Learning☆39Nov 20, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Implementation of Robust Reinforcement Learning using Offline Data [NeurIPS'22]☆25Nov 9, 2024Updated last year
- Verina (Verifiable Code Generation Arena) is a high-quality benchmark enabling a comprehensive and modular evaluation of code, specificat…☆53Mar 10, 2026Updated 2 weeks ago
- Tools for auto-generating the battery-materials database.☆50Sep 6, 2022Updated 3 years ago
- ☆20Jul 5, 2024Updated last year
- ☆12Feb 23, 2022Updated 4 years ago
- ☆21Mar 10, 2021Updated 5 years ago
- Python 网络爬虫的案例,爬取的网站有豆瓣、美团、哔哩哔哩、图片资源、古诗词、广东工业大学官网等。☆12Apr 30, 2021Updated 4 years ago