compare the theory attention gradient with PyTorch attention gradient
☆16Apr 1, 2024Updated 2 years ago
Alternatives and similar repositories for Transformer-attention
Users that are interested in Transformer-attention are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLR 2024 Spotlight] Code for ICLR 2024 paper "Towards Robust Offline Reinforcement Learning under Diverse Data Corruption"☆22Nov 25, 2024Updated last year
- ☆11Jul 4, 2022Updated 3 years ago
- 测试 https://huggingface.co/OFA-Sys/gsm8k-rft-llama7b-u13b 的 GSM8K 分数☆15Aug 10, 2023Updated 2 years ago
- The database contains synthesized inverse synthetic aperture radar images of seven aircraft models.☆16Mar 21, 2016Updated 10 years ago
- ☆13Dec 2, 2025Updated 5 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- The interface between probabilistic model checking and data-driven policy learning.☆19Apr 21, 2026Updated 2 weeks ago
- Official PyTorch implementation for Label-Noise Robust Diffusion Models (TDSM) in ICLR 2024.☆15Apr 29, 2024Updated 2 years ago
- Official PyTorch implementation for Make Prompts Adaptable: Bayesian Modeling for Vision-Language Prompt Learning with Data-Dependent Pri…☆19Apr 30, 2024Updated 2 years ago
- Open AI Gym Environment for the Dobot Magician Robotic Arm☆12Jul 9, 2018Updated 7 years ago
- Official PyTorch code for CVPR 2021 paper "AutoDO: Robust AutoAugment for Biased Data with Label Noise via Scalable Probabilistic Implici…☆24Oct 26, 2022Updated 3 years ago
- Salient Objects in Clutter, arXiv, 2021 (ECCV2018 extenstion).☆11Jun 17, 2021Updated 4 years ago
- ☆24Mar 25, 2026Updated last month
- ☆18May 23, 2021Updated 4 years ago
- Official PyTorch implementation of "Loss-Curvature Matching for Dataset Selection and Condensation" (AISTATS 2023)☆22Mar 14, 2023Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Official implementation for KDD'22 paper "Learning Fair Representation via Distributional Contrastive Disentanglement"☆23Jun 25, 2022Updated 3 years ago
- ☆33Jul 4, 2024Updated last year
- ☆14Feb 7, 2020Updated 6 years ago
- code for COLING paper "A Hybrid Model of Classification and Generation for Spatial Relation Extraction"☆10Oct 20, 2022Updated 3 years ago
- ☆24Apr 29, 2024Updated 2 years ago
- Official Pytorch implementation of 'Facing the Elephant in the Room: Visual Prompt Tuning or Full Finetuning'? (ICLR2024)☆13Mar 8, 2024Updated 2 years ago
- CLEVER: Code Lean Evaluation for Verified End-to-end Reasoning☆41Apr 3, 2026Updated last month
- Official PyTorch implementation for Frequency Domain-based Dataset Distillation [NeurIPS 2023]☆31May 7, 2024Updated last year
- A Gateway for connecting application services in different domains, networks, and cloud infrastructures☆23Feb 1, 2026Updated 3 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 😎 Awesome papers on token redundancy reduction☆11Mar 12, 2025Updated last year
- Code for paper "FuSeConv Fully Separable Convolutions for Fast Inference on Systolic Arrays" published at DATE 2021☆18Aug 23, 2021Updated 4 years ago
- Re-implementation of Exploiting Edge Features in Graph Neural Networks☆11Apr 7, 2022Updated 4 years ago
- Sparsity-Driven ISAR Imaging Based on Two-Dimensional ADMM☆33Jun 7, 2025Updated 10 months ago
- A Spatial-Temporal Recurrent Neural Network for Video Saliency Prediction (TIP2021)☆13Jul 7, 2022Updated 3 years ago
- PANDA: Prompt Transfer Meets Knowledge Distillation for Efficient Model Adaptation☆16Mar 28, 2023Updated 3 years ago
- ☆28Oct 21, 2020Updated 5 years ago
- ☆10Nov 22, 2022Updated 3 years ago
- pytorch implementation of ABC : Auxiliary Balanced Classifier for Class-imbalanced Semi-supervised Learning☆37Nov 20, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Verina (Verifiable Code Generation Arena) is a high-quality benchmark enabling a comprehensive and modular evaluation of code, specificat…☆61Apr 27, 2026Updated last week
- Implementation of Robust Reinforcement Learning using Offline Data [NeurIPS'22]☆25Nov 9, 2024Updated last year
- Tools for auto-generating the battery-materials database.☆50Sep 6, 2022Updated 3 years ago
- ☆12Feb 23, 2022Updated 4 years ago
- ☆21Jul 5, 2024Updated last year
- ☆21Mar 10, 2021Updated 5 years ago
- Python 网络爬虫的案 例,爬取的网站有豆瓣、美团、哔哩哔哩、图片资源、古诗词、广东工业大学官网等。☆12Apr 30, 2021Updated 5 years ago