A pytorch Implementation of the Transformer: Attention Is All You Need
☆14Jun 7, 2024Updated last year
Alternatives and similar repositories for transformer-pytorch
Users that are interested in transformer-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆30Feb 3, 2026Updated last month
- Official Repository for the ICLR 2022 paper "Generalization of Neural Combinatorial Solvers through the Lens of Adversarial Robustness"☆13Nov 20, 2022Updated 3 years ago
- Sparking Using Java8☆17Feb 28, 2015Updated 11 years ago
- 基于deepseek、qwen3大模型,lora sft 医疗行业数据☆15Dec 2, 2025Updated 3 months ago
- [ICLR 2025] Code&Data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization"☆14Jun 21, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- An intelligent MCP server that provides tools for collecting and documenting code from directories☆15Dec 22, 2024Updated last year
- GolangGin框架中文快速入门文档☆10Jan 18, 2021Updated 5 years ago
- security后端demo集成JWT☆16Jul 12, 2025Updated 8 months ago
- ☆12Sep 15, 2021Updated 4 years ago
- Apache Spark Examples☆17Aug 6, 2016Updated 9 years ago
- ☆12Mar 18, 2022Updated 4 years ago
- commons-pools使用示例☆15Jun 15, 2016Updated 9 years ago
- xgboost复现☆15Oct 6, 2024Updated last year
- The repository for 'Unsupervised Learning for Combinatorial Optimization with Principled Proxy Design'☆16Oct 9, 2022Updated 3 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Matlab code for Low rank Matrix Factorization with AMP☆17Aug 18, 2015Updated 10 years ago
- Trial Reasoner for AI that Learns☆18Sep 17, 2025Updated 6 months ago
- ☆10Apr 15, 2023Updated 2 years ago
- PyTorch-Geometric Implementation of MarkovGNN method published in Graph Learning@WWW 2022 titled "MarkovGNN: Graph Neural Networks on Mar…☆13Feb 8, 2022Updated 4 years ago
- ☆17Nov 7, 2024Updated last year
- [EMNLP 2023] ReLM: Leveraging Language Models for Enhanced Chemical Reaction Prediction.☆22Jan 28, 2024Updated 2 years ago
- ☆22Oct 31, 2024Updated last year
- I'm an AI assistant with extensive knowledge in psychology, and my name is Care.☆25Aug 25, 2025Updated 7 months ago
- An automatic prompt iteration and optimization generator suitable for any scenario☆16Jan 31, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- This repo implements and trains Vision Transformer (VIT) on a synthetically generated dataset which has colored mnist images on texture b…☆24Feb 6, 2024Updated 2 years ago
- ☆24Mar 15, 2022Updated 4 years ago
- A medical knowledge graph related website, utilizing technologies and frameworks such as Django, Bootstrap, Echarts, with MySQL and neo4j…☆12Jun 9, 2024Updated last year
- 一个存放各种方向上 HelloWorld级别的 入门/介绍程序及环境搭建文章及相关分析/分享视频的仓库......☆19Oct 17, 2024Updated last year
- code for "Neural Jump Ordinary Differential Equations"☆31Feb 16, 2023Updated 3 years ago
- ☆19Aug 9, 2024Updated last year
- ☆19Feb 20, 2025Updated last year
- Copy the MLP of llama3 8 times as 8 experts , created a router with random initialization,add load balancing loss to construct an 8x8b Mo…☆27Jul 1, 2024Updated last year
- Implementation for the paper "Extrapolating paths with graph neural networks"☆37May 22, 2020Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Code for "Decision-Focused Learning without Differentiable Optimization: Learning Locally Optimized Decision Losses"☆31Mar 18, 2024Updated 2 years ago
- ☆35Sep 14, 2024Updated last year
- ☆30Oct 10, 2021Updated 4 years ago
- Akka学习的Demo☆33Feb 14, 2016Updated 10 years ago
- ☆38Jan 17, 2025Updated last year
- ☆114Jun 12, 2025Updated 9 months ago
- The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.☆18Dec 28, 2023Updated 2 years ago