Transformer的完整实现。详细构建Encoder、Decoder、Self-attention。以实际例子进行展示,有完整的输入、训练、预测过程。可用于学习理解self-attention和Transformer
☆129Apr 10, 2025Updated last year
Alternatives and similar repositories for Self-Attention
Users that are interested in Self-Attention are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- clash☆21Feb 14, 2026Updated 2 months ago
- ☆10Apr 15, 2023Updated 3 years ago
- The code implementation for TTCS: Test-Time Curriculum Synthesis for Self-Evolving.☆45Apr 22, 2026Updated 2 weeks ago
- On-the-fly Definition Augmentation of LLMs for Biomedical NER☆14Apr 14, 2025Updated last year
- Official Code for FedRule: Federated Rule Recommendation System with Graph Neural Networks☆14Sep 12, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Source code for SWIFT, an efficient reward model.☆21Jan 13, 2026Updated 3 months ago
- Function for decomposing a signal according to the Multivariate Variational Mode Decomposition in Pytorch☆29Oct 5, 2023Updated 2 years ago
- Anomaly detection is a critical step towards building a secure and trustworthy system. The primary purpose of a system log is to record s…☆14Dec 7, 2021Updated 4 years ago
- Intrinsic Curiosity Module (ICM) + PPO on the Pyramid and PushBlock environment.☆12Sep 3, 2019Updated 6 years ago
- unofficial implementation of https://arxiv.org/pdf/2301.08871v1.pdf on pytorch☆15Apr 20, 2023Updated 3 years ago
- A medical knowledge graph related website, utilizing technologies and frameworks such as Django, Bootstrap, Echarts, with MySQL and neo4j…☆12Jun 9, 2024Updated last year
- This folder contains some of the data sets commonly used in the field of multivariate time series forecasting.☆10Jul 28, 2023Updated 2 years ago
- Mixture-of-Basis-Experts for Compressing MoE-based LLMs☆33Dec 24, 2025Updated 4 months ago
- 使用LLaMA-Factory微调多模态大语言模型的示例代码 Demo of Finetuning Multimodal LLM with LLaMA-Factory☆58Sep 8, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆13Nov 18, 2025Updated 5 months ago
- Coming soon~☆14Jul 15, 2025Updated 9 months ago
- Transferring Genshin PVs into a freehand style with Diffusion Model.☆10Jun 5, 2024Updated last year
- [AAAI 2023] Official PyTorch implementation for "Untargeted Attack against Federated Recommendation Systems via Poisonous Item Embeddings…☆27Jan 18, 2023Updated 3 years ago
- The 💩DaBian programming language. 💩"答辩"编程语言, 编程不是💩"答辩"的我不学!☆10Sep 28, 2023Updated 2 years ago
- The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.☆17Jun 5, 2024Updated last year
- The python implementation of our "UA-FedRec: Untargeted Attack on Federated News Recommendation" in KDD 2023.☆20Aug 2, 2022Updated 3 years ago
- ☆12Nov 28, 2022Updated 3 years ago
- 毕业设计-城市公交查询管理系统☆19Aug 8, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- A simple PDF Viewer for Jetpack Compose☆22Sep 22, 2023Updated 2 years ago
- CCL2022汉语学习者文本纠错评测任务赛道二——CGED-8第一名解决方案☆54May 8, 2023Updated 3 years ago
- Self Reproduction Code of Paper "Reducing Transformer Key-Value Cache Size with Cross-Layer Attention (MIT CSAIL)☆17May 24, 2024Updated last year
- qadrc - dynamic range compression☆33Aug 20, 2021Updated 4 years ago
- 📝 Summary of recommendation, advertising and search models.【推广搜技术汇总⭐】☆25Feb 2, 2023Updated 3 years ago
- Generative Modeling via Drifting in MLX☆43Feb 6, 2026Updated 3 months ago
- MCP Client Implemented to FastAPI☆10Feb 26, 2025Updated last year
- 基于 Anatole 开发的 Halo 博客主题 Knarc☆13Apr 6, 2023Updated 3 years ago
- ☆20Apr 15, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A Chinese Character BERT Trained with Multi-Level Masking☆12Sep 24, 2023Updated 2 years ago
- ☆90Aug 21, 2023Updated 2 years ago
- Unofficial implementation of the algorithm described in the paper Federated Collaborative Filtering for Privacy-Preserving Personalized R…☆22Oct 29, 2021Updated 4 years ago
- ☆12Mar 23, 2024Updated 2 years ago
- pre-training llama3 using chinese☆13May 1, 2024Updated 2 years ago
- Hyperledger Indy/Sovrin/DID Comprehensive Architecture Reference Model (INDY ARM) - Draft document for discussion purposes☆14Jan 25, 2021Updated 5 years ago
- 记录自己对《代码审计》的理解和总结,对危险函数的深入分析以及在p牛的博客和代码审计圈的收获☆10Feb 27, 2018Updated 8 years ago