A PyTorch reimplementation of the Transformer
☆92 · Jan 18, 2024 · Updated 2 years ago
Alternatives and similar repositories for pytorch-transformer
Users that are interested in pytorch-transformer are comparing it to the libraries listed below.
- A PyTorch reimplementation of Stable Diffusion ☆211 · Aug 6, 2023 · Updated 2 years ago
- Vision Transformer on the MNIST dataset ☆48 · Mar 24, 2024 · Updated 2 years ago
- A simple deep learning framework inspired by Dezero and PyTorch ☆31 · Jan 27, 2025 · Updated last year
- Diffusion Transformers (DiTs) trained on the MNIST dataset ☆172 · Apr 4, 2024 · Updated 2 years ago
- ☆31 · Updated this week
- XGBoost reimplementation ☆15 · Oct 6, 2024 · Updated last year
- Tongyi Qianwen (Qwen) SFT experiments ☆82 · Jan 6, 2024 · Updated 2 years ago
- ☆23 · Oct 20, 2020 · Updated 5 years ago
- DPO training for Tongyi Qianwen (Qwen) ☆64 · Sep 21, 2024 · Updated last year
- Efficient retrieval head analysis with triton flash attention that supports topK probability ☆13 · Jun 15, 2024 · Updated last year
- Balatro calculator ☆17 · Mar 31, 2026 · Updated last week
- Official codes and datasets for ACM MM23 paper "3DStyle-Diffusion: Pursuing Fine-grained Text-driven 3D Stylization with 2D Diffusion Mod… ☆26 · Sep 13, 2024 · Updated last year
- A super easy CLIP model with the MNIST dataset for study ☆172 · Mar 17, 2024 · Updated 2 years ago
- Use a pretrained BERT model to automatically generate grammar multiple choice questions (MCQ) from any news article or story. ☆13 · Oct 2, 2019 · Updated 6 years ago
- VQVAE for video prediction ☆31 · Apr 22, 2022 · Updated 3 years ago
- ☆11 · Jan 3, 2023 · Updated 3 years ago
- Official implementation of the paper: "A deeper look at depth pruning of LLMs" ☆15 · Jul 24, 2024 · Updated last year
- Playing Tetris with a gesture recognition algorithm ☆10 · Mar 30, 2021 · Updated 5 years ago
- Unofficial PyTorch implementation of DALL-E 2 by OpenAI ☆10 · Apr 6, 2022 · Updated 4 years ago
- Graduation project: research on license plate recognition based on the YOLOv8 model ☆18 · May 10, 2024 · Updated last year
- Graduation project: visual question answering based on deep learning ☆14 · Jun 20, 2018 · Updated 7 years ago
- Master's thesis code: deep reinforcement learning ☆10 · Apr 4, 2020 · Updated 6 years ago
- Code reproductions of NLP papers ☆14 · Jul 15, 2020 · Updated 5 years ago
- Flash Attention in ~100 lines of CUDA (forward pass only) ☆12 · Jun 10, 2024 · Updated last year
- Library management system course project (C# + MySQL) ☆14 · Apr 20, 2023 · Updated 2 years ago
- Debiasing Scores and Prompts of 2D Diffusion for View-consistent Text-to-3D Generation (D-SDS) | NeurIPS 2023 ☆46 · Feb 18, 2024 · Updated 2 years ago
- Guandan (掼蛋) AI ☆13 · Oct 18, 2020 · Updated 5 years ago
- ☆13 · Nov 10, 2024 · Updated last year
- Fine-tunes Alibaba's open-source text detection model, using OCR results returned by the 合合 (Hehe) recognition service as initial training data, so that the model better fits a specific scenario of 10,000 images and recognizes text more accurately. ☆10 · Dec 9, 2024 · Updated last year
- Code for AAAI 2024 paper: CR-SAM: Curvature Regularized Sharpness-Aware Minimization ☆12 · Nov 29, 2024 · Updated last year
- Fork of Karpathy's nanoGPT with custom datasets ☆10 · Jul 25, 2023 · Updated 2 years ago
- [CVPR 2025] GUI-Xplore: Empowering Generalizable GUI Agents with One Exploration ☆20 · Mar 21, 2025 · Updated last year
- Fine-tuning the Qwen1.5-0.5B-Chat model on a general information extraction task, aiming to: verify how the generative approach performs compared with extractive NER; give beginners a simple fine-tuning workflow with as little code as possible; and handle data formatting for large-model training. ☆15 · Sep 6, 2024 · Updated last year
- ☆11 · Apr 7, 2024 · Updated 2 years ago
- ☆72 · Oct 12, 2025 · Updated 6 months ago
- ☆122 · Jun 30, 2024 · Updated last year
- Courseware and code used in the bilibili video walkthroughs ☆36 · Mar 24, 2026 · Updated 2 weeks ago
- [ICML 2025] Retraining-Free Merging of Sparse MoE via Hierarchical Clustering ☆23 · Oct 26, 2025 · Updated 5 months ago
- Performance of the C++ interface of flash attention and flash attention v2 in large language model (LLM) inference scenarios. ☆16 · Aug 31, 2023 · Updated 2 years ago