pytorch复现transformer
☆93Jan 18, 2024Updated 2 years ago
Alternatives and similar repositories for pytorch-transformer
Users that are interested in pytorch-transformer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- pytorch复现stable diffusion☆211Aug 6, 2023Updated 2 years ago
- Diffusion Transformers (DiTs) trained on MNIST dataset☆180Apr 4, 2024Updated 2 years ago
- ECCV24, NeurIPS24, CVPR26*2, ECCV26, Benchmarking Generalized Out-of-Distribution Detection with Vision-Language Models☆40Updated this week
- UTAUTAI(Unrestricted Tune Automated Technology Artificial Interigence)☆16Oct 27, 2023Updated 2 years ago
- 通义千问的DPO训练☆66Sep 21, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 掘金项目API 说明文档☆10Oct 15, 2025Updated 8 months ago
- Efficient retrieval head analysis with triton flash attention that supports topK probability☆13Jun 15, 2024Updated 2 years ago
- ACL24☆11Jun 7, 2024Updated 2 years ago
- ☆13Oct 21, 2023Updated 2 years ago
- Created a simple neural network using C++17 standard and the Eigen library that supports both forward and backward propagation.☆11Jul 27, 2024Updated last year
- a super easy clip model with mnist dataset for study☆176Mar 17, 2024Updated 2 years ago
- ☆12Nov 7, 2022Updated 3 years ago
- (NBCE)Naive Bayes-based Context Extension on ChatGLM-6b☆15Jun 7, 2023Updated 3 years ago
- Official implementation of the paper: "A deeper look at depth pruning of LLMs"☆15Jul 24, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Unofficial PyTorch implementation of DALL-E 2 by OpenAI☆10Apr 6, 2022Updated 4 years ago
- 毕业设计: 基于深度学习的视觉问答☆13Jun 20, 2018Updated 8 years ago
- 硕士毕业论文代码 深度强化学习☆10Apr 4, 2020Updated 6 years ago
- The official repository of DreamMover☆34Sep 20, 2024Updated last year
- Implementation of SoundtStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"☆13Jan 27, 2025Updated last year
- NLP方向的论文代码复现☆14Jul 15, 2020Updated 5 years ago
- benchmark driver for "Can Learned Models Replace Hash Functions?" VLDB submission☆16Oct 31, 2023Updated 2 years ago
- simple implementation of Expected Gradients and Integrated Gradients by pytorch☆12May 11, 2022Updated 4 years ago
- Quantize yolov7 using pytorch_quantization.🚀🚀🚀☆12Oct 20, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Debiasing Scores and Prompts of 2D Diffusion for View-consistent Text-to-3D Generation (D-SDS) | NeurIPS 2023☆46Feb 18, 2024Updated 2 years ago
- 掼蛋AI☆13Oct 18, 2020Updated 5 years ago
- 基于大语言模型LLM 的知识图谱生成工具,支持从文本中自动提取实体关系并可视化展示。☆31Jan 24, 2025Updated last year
- 毕业设计-基于YOLOv8模型的车牌识别研究☆18May 10, 2024Updated 2 years ago
- The official repo for the code and data of paper SMART☆40Feb 20, 2025Updated last year
- [CVPR 2025] GUI-Xplore: Empowering Generalizable GUI Agents with One Exploration☆21Mar 21, 2025Updated last year
- 使用Qwen1.5-0.5B-Chat模型进行通用信息抽取任务的微调,旨在: 验证生成式方法相较于抽取式NER的效果; 为新手提供简易的模型微调流程,尽量减少代码量; 大模型训练的数据格式处理。☆14Sep 6, 2024Updated last year
- ☆11Apr 7, 2024Updated 2 years ago
- This repository shows the implementation of the Trained Born Iterative Method (TBIM) applied for electromagnetic imaging.☆12Nov 9, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆122Jun 30, 2024Updated 2 years ago
- Google's MediaPipe (v0.8.9) and Python Wheel installer for Jetson Nano (JetPack 4.6) compiled for CUDA 10.2☆16Jun 7, 2023Updated 3 years ago
- Official repository for the paper “Learned Data-aware Image Representations of Line Charts for Similarity Search” (SIGMOD'23)☆13Jan 17, 2024Updated 2 years ago
- Uniform, Explicit and Implicit Laplacian Mesh Smoothing☆39Mar 3, 2018Updated 8 years ago
- 基于DPO算法微调语言大模型,简单好上手。☆52Jul 3, 2024Updated 2 years ago
- Cheetah is a system that optimizes queries using programmable switches.☆21Jun 25, 2020Updated 6 years ago
- 本科毕业设计-基于深度学习的模糊人脸图像增强系统的设计与实现☆10Jan 12, 2018Updated 8 years ago