Transformer reimplemented in PyTorch
☆93Jan 18, 2024Updated 2 years ago
Alternatives and similar repositories for pytorch-transformer
Users that are interested in pytorch-transformer are comparing it to the libraries listed below.
- Stable Diffusion reimplemented in PyTorch☆210Aug 6, 2023Updated 2 years ago
- Diffusion Transformers (DiTs) trained on MNIST dataset☆179Apr 4, 2024Updated 2 years ago
- simple decoder-only GPT model in PyTorch☆44May 19, 2024Updated 2 years ago
- XGBoost reimplementation☆15Oct 6, 2024Updated last year
- ☆32Apr 6, 2026Updated 2 months ago
- A simple, easy-to-hack GraphRAG implementation☆15Sep 21, 2024Updated last year
- LLM Tokenizer with BPE algorithm☆49May 7, 2024Updated 2 years ago
- ☆11Aug 9, 2021Updated 4 years ago
- A simple framework for image restoration that includes ECBSR, ELAN, and other SOTA methods.☆49Nov 13, 2022Updated 3 years ago
- CalvinFS project using C/C++☆12May 25, 2017Updated 9 years ago
- Efficient retrieval head analysis with triton flash attention that supports topK probability☆13Jun 15, 2024Updated last year
- Automated Segmentation of Prohibited Items in X-ray Baggage Images Using Dense De-overlap Attention Snake, TMM 2022☆13Dec 28, 2022Updated 3 years ago
- ☆13Oct 21, 2023Updated 2 years ago
- A super easy CLIP model with the MNIST dataset, for study☆175Mar 17, 2024Updated 2 years ago
- ☆12Nov 7, 2022Updated 3 years ago
- (NBCE)Naive Bayes-based Context Extension on ChatGLM-6b☆15Jun 7, 2023Updated 3 years ago
- Huggingface PPO Demo☆28Sep 7, 2025Updated 9 months ago
- Domain Adaptation on Point Clouds via Geometry-Aware Implicits☆26Sep 7, 2023Updated 2 years ago
- maskrcnn with Latent Graph Neural Network, experiments of "LatentGNN"(ICML2019)☆14Sep 2, 2019Updated 6 years ago
- ☆15Jan 24, 2022Updated 4 years ago
- Quantize yolov7 using pytorch_quantization.🚀🚀🚀☆12Oct 20, 2023Updated 2 years ago
- Code for "Threat Scenarios and Best Practices for Neural Fake News Detection: A Case Study on COVID"☆10Nov 19, 2022Updated 3 years ago
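- Fine-tunes Alibaba's open-source text detection model, using OCR results returned by the Hehe (合合) recognition service as initial training data, so that the model better fits the specific scenario of 10,000 images and improves text recognition accuracy.☆10Dec 9, 2024Updated last year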
- [CVPR 2026] Variation-aware Vision Token Dropping for Faster Large Vision-Language Models☆30May 27, 2026Updated 2 weeks ago
- fork of karpathy's nanoGPT with custom datasets☆11Jul 25, 2023Updated 2 years ago
- [CVPR 2025] GUI-Xplore: Empowering Generalizable GUI Agents with One Exploration☆20Mar 21, 2025Updated last year
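- Fine-tunes the Qwen1.5-0.5B-Chat model on a general information extraction task, aiming to: verify how a generative approach compares with extractive NER; give beginners a simple fine-tuning workflow with as little code as possible; and handle data formatting for large-model training.☆14Sep 6, 2024Updated last year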
- This repository shows the implementation of the Trained Born Iterative Method (TBIM) applied for electromagnetic imaging.☆12Nov 9, 2022Updated 3 years ago
- A script for automating XJTU postgraduate sports check-in and check-out.☆80Oct 12, 2025Updated 8 months ago
- [BSPC22] The code of "CGRNet: Contour-Guided Graph Reasoning Network for Ambiguous Biomedical Image Segmentation"☆12Mar 11, 2022Updated 4 years ago
- The Land-Diffuser is a novel application of the Denoising Diffusion Probabilistic Model (DDPM) in the realm of 3D Talking Head generation…☆13Dec 23, 2023Updated 2 years ago
- ☆14Jan 10, 2025Updated last year
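- AIRS-2025 Track 2: "Interstellar Mineral Veins" Mars Mineral Hyperspectral Classification Challenge☆12May 7, 2025Updated last year
- Source code of "Reinforcement Learning with Token-level Feedback for Controllable Text Generation" (NAACL 2024)☆17Dec 8, 2024Updated last year
- Reproductions of competition papers☆18Feb 10, 2019Updated 7 years ago
- Fine-tuning large language models with the DPO algorithm; simple and easy to get started.☆52Jul 3, 2024Updated last year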
- ☆13Aug 7, 2021Updated 4 years ago
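- fatcache study notes☆19Nov 17, 2014Updated 11 years ago
- An LLM application that integrates RAG, fine-tuning, and chain-of-thought! It has recently added knowledge graphs and agents as well, with many more updates to come!☆18Oct 12, 2024Updated last year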