Transformer reimplemented in PyTorch
☆93Jan 18, 2024Updated 2 years ago
Alternatives and similar repositories for pytorch-transformer
Users that are interested in pytorch-transformer are comparing it to the libraries listed below.
- Stable Diffusion reimplemented in PyTorch☆210Aug 6, 2023Updated 2 years ago
- Diffusion Transformers (DiTs) trained on MNIST dataset☆178Apr 4, 2024Updated 2 years ago
- simple decoder-only GPT model in pytorch☆44May 19, 2024Updated 2 years ago
- ☆18Apr 15, 2021Updated 5 years ago
- A simple, easy-to-hack GraphRAG implementation☆15Sep 21, 2024Updated last year
- LLM Tokenizer with BPE algorithm☆49May 7, 2024Updated 2 years ago
- ☆23Oct 20, 2020Updated 5 years ago
- Bridge Autoware and Carla with Zenoh☆19Apr 9, 2026Updated last month
- Official codes and datasets for ACM MM23 paper "3DStyle-Diffusion: Pursuing Fine-grained Text-driven 3D Stylization with 2D Diffusion Mod…☆26Sep 13, 2024Updated last year
- Agricultural products e-commerce management system built with Python and Vue; course project / graduation project☆10Oct 31, 2024Updated last year
- How FastAPI's async I/O and thread pool work☆18Feb 12, 2024Updated 2 years ago
- Created a simple neural network using C++17 standard and the Eigen library that supports both forward and backward propagation.☆11Jul 27, 2024Updated last year
- a super easy clip model with mnist dataset for study☆174Mar 17, 2024Updated 2 years ago
- (NBCE)Naive Bayes-based Context Extension on ChatGLM-6b☆15Jun 7, 2023Updated 2 years ago
- Huggingface PPO Demo☆28Sep 7, 2025Updated 8 months ago
- [3DV 2024] Revisiting Depth Completion from a Stereo Matching Perspective for Cross-domain Generalization☆34Mar 17, 2025Updated last year
- Domain Adaptation on Point Clouds via Geometry-Aware Implicits☆26Sep 7, 2023Updated 2 years ago
- Official implementation of the paper: "A deeper look at depth pruning of LLMs"☆15Jul 24, 2024Updated last year
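- Play Tetris using a gesture recognition algorithm☆10Mar 30, 2021Updated 5 years ago
- Graduation project: visual question answering based on deep learning☆14Jun 20, 2018Updated 7 years ago
- Master's thesis code: deep reinforcement learning☆10Apr 4, 2020Updated 6 years ago
- Reproductions of code from NLP papers☆14Jul 15, 2020Updated 5 years ago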
- Edge-Aware Mirror Network for Camouflaged Object Detection (EAMNet, IEEE ICME 2023).☆13Jul 8, 2023Updated 2 years ago
- Copy Telegram Signal to your Metatrader account MT4 - MT5 for automated trade copying☆15Aug 25, 2020Updated 5 years ago
- Flash Attention in ~100 lines of CUDA (forward pass only)☆12Jun 10, 2024Updated last year
- Geometric Superpixel Representations for Efficient Image Classification with Graph Neural Networks☆13Aug 10, 2023Updated 2 years ago
- Code for "Threat Scenarios and Best Practices for Neural Fake News Detection: A Case Study on COVID"☆10Nov 19, 2022Updated 3 years ago
- Debiasing Scores and Prompts of 2D Diffusion for View-consistent Text-to-3D Generation (D-SDS) | NeurIPS 2023☆46Feb 18, 2024Updated 2 years ago
- LLM-based knowledge graph generation tool that automatically extracts entities and relations from text and visualizes them.☆29Jan 24, 2025Updated last year
- Code for AAAI 2024 paper: CR-SAM: Curvature Regularized Sharpness-Aware Minimization☆12Nov 29, 2024Updated last year
- fork of Karpathy's nanoGPT with custom datasets☆11Jul 25, 2023Updated 2 years ago
- ☆22Jan 23, 2024Updated 2 years ago
- [CVPR 2025] GUI-Xplore: Empowering Generalizable GUI Agents with One Exploration☆20Mar 21, 2025Updated last year
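- Fine-tuning the Qwen1.5-0.5B-Chat model for general information extraction, aiming to: evaluate generative methods against extractive NER; give beginners a simple fine-tuning workflow with minimal code; and handle data formatting for large-model training.☆14Sep 6, 2024Updated last year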
- ☆11Apr 7, 2024Updated 2 years ago
- ☆122Jun 30, 2024Updated last year
- [BSPC22]The Code of “CGRNet: Contour-Guided Graph Reasoning Network for Ambiguous Biomedical Image Segmentation”☆12Mar 11, 2022Updated 4 years ago
- Performance of the C++ interface of flash attention and flash attention v2 in large language model (LLM) inference scenarios.☆16Aug 31, 2023Updated 2 years ago
- Google's MediaPipe (v0.8.9) and Python Wheel installer for Jetson Nano (JetPack 4.6) compiled for CUDA 10.2☆16Jun 7, 2023Updated 2 years ago