关于Transformer模型的最简洁pytorch实现,包含详细注释
☆240Nov 13, 2023Updated 2 years ago
Alternatives and similar repositories for MyTransformer_pytorch
Users that are interested in MyTransformer_pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆49Apr 15, 2024Updated 2 years ago
- 51job,猎聘,拉勾,智联,Boss直聘 爬虫,使用scrapy框架和crawlab平台☆18Sep 10, 2020Updated 5 years ago
- Triton Compiler related materials.☆44Mar 16, 2026Updated 2 months ago
- pytorch实现聊天机器人,seq2seq模型☆10Feb 9, 2020Updated 6 years ago
- "Efficient Federated Learning for Modern NLP", to appear at MobiCom 2023.☆34Aug 18, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆233Feb 5, 2022Updated 4 years ago
- TensorRT encapsulation, learn, rewrite, practice.☆30Oct 19, 2022Updated 3 years ago
- 🌿快速生成文件夹目录结构,支持定义目录层级,支持生成到 markdown 文件。☆13Oct 19, 2022Updated 3 years ago
- hands on model tuning with TVM and profile it on a Mac M1, x86 CPU, and GTX-1080 GPU.☆51Jun 15, 2023Updated 2 years ago
- YOLOv5 Quantization Aware Training (QAT, qat_torch branch) and Post Training Quantization with ONNX (ptq_onnx branch ptq_onnx.ipynb)☆14Feb 28, 2023Updated 3 years ago
- 👩💻 Code for the ACL paper "Detecting Edit Failures in LLMs: An Improved Specificity Benchmark"☆20Jan 19, 2024Updated 2 years ago
- ☆48Nov 1, 2025Updated 7 months ago
- basic framework for rag(retrieval augment generation)☆89Dec 24, 2023Updated 2 years ago
- Compact and Agent-Native MoE Training System☆144Updated this week
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Python implementation of Word2Vec using skip-gram and negative sampling☆12Dec 8, 2015Updated 10 years ago
- A track playback software for csv file developed by QT☆11Feb 6, 2022Updated 4 years ago
- 对llava官方代码的一些学习笔记☆29Oct 11, 2024Updated last year
- 支持RTMDet、YOLOv8、YOLOX、Faster R-CNN等常见算法的ncnn部署☆13Mar 17, 2024Updated 2 years ago
- accelerate generating vector by using onnx model☆18Jan 23, 2024Updated 2 years ago
- Roll model for trading strategy to C++ or FPGA via Matlab tool☆10Sep 11, 2014Updated 11 years ago
- Code and dataset for the paper 'Optimized Prediction of Weapon Effectiveness in BVR Air Combat Scenarios Using Enhanced Regression Models…☆17Jun 29, 2025Updated 11 months ago
- 基于ROS的多无人机协同控制☆12May 8, 2021Updated 5 years ago
- Multi-Agent Deep Recurrent Q-Learning with Bayesian epsilon-greedy on AirSim simulator☆13Apr 1, 2022Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- AI Challenger Image Caption Competition☆10Dec 13, 2017Updated 8 years ago
- Interactive Multi-Agent Reinforcement Learning Environment for the board game Gobblet using PettingZoo.☆12Jul 2, 2023Updated 2 years ago
- [ACL 2025] LongSafety: Evaluating Long-Context Safety of Large Language Models☆16Jun 18, 2025Updated 11 months ago
- Assignment for course AE4301P. Development of control system for an F16 model.☆12Dec 17, 2018Updated 7 years ago
- ☆26Jan 3, 2025Updated last year
- ☆14Jan 15, 2023Updated 3 years ago
- A highly efficient library for GEMM operations on Sunway TaihuLight☆18Sep 7, 2020Updated 5 years ago
- Learning Efficient Vision Transformers via Fine-Grained Manifold Distillation. NeurIPS 2022.☆33Oct 18, 2022Updated 3 years ago
- MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation☆17Sep 2, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- 使用 cutlass 实现 flash-attention 精简版,具有教学意义☆59Aug 12, 2024Updated last year
- High performance inference engine for diffusion models☆108Sep 5, 2025Updated 9 months ago
- Capstone Research Project in NYU Courant☆12Jan 3, 2020Updated 6 years ago
- The code for paper 'Hierarchical Policy for Non-prehensile Multi-object Rearrangement with Deep Reinforcement Learning and Monte Carlo Tr…☆21Aug 18, 2023Updated 2 years ago
- A test driver to demo device & device_driver.☆12Apr 26, 2016Updated 10 years ago
- ☆21Jul 19, 2024Updated last year
- [NAACL 2024] Z-GMOT: Zero-shot Generic Multiple Object Tracking☆12May 19, 2026Updated 3 weeks ago