国产加速卡-海光DCU实战(大模型训练、微调、推理 等)
☆74Aug 10, 2025Updated 8 months ago
Alternatives and similar repositories for dcu-in-action
Users that are interested in dcu-in-action are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- CUDA benchmarks for measuring GPU utilization and interference☆16Feb 11, 2025Updated last year
- MCP实战☆104Jul 16, 2025Updated 9 months ago
- Nsight Compute In Docker☆13Dec 21, 2023Updated 2 years ago
- CSDN of ManVictor☆22Mar 31, 2025Updated last year
- ☆14May 6, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆20Updated this week
- This repository contains resources, documentation and artifacts describing LLM agents☆15Jan 22, 2025Updated last year
- Following the same workflows as Kubernetes. Widely used in InftyAI community.☆13Dec 5, 2025Updated 5 months ago
- A Triton JIT runtime and ffi provider in C++☆33Apr 28, 2026Updated last week
- ☆20Sep 28, 2024Updated last year
- Manages vllm-nccl dependency☆18Jun 3, 2024Updated last year
- Development using Verilog programing language and Vivado IDE .☆14Dec 14, 2019Updated 6 years ago
- ☆28Oct 14, 2024Updated last year
- [Archived] For the latest updates and community contribution, please visit: https://github.com/Ascend/TransferQueue or https://gitcode.co…☆15Jan 16, 2026Updated 3 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆38Nov 14, 2024Updated last year
- 读光中英文OCR onnx 版本模型使用 | Code for using the ONNX version of DuGuang OCR in both Chinese and English☆56Nov 22, 2025Updated 5 months ago
- Vstream - Video Analytics pipeline with Hardware based accelerations (dev - stage)☆10Feb 2, 2024Updated 2 years ago
- An implementation of MSSRM method☆10Mar 23, 2023Updated 3 years ago
- patches for huggingface transformers to save memory☆37Jun 2, 2025Updated 11 months ago
- ☆58Jan 25, 2021Updated 5 years ago
- Custom Scheduler to deploy ML models to TRTIS for GPU Sharing☆12Apr 1, 2020Updated 6 years ago
- DETR tensor去除推理过程无用辅助头+fp16部署再次加速+解决转tensorrt 输出全为0问题的新方法。☆11Jan 9, 2024Updated 2 years ago
- [ICML 2025] From Low Rank Gradient Subspace Stabilization to Low-Rank Weights: Observations, Theories and Applications☆52Oct 30, 2025Updated 6 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [CCS 2024] Optimization-based Prompt Injection Attack to LLM-as-a-Judge☆40Sep 17, 2025Updated 7 months ago
- Source Data of ACL2021 paper "Syntax-Enhanced Pre-trained Model"☆11Jun 1, 2021Updated 4 years ago
- PCA Face Recognition & Emotion Detection API based on KoaJS☆10May 21, 2023Updated 2 years ago
- ☆15Nov 2, 2024Updated last year
- InfiniBand SR-IOV CNI☆13Apr 15, 2026Updated 3 weeks ago
- A model serving framework for various research and production scenarios. Seamlessly built upon the PyTorch and HuggingFace ecosystem.☆23Oct 11, 2024Updated last year
- Code for Rethinking Prompt Optimizers: From Prompt Merits to Optimization☆13Jan 12, 2026Updated 3 months ago
- official implementation of Training-free Boost for Open-Vocabulary Object Detection with Confidence Aggregation☆13Apr 15, 2024Updated 2 years ago
- Resources for phage genomics and annotation☆10Oct 27, 2025Updated 6 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆13Jan 22, 2025Updated last year
- 💡💡💡awesome compute vision app in gradio☆55May 17, 2024Updated last year
- ☆16Nov 5, 2018Updated 7 years ago
- NART = NART is not A RunTime, a deep learning inference framework.☆37Mar 2, 2023Updated 3 years ago
- ☆13Apr 13, 2026Updated 3 weeks ago
- ROUTE: Robust Multitask Tuning and Collaboration for Text-to-SQL (ICLR 2025 Pytorch Code)☆17May 15, 2025Updated 11 months ago
- Zen-NAS, a lightning fast, training-free Neural Architecture Searching algorithm☆11Nov 12, 2021Updated 4 years ago