国产加速卡-海光DCU实战(大模型训练、微调、推理 等)
☆79Aug 10, 2025Updated 9 months ago
Alternatives and similar repositories for dcu-in-action
Users that are interested in dcu-in-action are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- OpenAI compatible API for open source LLMs☆17Oct 30, 2023Updated 2 years ago
- Nsight Compute In Docker☆13Dec 21, 2023Updated 2 years ago
- A Triton-only attention backend for vLLM☆25Mar 17, 2026Updated 2 months ago
- CSDN of ManVictor☆23Mar 31, 2025Updated last year
- ☆14May 6, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆23Updated this week
- Live2D Waifu with TTS support (Please use the Beta Branch)☆11Apr 5, 2026Updated last month
- Following the same workflows as Kubernetes. Widely used in InftyAI community.☆13May 7, 2026Updated 3 weeks ago
- Provides deploy scripts and CSI for Lustre.☆14Apr 13, 2026Updated last month
- ☆12Mar 25, 2020Updated 6 years ago
- A Triton JIT runtime and ffi provider in C++☆35Updated this week
- This repository to demonstrate an application built with Java 21 + SrpingBoot 3 + MyBatis including CRUD operations, authentication, rout…☆12Dec 1, 2024Updated last year
- ☆20Sep 28, 2024Updated last year
- KubeAttention is a residency-aware scheduler plugin that uses machine learning to detect and avoid noisy neighbor interference.☆48Jan 17, 2026Updated 4 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆28Oct 14, 2024Updated last year
- 读光中英文OCR onnx 版本模型使用 | Code for using the ONNX version of DuGuang OCR in both Chinese and English☆56Nov 22, 2025Updated 6 months ago
- Vstream - Video Analytics pipeline with Hardware based accelerations (dev - stage)☆10Feb 2, 2024Updated 2 years ago
- The code for ICLR2025 paper "SLMRec: Empowering Small Language Models for Sequential Recommendation".☆51Jun 16, 2025Updated 11 months ago
- An implementation of MSSRM method☆10Mar 23, 2023Updated 3 years ago
- patches for huggingface transformers to save memory☆36May 9, 2026Updated 2 weeks ago
- Custom Scheduler to deploy ML models to TRTIS for GPU Sharing☆12Apr 1, 2020Updated 6 years ago
- DETR tensor去除推理过程无用辅助头+fp16部署再次加速+解决转tensorrt 输出全为0问题的新方法。☆11Jan 9, 2024Updated 2 years ago
- Exploring advanced prompting tools to query SQL database with multiple tables in natural language using LLMs☆16Aug 23, 2024Updated last year
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- [WIP] Simple scheduler and scenario system for learning Kubernetes Scheduler☆54May 15, 2022Updated 4 years ago
- [ICML 2025] From Low Rank Gradient Subspace Stabilization to Low-Rank Weights: Observations, Theories and Applications☆52Oct 30, 2025Updated 6 months ago
- [CCS 2024] Optimization-based Prompt Injection Attack to LLM-as-a-Judge☆40Sep 17, 2025Updated 8 months ago
- InfiniBand SR-IOV CNI☆13Apr 15, 2026Updated last month
- A model serving framework for various research and production scenarios. Seamlessly built upon the PyTorch and HuggingFace ecosystem.☆23Oct 11, 2024Updated last year
- official implementation of Training-free Boost for Open-Vocabulary Object Detection with Confidence Aggregation☆13Apr 15, 2024Updated 2 years ago
- Resources for phage genomics and annotation☆10Oct 27, 2025Updated 7 months ago
- ☆13Jan 22, 2025Updated last year
- Flip flop setup, hold & metastability explorer tool☆52Oct 28, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- NART = NART is not A RunTime, a deep learning inference framework.☆37Mar 2, 2023Updated 3 years ago
- ☆16Jan 16, 2025Updated last year
- ☆14May 1, 2023Updated 3 years ago
- ☆13May 11, 2026Updated 2 weeks ago
- [ACL 2025] iAgent: LLM Agent as a Shield between User and Recommender Systems☆31May 23, 2025Updated last year
- Code and data for the CCS'19 paper "Watching You Watch: The Tracking Ecosystem of Over-the-TopTV Streaming Devices"☆13Dec 14, 2019Updated 6 years ago
- 2020湖南省第一届人工智能大赛参赛作品☆11Feb 17, 2022Updated 4 years ago