☆76 Nov 22, 2024 Updated last year
Alternatives and similar repositories for DIOPI
Users that are interested in DIOPI are comparing it to the libraries listed below
- ☆74 Oct 31, 2024 Updated last year
- ☆28 Jan 7, 2025 Updated last year
- A benchmark suited especially for deep learning operators ☆42 Feb 13, 2023 Updated 3 years ago
- CVFusion is an open-source deep learning compiler to fuse the OpenCV operators. ☆33 Aug 31, 2022 Updated 3 years ago
- [DAC2024] A Holistic Functionalization Approach to Optimizing Imperative Tensor Programs in Deep Learning ☆15 Jan 13, 2024 Updated 2 years ago
- ☆33 Mar 13, 2026 Updated last week
- ☆53 Mar 3, 2026 Updated 2 weeks ago
- Sequence-level 1F1B schedule for LLMs. ☆38 Aug 26, 2025 Updated 6 months ago
- ☆15 Sep 3, 2024 Updated last year
- ☆17 Jan 1, 2024 Updated 2 years ago
- ☆74 Updated this week
- InternEvo is an open-sourced lightweight training framework that aims to support model pre-training without the need for extensive dependencie… ☆419 Aug 21, 2025 Updated 7 months ago
- Ascend PyTorch adapter (torch_npu). Mirror of https://gitcode.com/Ascend/pytorch ☆493 Updated this week
- mllm-npu: training multimodal large language models on Ascend NPUs ☆94 Aug 29, 2024 Updated last year
- [Archived] For the latest updates and community contribution, please visit: https://github.com/Ascend/TransferQueue or https://gitcode.co… ☆13 Jan 16, 2026 Updated 2 months ago
- This is a comprehensive guide on how you can automate your feature engineering process. ☆11 Jun 25, 2018 Updated 7 years ago
- Multi-encoder segmentation for contrail detection in satellite imagery | Google Researc ☆11 Jan 28, 2026 Updated last month
- An easy way to run, test, benchmark and tune OpenCL kernel files ☆24 Aug 25, 2023 Updated 2 years ago
- A prefill & decode disaggregated LLM serving framework with shared GPU memory and fine-grained compute isolation. ☆123 Dec 25, 2025 Updated 2 months ago
- A benchmark and playground for Completely Fair Scheduling in Go ☆11 Feb 12, 2022 Updated 4 years ago
- This is the official code repository for the paper: Towards General Continuous Memory for Vision-Language Models. ☆23 Jul 3, 2025 Updated 8 months ago
- Simple intermediate representation language for learning and research. ☆20 Mar 27, 2020 Updated 5 years ago
- A throughput-oriented high-performance serving framework for LLMs ☆949 Oct 29, 2025 Updated 4 months ago
- A model compilation solution for various hardware ☆467 Aug 20, 2025 Updated 7 months ago
- NVIDIA NVSHMEM is a parallel programming interface for NVIDIA GPUs based on OpenSHMEM. NVSHMEM can significantly reduce multi-process com… ☆482 Mar 10, 2026 Updated last week
- triton for dsa ☆60 Updated this week
- ☆352 Jan 28, 2026 Updated last month
- Best practices for training DeepSeek, Mixtral, Qwen and other MoE models using Megatron Core. ☆176 Updated this week
- 💻 Terminal-Agent with Human-in-the-Loop Learning ☆36 Jan 16, 2026 Updated 2 months ago
- Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework ☆244 Jan 17, 2026 Updated 2 months ago
- Inference Llama 2 in one file of pure go ☆16 Jul 25, 2023 Updated 2 years ago
- Memory Efficient Training Framework for Large Video Generation Model ☆25 Apr 22, 2024 Updated last year
- Code for training & inference with FLAN family of models ☆17 May 23, 2023 Updated 2 years ago
- [IROS 2021] ADD: A Fine-grained Dynamic Inference Architecture for Semantic Image Segmentation ☆10 May 3, 2022 Updated 3 years ago
- ☆131 Nov 11, 2024 Updated last year
- Allow torch tensor memory to be released and resumed later ☆225 Mar 10, 2026 Updated last week
- Deep learning for time-varying multi-entity datasets ☆17 May 12, 2018 Updated 7 years ago
- [ICML 2025] Official code of "DAMA: Data- and Model-aware Alignment of Multi-modal LLMs" ☆16 May 24, 2025 Updated 9 months ago
- ☆11 Jun 24, 2021 Updated 4 years ago