☆31Jan 7, 2025Updated last year
Alternatives and similar repositories for ditorch
Users that are interested in ditorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆76Nov 22, 2024Updated last year
- ☆13May 23, 2025Updated 11 months ago
- ☆77Oct 31, 2024Updated last year
- ☆133Nov 11, 2024Updated last year
- Write events for TensorBoard☆13Apr 27, 2026Updated 3 weeks ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models☆11Dec 13, 2023Updated 2 years ago
- ☆33Feb 3, 2025Updated last year
- ☆10Jul 18, 2024Updated last year
- Web version of the MiniDecaf compiler.☆13Sep 17, 2020Updated 5 years ago
- ☆14Jun 30, 2021Updated 4 years ago
- ☆20Oct 11, 2023Updated 2 years ago
- a demo for openmp , by Jidor☆13Mar 25, 2019Updated 7 years ago
- Tutorial for Ray☆37Mar 31, 2024Updated 2 years ago
- workflow of nndeploy☆13Nov 5, 2025Updated 6 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A high-throughput and memory-efficient inference and serving engine for LLMs☆17Jun 3, 2024Updated last year
- faster-rcnn c++ python model☆14Dec 3, 2017Updated 8 years ago
- APRIL: Active Partial Rollouts in Reinforcement Learning to Tame Long-tail Generation. A system-level optimization for scalable LLM tra…☆57Oct 11, 2025Updated 7 months ago
- ☆13Sep 22, 2025Updated 7 months ago
- 🤖FFPA: Extends FlashAttention-2 via Split-D for large headdims, 1.5x~3×↑🎉 vs SDPA, up to 430T🎉 on H200.☆299Updated this week
- 🦀🦀🦀Crablet: Next-gen AI Assistant☆55Updated this week
- Display output from `xo` as a list of style errors, ordered by count☆34Aug 14, 2025Updated 9 months ago
- ggml学习笔记,ggml是一个机器学习的推理框架☆18Mar 24, 2024Updated 2 years ago
- Guide to deploying deep-learning inference networks and deep vision primitives on SOPHON TPU.☆20Nov 14, 2025Updated 6 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆17Jan 1, 2024Updated 2 years ago
- Asynchronous pipeline parallel optimization☆21Feb 2, 2026Updated 3 months ago
- 电子版书籍☆14Dec 23, 2019Updated 6 years ago
- I read papers, and here are my highlights.☆16Jun 7, 2020Updated 5 years ago
- Nex Venus Communication Library☆75Nov 17, 2025Updated 6 months ago
- ☆19Jun 17, 2025Updated 11 months ago
- Real-time iris detector. Only need 8ms on Intel i5 CPU!☆21Feb 24, 2019Updated 7 years ago
- Deep learning algorithms: A sparse autoencoder (and someday more algorithms), implemented in Common Lisp.☆27Jun 10, 2010Updated 15 years ago
- Runtimex package help to expose Go Runtime internals representation safely.☆12Feb 19, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A Really Scalable RL Framework to 10k+ CPUs☆38Feb 29, 2024Updated 2 years ago
- BM25F demo with lucene using BlendedTermQuery and a custom similarity☆14Oct 11, 2016Updated 9 years ago
- Code for the preprint "Cache Me If You Can: How Many KVs Do You Need for Effective Long-Context LMs?"☆48Jul 29, 2025Updated 9 months ago
- LLM Inference via Triton (Flexible & Modular): Focused on Kernel Optimization using CUBIN binaries, Starting from gpt-oss Model☆115Apr 28, 2026Updated 3 weeks ago
- The AMD rocAL is designed to efficiently decode and process images and videos from a variety of storage formats and modify them through a…☆24May 8, 2026Updated last week
- paper and code for New Directions in Cloud Programming, CIDR 2021☆11Feb 17, 2021Updated 5 years ago
- [ICDCS 2023] Evaluation and Optimization of Gradient Compression for Distributed Deep Learning☆10Apr 28, 2023Updated 3 years ago