Comprehensive CUDA tutorials for Maths & ML with examples
☆223Jun 11, 2025Updated 10 months ago
Alternatives and similar repositories for cuda-tutorials
Users that are interested in cuda-tutorials are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Knock your images before you get stressed.☆11Jan 9, 2022Updated 4 years ago
- ☆52Feb 5, 2025Updated last year
- Make triton easier☆50Jun 12, 2024Updated last year
- ☆12Sep 11, 2020Updated 5 years ago
- 珠算代码大模型(Abacus Code LLM)☆58Sep 26, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Fetch arxiv data to LLM-friendly text☆131Feb 18, 2026Updated 2 months ago
- Rust implementation of Surya☆66Mar 1, 2025Updated last year
- Track and Collaborate on ML & AI Experiments.☆44Mar 10, 2025Updated last year
- High-Performance C++ Fundamental Library☆638Mar 16, 2026Updated last month
- This is an example of creating an AI agent with flowchart☆12Jul 22, 2024Updated last year
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.☆69May 9, 2023Updated 2 years ago
- The GPU RAM Estimator provides a simple tool for estimating GPU memory usage during training and inference.☆35Apr 9, 2024Updated 2 years ago
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"☆15Apr 30, 2025Updated 11 months ago
- Supercharge huggingface transformers with model parallelism.☆78Jul 23, 2025Updated 8 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆85Jan 15, 2024Updated 2 years ago
- Samples for CUDA Developers which demonstrates features in CUDA Toolkit☆9,093Mar 30, 2026Updated 2 weeks ago
- ☆51May 31, 2024Updated last year
- ☆13Jul 15, 2021Updated 4 years ago
- LLM as World Models using Bayesian inference☆17May 27, 2025Updated 10 months ago
- ☆33Nov 4, 2024Updated last year
- This website is to host a series of tutorials on Deep Learning on Graphs for Natural Language Processing.☆13Sep 19, 2022Updated 3 years ago
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K …☆86Dec 28, 2024Updated last year
- Official Implementation of UTrice: Unifying Primitives in Differentiable Ray Tracing and Rasterization via Triangles for Particle-Based 3…☆30Jan 13, 2026Updated 3 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- implement a simple jvm with java☆104Mar 7, 2024Updated 2 years ago
- Implementation of Direct Preference Optimization☆17Jul 17, 2023Updated 2 years ago
- ☆10Dec 21, 2020Updated 5 years ago
- Applied AI experiments and examples for PyTorch☆320Aug 22, 2025Updated 7 months ago
- 😜 表情包视觉数据集,使用glm-4v、step-1v的图像解析能力标注。☆145Apr 27, 2024Updated last year
- Universal Neurons in GPT2 Language Models☆30May 28, 2024Updated last year
- Code for COMET: Cardinality Constrained Mixture of Experts with Trees and Local Search☆11Jun 21, 2023Updated 2 years ago
- GPU programming related news and material links☆2,093Mar 8, 2026Updated last month
- 通 过HTTP接口微信发送消息☆22Oct 10, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods.☆32Jun 5, 2025Updated 10 months ago
- A curated collection of LLM reasoning and planning resources, including key papers, limitations, benchmarks, and additional learning mate…☆316Feb 28, 2025Updated last year
- A simple Python tool to measure the performance of ONNX models.☆27Sep 15, 2024Updated last year
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.☆93Apr 7, 2026Updated last week
- A very hacky set of functions for getting plotly to do what I want when doing mech interp research, designed to be compatible with PyTorc…☆13Jun 16, 2023Updated 2 years ago
- ☆31Jan 16, 2023Updated 3 years ago
- 推荐系统入门指南,全面介绍了工业级推荐系统的理论知识(王树森推荐系统公开课-基于小红书的场景讲解工业界真实的推荐系统),如何基于TensorFlow2训练模型,如何实现高性能、高并发、高可用的Golang推理微服务。Comprehensively introduced th…☆703Feb 10, 2025Updated last year