Comprehensive CUDA tutorials for Maths & ML with examples
☆224Jun 11, 2025Updated 11 months ago
Alternatives and similar repositories for cuda-tutorials
Users that are interested in cuda-tutorials are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Your favourite classical machine learning algos on the GPU/TPU☆22Dec 14, 2025Updated 5 months ago
- 珠算代码大模型(Abacus Code LLM)☆58Sep 26, 2024Updated last year
- MIRA - Multimodal Image Reconstruction with Attention is a transformer (Encoder-Decoder) based architecture for Text / Image to 3D recons…☆14Mar 11, 2024Updated 2 years ago
- Fetch arxiv data to LLM-friendly text☆132Feb 18, 2026Updated 3 months ago
- a collection of resources around LLMs, aggregated for the workshop "Mastering LLMs: End-to-End Fine-Tuning and Deployment" by Dan Becker …☆110May 31, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Stream live plots to a matplotlib figure☆81Apr 18, 2025Updated last year
- Track and Collaborate on ML & AI Experiments.☆44Mar 10, 2025Updated last year
- Repository containing awesome resources regarding Hugging Face tooling.☆49Jan 8, 2024Updated 2 years ago
- Recreating PyTorch from scratch (C/C++, CUDA, NCCL and Python, with multi-GPU support and automatic differentiation!)☆166Nov 25, 2025Updated 5 months ago
- The official implementation of the paper "Self-Updatable Large Language Models by Integrating Context into Model Parameters"☆15May 18, 2025Updated last year
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.☆69May 9, 2023Updated 3 years ago
- SOTA model implementations in JAX/FLAX☆301Aug 28, 2024Updated last year
- Cervical spine 3D segmentation and classification using machine learning to detect fractures and assess vertebrae health.☆25Oct 27, 2023Updated 2 years ago
- The GPU RAM Estimator provides a simple tool for estimating GPU memory usage during training and inference.☆35Apr 9, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"☆15Apr 30, 2025Updated last year
- Supercharge huggingface transformers with model parallelism.☆78Jul 23, 2025Updated 9 months ago
- ☆85Jan 15, 2024Updated 2 years ago
- Samples for CUDA Developers which demonstrates features in CUDA Toolkit☆9,175Updated this week
- A streamlined, user-friendly JSON streaming preprocessor, crafted in Python.☆115Sep 20, 2024Updated last year
- ☆51May 31, 2024Updated last year
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆19May 13, 2026Updated last week
- ☆13Jul 15, 2021Updated 4 years ago
- Llama3-Tutorial(XTuner、LMDeploy、OpenCompass)☆509May 10, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- LLM as World Models using Bayesian inference☆18May 27, 2025Updated 11 months ago
- ☆33Nov 4, 2024Updated last year
- Learning records for building a large language model from scratch☆59Jan 1, 2025Updated last year
- 生成自动滚动的视频分镜头拆解表格☆16Jul 25, 2024Updated last year
- Examples for inference models with ONNXRuntime and CUDA☆26Jul 28, 2023Updated 2 years ago
- ☆10Sep 26, 2023Updated 2 years ago
- This website is to host a series of tutorials on Deep Learning on Graphs for Natural Language Processing.☆13Sep 19, 2022Updated 3 years ago
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K …☆86Dec 28, 2024Updated last year
- [NeurIPS 2024] Terra: A Multimodal Spatio-Temporal Dataset Spanning the Earth☆79Nov 9, 2025Updated 6 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Penn CIS 5650 (GPU Programming and Architecture) Final Project☆44Dec 11, 2023Updated 2 years ago
- 🟠 A study guide to learn about Graph Neural Networks (GNNs)☆1,292Jan 6, 2023Updated 3 years ago
- implement a simple jvm with java☆104Mar 7, 2024Updated 2 years ago
- Implementation of Direct Preference Optimization☆17Jul 17, 2023Updated 2 years ago
- Applied AI experiments and examples for PyTorch☆321Aug 22, 2025Updated 8 months ago
- HunyuanVideo: A Systematic Framework For Large Video Generation Model☆48Dec 14, 2024Updated last year
- This is the shared package to simulate pulse propagation in bulk material (solid and gas) with 3D-UPPE☆13Apr 1, 2026Updated last month