Comprehensive CUDA tutorials for Maths & ML with examples
☆232Jun 11, 2025Updated last year
Alternatives and similar repositories for cuda-tutorials
Users that are interested in cuda-tutorials are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Your favourite classical machine learning algos on the GPU/TPU☆23Dec 14, 2025Updated 6 months ago
- Experimental paper writing linter.☆35Sep 2, 2024Updated last year
- Make triton easier☆50Jun 12, 2024Updated 2 years ago
- 珠算代码大模型(Abacus Code LLM)☆58Sep 26, 2024Updated last year
- Fetch arxiv data to LLM-friendly text☆132Feb 18, 2026Updated 4 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- a collection of resources around LLMs, aggregated for the workshop "Mastering LLMs: End-to-End Fine-Tuning and Deployment" by Dan Becker …☆108May 31, 2024Updated 2 years ago
- Track and Collaborate on ML & AI Experiments.☆44Mar 10, 2025Updated last year
- Official repository for the review article "Modeling protein–ligand interactions for drug discovery in the era of deep learning."☆45Jun 23, 2026Updated last week
- Recreating PyTorch from scratch (C/C++, CUDA, NCCL and Python, with multi-GPU support and automatic differentiation!)☆166Nov 25, 2025Updated 7 months ago
- This is an example of creating an AI agent with flowchart☆12Jul 22, 2024Updated last year
- ☆10Apr 16, 2021Updated 5 years ago
- ☆13Mar 18, 2022Updated 4 years ago
- JAX library for training sub-4B foundation models for edge☆302Aug 28, 2024Updated last year
- Cervical spine 3D segmentation and classification using machine learning to detect fractures and assess vertebrae health.☆25Oct 27, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- The GPU RAM Estimator provides a simple tool for estimating GPU memory usage during training and inference.☆35Apr 9, 2024Updated 2 years ago
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"☆16Apr 30, 2025Updated last year
- Supercharge huggingface transformers with model parallelism.☆78Jul 23, 2025Updated 11 months ago
- ☆87Jan 15, 2024Updated 2 years ago
- An implementation of the Llama architecture, to instruct and delight☆21May 31, 2025Updated last year
- A streamlined, user-friendly JSON streaming preprocessor, crafted in Python.☆116Sep 20, 2024Updated last year
- ☆51May 31, 2024Updated 2 years ago
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆19Updated this week
- ☆13Jul 15, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A fast RWKV Tokenizer written in Rust☆54Aug 12, 2025Updated 10 months ago
- Llama3-Tutorial(XTuner、LMDeploy、OpenCompass)☆508May 10, 2024Updated 2 years ago
- ☆32Nov 4, 2024Updated last year
- Learning records for building a large language model from scratch☆58Jan 1, 2025Updated last year
- 生成自动滚动的视频分镜头拆解表格☆16Jul 25, 2024Updated last year
- Repo for the Sente OnlineGo Android app.☆10Nov 13, 2023Updated 2 years ago
- 愚公wiki 是一款轻量的在线博客、知识库、个人笔记或企业文档协作平台,可下载桌面版作为个人笔记本,也可以在线编辑文档,当然也可以自行进行服务化部署,因为这是一款完全开源的写作平台☆17Jul 22, 2024Updated last year
- ☆10Sep 26, 2023Updated 2 years ago
- This website is to host a series of tutorials on Deep Learning on Graphs for Natural Language Processing.☆13Sep 19, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K …☆85Dec 28, 2024Updated last year
- Training framework with a goal to explore the frontier of sample efficiency of small language models☆101Jan 25, 2026Updated 5 months ago
- Penn CIS 5650 (GPU Programming and Architecture) Final Project☆44Dec 11, 2023Updated 2 years ago
- ☆53Nov 14, 2024Updated last year
- Applied AI experiments and examples for PyTorch☆323Aug 22, 2025Updated 10 months ago
- 😜 表情包视觉数据集,使用glm-4v、step-1v的图像解析能力标注。☆149Apr 27, 2024Updated 2 years ago
- Code for COMET: Cardinality Constrained Mixture of Experts with Trees and Local Search☆11Jun 21, 2023Updated 3 years ago