ThinamXx / cuda-mode
Making of a CUDA kernel
☆17 Updated 4 months ago
Alternatives and similar repositories for cuda-mode
Users that are interested in cuda-mode are comparing it to the libraries listed below
- Just some stuff for Interview questions, books, annotated paper, notes, cheat sheets etc etc related to ML,AI, Deep Learning and Data Sc… ☆118 Updated last month
- Complete implementation of Llama2 with/without KV cache & inference 🚀 ☆48 Updated last year
- Notebooks for fine tuning pali gemma ☆117 Updated 5 months ago
- RAGs: Simple implementations of Retrieval Augmented Generation (RAG) Systems ☆133 Updated 8 months ago
- A minimal implementation of LLaVA-style VLM with interleaved image & text & video processing ability. ☆96 Updated 9 months ago
- Building GPT ... ☆18 Updated 10 months ago
- 100 Days of GPU Challenge ☆23 Updated last month
- Fine-tune an LLM to perform batch inference and online serving. ☆112 Updated 4 months ago
- ☆45 Updated 4 months ago
- Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs ☆38 Updated 11 months ago
- Deep Learning for Computer Vision ☆59 Updated last year
- Composition of Multimodal Language Models From Scratch ☆15 Updated last year
- a distributed end-to-end image classification system using kubernetes ☆13 Updated 9 months ago
- ☆134 Updated last year
- vision language models finetuning notebooks & use cases (Medgemma - paligemma - florence .....) ☆50 Updated 2 weeks ago
- The repository will contain a list of projects which we will work on while reading the books of Natural Language Processing & Transformer… ☆73 Updated last year
- A collection of hands-on notebooks for LLM practitioners ☆50 Updated 8 months ago
- A set of scripts and notebooks on LLM fine-tuning and dataset creation ☆110 Updated last year
- Notes on quantization in neural networks ☆103 Updated last year
- ☆138 Updated last year
- Fine tune Gemma 3 on an object detection task ☆84 Updated 2 months ago
- Conference schedule, top papers, and analysis of the data for NeurIPS 2023! ☆119 Updated last year
- LORA: Low-Rank Adaptation of Large Language Models implemented using PyTorch