ThinamXx / cuda-modeLinks
Making of cuda kernel
β16Updated 3 weeks ago
Alternatives and similar repositories for cuda-mode
Users that are interested in cuda-mode are comparing it to the libraries listed below
Sorting:
- Complete implementation of Llama2 with/without KV cache & inference πβ46Updated last year
- 100 Days of GPU Challengeβ20Updated 3 weeks ago
- This is a repository for the course "From Beginner to LLM Developer" by Towards AI.β11Updated 5 months ago
- Code Repository for Blog - How to Productionize Large Language Models (LLMs)β11Updated last year
- Use Grounding DINO, Segment Anything, and CLIP to label objects in images.β31Updated last year
- a distributed end-to-end image classification system using kubernetesβ11Updated 5 months ago
- Composition of Multimodal Language Models From Scratchβ14Updated 10 months ago
- Notebook and Scripts that showcase running quantized diffusion models on consumer GPUsβ38Updated 7 months ago
- zero-to-lightningβ29Updated last year
- A collection of hand on notebook for LLMs practitionerβ48Updated 5 months ago
- Quantization of LLMs and benchmarking.β10Updated last year
- β39Updated last month
- Building GPT ...β18Updated 6 months ago
- Repository containing awesome resources regarding Hugging Face tooling.β47Updated last year
- serving a torch model using Celery, Redis and RabbitMQ to serve users asynchronouslyβ21Updated last year
- Image Search Engine with HuggingFace Sentence Transformerβ12Updated last year
- RAGs: Simple implementations of Retrieval Augmented Generation (RAG) Systemsβ116Updated 5 months ago
- π€ AI Assistant fine-tuned to provide support for coding and design questions based on the latest trends in the industry.β17Updated last year
- Awesome MLOps Course Outlineβ34Updated 2 years ago
- Chat with Qwen2-VL. Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.β10Updated 9 months ago
- The repository will contain a list of projects which we will work on while reading the books of Natural Language Processing & Transformerβ¦β71Updated last year
- CVPR 2024 Research Paper with Codeβ48Updated 11 months ago
- Fine tune Gemma 3 on an object detection taskβ57Updated this week
- Building LLMs from scratch following the book from S. Raschkaβ31Updated 2 months ago
- Fine-tune an LLM to perform batch inference and online serving.β112Updated 3 weeks ago
- From Scratch Implementation of some popular Deep Learning Papers with PyTorch and Tensorflowβ17Updated 2 years ago
- vision transformers with pytorch and pytorch lightningβ1Updated 8 months ago
- Accelerate Model Training with PyTorch 2.X, published by Packtβ44Updated last year
- This repository shows various ways of deploying a vision model (TensorFlow) from π€ Transformers.β30Updated 2 years ago
- Reference code base for ML Engineering in Action, Manning Publications Author: Ben Wilsonβ20Updated last year