Sayandip170900 / CUDA-Challenge
100 Days of GPU Challenge
☆22 · Updated 2 weeks ago
Alternatives and similar repositories for CUDA-Challenge
Users who are interested in CUDA-Challenge are comparing it to the repositories listed below.
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio… ☆84 · Updated last year
- This repository demonstrates various examples using YOLO ☆13 · Updated last year
- ☆21 · Updated 8 months ago
- EdgeSAM model for use with Autodistill. ☆29 · Updated last year
- Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zeta ☆16 · Updated 10 months ago
- Composition of Multimodal Language Models From Scratch ☆15 · Updated last year
- Use Grounding DINO, Segment Anything, and CLIP to label objects in images. ☆31 · Updated last year
- This is a repository for the course "From Beginner to LLM Developer" by Towards AI. ☆11 · Updated 8 months ago
- Fine-tune Gemma 3 on an object detection task ☆82 · Updated 2 months ago
- Building LLMs from scratch following the book from S. Raschka ☆31 · Updated 5 months ago
- Notebooks for fine-tuning PaliGemma ☆117 · Updated 5 months ago
- Timm model explorer ☆41 · Updated last year
- Unofficial implementation and experiments related to Set-of-Mark (SoM) 👁️ ☆88 · Updated last year
- A complete PyTorch implementation of Google's Gemma3 270M language model, featuring sliding window attention, RoPE positional encoding, a… ☆29 · Updated last week
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models. ☆67 · Updated last year
- ☆16 · Updated 3 months ago
- Streamlit app presented to the Streamlit LLMs Hackathon September 23 ☆16 · Updated last year
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model… ☆36 · Updated last year
- Making of a CUDA kernel ☆17 · Updated 3 months ago
- 🤖 AI Assistant fine-tuned to provide support for coding and design questions based on the latest trends in the industry. ☆17 · Updated last year
- Inference and fine-tuning examples for vision models from 🤗 Transformers ☆161 · Updated last month
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data ☆21 · Updated last year
- ☆29 · Updated last year
- A minimal implementation of LLaVA-style VLM with interleaved image & text & video processing ability. ☆96 · Updated 9 months ago
- Multimodal AI workloads: batch inference, model training and online serving. ☆59 · Updated 3 weeks ago
- ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing ☆70 · Updated last year
- Notebooks using the Neural Magic libraries 📓 ☆39 · Updated last year
- Notebooks to demonstrate TimmWrapper ☆16 · Updated 8 months ago
- A list of language models with permissive licenses such as MIT or Apache 2.0 ☆24 · Updated 6 months ago
- Documentation, notes, links, etc. for streams. ☆83 · Updated last year