Sayandip170900 / CUDA-ChallengeLinks
100 Days of GPU Challenge
☆23Updated last month
Alternatives and similar repositories for CUDA-Challenge
Users that are interested in CUDA-Challenge are comparing it to the libraries listed below
Sorting:
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆84Updated last year
- Eye exploration☆29Updated 8 months ago
- Building LLMs from scratch following the book from S. Raschka☆31Updated 7 months ago
- This is a repository for the course "From Beginner to LLM Developer" by Towards AI.☆11Updated 9 months ago
- ☆16Updated 5 months ago
- A complete PyTorch implementation of Google's Gemma3 270M language model, featuring sliding window attention, RoPE positional encoding, a…☆40Updated last month
- Use Grounding DINO, Segment Anything, and CLIP to label objects in images.☆32Updated last year
- Fine tune Gemma 3 on an object detection task☆87Updated 3 months ago
- This Repository demostrates various examples using YOLO☆13Updated last year
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆21Updated last year
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated 8 months ago
- ☆59Updated last year
- Timm model explorer☆42Updated last year
- ☆22Updated 9 months ago
- Notebooks for fine tuning pali gemma☆117Updated 6 months ago
- Hands-on hub to learn techniques to optimize and serve AI models to production the most optimal way.☆13Updated 2 months ago
- Composition of Multimodal Language Models From Scratch☆15Updated last year
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models.☆67Updated last year
- Notebooks using the Neural Magic libraries 📓☆39Updated last year
- An integration of Segment Anything Model, Molmo, and, Whisper to segment objects using voice and natural language.☆29Updated 8 months ago
- Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zeta☆16Updated 11 months ago
- ☆163Updated 3 months ago
- Which model is the best at object detection? Which is best for small or large objects? We compare the results in a handy leaderboard.☆90Updated last week
- A minimal implementation of LLaVA-style VLM with interleaved image & text & video processing ability.☆96Updated 10 months ago
- Simplify Your Visual Data Ops. Find and visualize issues with your computer vision datasets such as duplicates, anomalies, data leakage, …☆69Updated 5 months ago
- Notebooks to demonstrate TimmWrapper☆16Updated 9 months ago
- Unofficial implementation and experiments related to Set-of-Mark (SoM) 👁️☆87Updated 2 years ago
- ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing☆69Updated last year
- EdgeSAM model for use with Autodistill.☆29Updated last year
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model…☆37Updated 2 years ago