Shohruh72 / HRNet-Landmarks
☆20Updated 7 months ago
Alternatives and similar repositories for HRNet-Landmarks
Users that are interested in HRNet-Landmarks are comparing it to the libraries listed below
- Securade.ai HUB - A generative AI based edge platform for computer vision that connects to existing CCTV cameras and makes them smart.☆252Updated 6 months ago
- Setting up VS Code to work with PyTorch in C/C++ with CUDA support☆25Updated 11 months ago
- Inference and fine-tuning examples for vision models from 🤗 Transformers☆165Updated 5 months ago
- ☆102Updated last week
- ☆43Updated this week
- Inference, Fine Tuning and many more recipes with Gemma family of models☆279Updated 6 months ago
- Learn to build and deploy local Visual Language Models for Edge AI☆371Updated 3 months ago
- VLM driven tool that processes surveillance videos, extracts frames, and generates insightful annotations using a fine-tuned Florence-2 V…☆129Updated 7 months ago
- ☆101Updated last year
- Ollama's Interactive Prompt Engineering Tutorial☆266Updated last year
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆85Updated last year
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆103Updated last year
- Which model is the best at object detection? Which is best for small or large objects? We compare the results in a handy leaderboard.☆97Updated last week
- ☆57Updated last week
- An AI Vision Language Model System for extracting structured knowledge graph information (JSON) from images of process diagrams☆39Updated 9 months ago
- ☆47Updated last year
- AI agent to automatically check grammar and spelling on documentation files☆94Updated 2 months ago
- Official repository for "VideoPrism: A Foundational Visual Encoder for Video Understanding" (ICML 2024)☆349Updated 2 weeks ago
- Self-host LLMs with vLLM and BentoML☆167Updated last week
- Hector RAG is a modular RAG framework built on PostgreSQL, offering advanced retrieval methods and fusion techniques for AI-driven applic…☆60Updated 11 months ago
- Fine tune Gemma 3 on an object detection task☆96Updated 6 months ago
- Open-source CLI toolkit for low-RAM finetuning, quantization, and deployment of LLMs☆92Updated 6 months ago
- This repository demonstrates various examples using YOLO☆13Updated last year
- ☆10Updated 11 months ago
- World's Smallest Vision-Language Model☆32Updated last year
- Notebooks using the Neural Magic libraries 📓☆39Updated last year
- Solving Computer Vision with AI agents☆35Updated 6 months ago
- Liquid Audio - Speech-to-Speech audio models by Liquid AI☆382Updated last week
- A new novel multi-modality (Vision) RAG architecture☆35Updated last year
- 2D Positional Embeddings for Webpage Structural Understanding 🦙👀☆95Updated last year