Shohruh72 / HRNet-Landmarks
☆20 Updated 6 months ago
Alternatives and similar repositories for HRNet-Landmarks
Users who are interested in HRNet-Landmarks are comparing it to the libraries listed below.
- Securade.ai HUB - A generative AI based edge platform for computer vision that connects to existing CCTV cameras and makes them smart. ☆243 Updated 5 months ago
- Inference and fine-tuning examples for vision models from 🤗 Transformers ☆162 Updated 4 months ago
- Setting up VS Code to work with PyTorch in C/C++ with CUDA support ☆25 Updated 10 months ago
- MBASE, an LLM SDK in C++ ☆56 Updated 5 months ago
- A tool for converting computer vision label formats. ☆80 Updated 2 weeks ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch ☆103 Updated 11 months ago
- Which model is the best at object detection? Which is best for small or large objects? We compare the results in a handy leaderboard. ☆93 Updated last week
- Hector RAG is a modular RAG framework built on PostgreSQL, offering advanced retrieval methods and fusion techniques for AI-driven applic… ☆59 Updated 9 months ago
- Ollama's Interactive Prompt Engineering Tutorial ☆263 Updated last year
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio… ☆84 Updated last year
- Official code for the paper "GestSync: Determining who is speaking without a talking head" published at BMVC 2023 ☆46 Updated last year
- VLM driven tool that processes surveillance videos, extracts frames, and generates insightful annotations using a fine-tuned Florence-2 V… ☆125 Updated 6 months ago
- Inference, Fine Tuning and many more recipes with Gemma family of models ☆276 Updated 5 months ago
- An AI Vision Language Model System for extracting structured knowledge graph information (JSON) from images of process diagrams ☆36 Updated 8 months ago
- Liquid Audio - Speech-to-Speech audio models by Liquid AI ☆298 Updated 2 months ago
- ☆101 Updated last year
- Official repository for "VideoPrism: A Foundational Visual Encoder for Video Understanding" (ICML 2024) ☆333 Updated 2 months ago
- ☆43 Updated this week
- 100 Days of GPU Challenge ☆24 Updated last month
- This repository contains a Multimodal Retrieval-Augmented Generation (RAG) Pipeline that integrates images, audio, and text for advanced … ☆24 Updated 11 months ago
- 6D Rotation Representation for Unconstrained Head Pose Estimation ☆17 Updated 4 months ago
- An SDK for Transformers + YOLO and other SSD family models ☆65 Updated 10 months ago
- ☆34 Updated last year
- Ultralytics Notebooks ☆168 Updated last month
- YOLOExplorer: Iterate on your YOLO / CV datasets using SQL, Vector semantic search, and more within seconds ☆138 Updated 2 weeks ago
- Solving Computer Vision with AI agents ☆34 Updated 5 months ago
- UniFace: A Comprehensive Library for Face Detection, Recognition, Landmark Analysis, Face Parsing, Gaze Estimation, Age, and Gender Detec… ☆463 Updated this week
- A tool for an analysis of LLM generations. ☆41 Updated 2 months ago
- Notebooks using the Neural Magic libraries ☆39 Updated last year
- 2D Positional Embeddings for Webpage Structural Understanding ☆95 Updated last year