rumc3dlab / 3dlandmarkdetection
This repository holds the "Fully automated landmarking and facial segmentation on 3D photographs" files
☆28Updated last year
Alternatives and similar repositories for 3dlandmarkdetection:
Users who are interested in 3dlandmarkdetection are comparing it to the repositories listed below
- TensorFlow code for our ECCV'24 Workshop paper "LightAvatar: Efficient Head Avatar as Dynamic NeLF"☆27Updated 2 months ago
- Multiple Transformation Function Estimation for Image Enhancement☆22Updated 3 months ago
- TensorFlow implementation of a comprehensive comparison of various SSL (Semi-Supervised Learning) approaches in image segmentation, featu…☆18Updated 3 months ago
- (AAAI'25) Training-and-prompt Free General Painterly Image Harmonization Using image-wise attention sharing☆54Updated last month
- A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integrat…☆63Updated 4 months ago
- PhysMamba: Efficient Remote Physiological Measurement with SlowFast Temporal Difference Mamba☆45Updated 2 months ago
- Simple program to manually caption your images (or any other file types) so you can use them for AI training☆37Updated last year
- Low-latency Space-time Supersampling for Real-time Rendering☆30Updated 11 months ago
- PyTorch Implementation of "ASTRA: An Action Spotting TRAnsformer for Soccer Videos", ACM MMSports 2023. | 3rd place solution for SoccerNe…☆36Updated 8 months ago
- Command-line script for inferencing from models such as WizardCoder☆26Updated last year
- ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing☆67Updated 8 months ago
- ☆29Updated last year
- ☆13Updated 11 months ago
- A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAEs trained by Joseph Bloom o…☆17Updated 3 months ago
- ☆20Updated last month
- LiVOS: Light Video Object Segmentation with Gated Linear Matching☆25Updated 2 months ago
- [Pattern Recognition 2024] Semantic-Aware Frame-Event Fusion based Pattern Recognition via Large Vision-Language Models, Dong Li, Jiandon…☆17Updated last week
- ☆28Updated last year
- Visual RAG using less than 300 lines of code.☆24Updated 10 months ago
- Multi-vision Sensor Perception and Reasoning (MS-PR) benchmark, assessing VLMs on their capacity for sensor-specific reasoning.☆12Updated 3 weeks ago
- Official implementation of "Gemini in Reasoning: Unveiling Commonsense in Multimodal Large Language Models"☆35Updated last year
- Repo for event-based binary image reconstruction.☆32Updated 10 months ago
- Official implementation of "GST: Precise 3D Human Body from a Single Image with Gaussian Splatting Transformers"☆78Updated 4 months ago
- ☆33Updated last year
- ☆16Updated last year
- INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model☆41Updated 5 months ago
- Implementation of Grounding DINO & Segment Anything, and it allows masking based on prompt, which is useful for programmed inpainting.☆35Updated last year
- ☆11Updated last year
- Unofficial implementation and experiments related to Set-of-Mark (SoM) 👁️☆81Updated last year
- A Residual Network Design with less than 5 million trainable parameters achieving an accuracy of 96.04% on CIFAR-10.☆26Updated 6 months ago