FoodSAM: Any Food Segmentation
☆173Jan 24, 2024Updated 2 years ago
Alternatives and similar repositories for FoodSAM
Users that are interested in FoodSAM are comparing it to the libraries listed below
Sorting:
- MM'21 Main-Track paper☆121Jan 17, 2024Updated 2 years ago
- Lidar Panoptic Segmentation without Bells and Whistles (IROS 2023)☆25Oct 21, 2023Updated 2 years ago
- [IJCAI'23] Complete Instances Mining for Weakly Supervised Instance Segmentation☆38Feb 14, 2024Updated 2 years ago
- Deep learning based food instance segmentation using synthetic dataset. We provide the trainig, evaluate and inference codes. Also the tr…☆14Jun 1, 2023Updated 2 years ago
- ☆29Jan 23, 2024Updated 2 years ago
- Enhancing Recipe Retrieval with Foundation Models: A Data Augmentation Perspective☆14Oct 22, 2024Updated last year
- [IJCV 2024] MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation☆128Oct 8, 2024Updated last year
- This repository holds the "Fully automated landmarking and facial segmentation on 3D photographs" files☆30Oct 23, 2023Updated 2 years ago
- MICCAI-MLMI-2023: A Single-Point Prompt Network for Nuclei Image Segmentation (Boost SAM)☆30Oct 25, 2023Updated 2 years ago
- ☆22Mar 23, 2025Updated 11 months ago
- [IJCV] PyTorch implementation of "Background Activation Suppression for Weakly Supervised Object Localization and Semantic Segmentation"☆19Oct 25, 2023Updated 2 years ago
- Adaptive Inter-Class Similarity Distillation for Semantic Segmentation (MTAP 2025)☆29Nov 14, 2025Updated 4 months ago
- A Residual Network Design with less than 5 million trainable parameters achieving an accuracy of 96.04% on CIFAR-10.☆27Jul 23, 2024Updated last year
- Official implementation of the paper "From Optimization to Generalization: Fair Federated Learning against Quality Shift via Inter-Client…☆11Mar 13, 2025Updated last year
- Open-Source Implementations of Multi-Modal Diffusion Models Optimized for Highest Quality and Ease of Use☆198Mar 26, 2024Updated last year
- Spatio-Temporal MLP-Graph Network for 3D Human Pose Estimation☆25Sep 25, 2023Updated 2 years ago
- Official PyTorch implementation of "Generalized Consistency Trajectory Models for Image Manipulation"☆44Mar 31, 2024Updated last year
- Create API agents from OpenAPI Specs☆186Nov 12, 2023Updated 2 years ago
- The official project website of "KernelWarehouse: Rethinking the Design of Dynamic Convolution" (KW for short, published in ICML 2024)☆102Jun 13, 2024Updated last year
- Multiple Transformation Function Estimation for Image Enhancement☆22Oct 20, 2024Updated last year
- How Good is Google Bard's Visual Understanding? An Empirical Study on Open Challenges☆30Sep 24, 2023Updated 2 years ago
- ☆25Sep 19, 2023Updated 2 years ago
- Official implementation of VLPCook: Vision and Structured-Language Pretraining for Cross-Modal Food Retrieval☆15Mar 25, 2023Updated 2 years ago
- PyTorch Implementation of "ASTRA: An Action Spotting TRAnsformer for Soccer Videos", ACM MMSports 2023. | 3rd place solution for SoccerNe…☆42May 20, 2024Updated last year
- Official implementation of Inconsistency Masks. A robust semi-supervised segmentation framework that reframes model disagreement as a…☆19Jan 23, 2026Updated last month
- ☆33May 15, 2024Updated last year
- [ECAI 2023] MonoSKD: General Distillation Framework for Monocular 3D Object Detection via Spearman Correlation Coefficient☆32Dec 8, 2023Updated 2 years ago
- [NeurIPS XAIA & Springer] Code and notebooks to paper "A Fresh Look at Sanity Checks for Saliency Maps"☆25Jul 12, 2024Updated last year
- This repository contains the resource introduced in the paper: "Truth or Mirage? Towards End-to-End Factuality Evaluation with LLM-Oasis"…☆25Oct 15, 2025Updated 5 months ago
- This is the official repository for OVIR-3D: Open-Vocabulary 3D Instance Retrieval Without Training on 3D Data. (CoRL'23)☆112Nov 10, 2023Updated 2 years ago
- A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAE's trained by Joseph Bloom o…☆19Oct 4, 2024Updated last year
- Official Repository of paper VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding☆293Aug 5, 2025Updated 7 months ago
- MM-Instruct: Generated Visual Instructions for Large Multimodal Model Alignment☆35Jul 1, 2024Updated last year
- Mobile-VideoGPT: Fast and Accurate Video Understanding Language Model☆137Aug 6, 2025Updated 7 months ago
- The official implementation for Collaborative Word-based Pre-trained Item Representation for Transferable Recommendation.☆25Jan 30, 2024Updated 2 years ago
- Simple program to manually caption your images (or any other file types) so you can use them for AI training☆37Mar 20, 2023Updated 3 years ago
- Code and data for CoachLM, an automatic instruction revision approach LLM instruction tuning.☆60Mar 20, 2024Updated 2 years ago
- simple and efficient baselines for practical semantic segmentation with plain ViTs☆20Mar 9, 2024Updated 2 years ago
- Statewide Visual Geolocalization in the Wild (ECCV 2024)☆74Dec 2, 2024Updated last year