Unofficial implementation and experiments related to Set-of-Mark (SoM) ποΈ
β87Oct 20, 2023Updated 2 years ago
Alternatives and similar repositories for SoM
Users that are interested in SoM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [arXiv 2023] Set-of-Mark Prompting for GPT-4V and LMMsβ1,524Aug 19, 2024Updated last year
- Start using computer vision in two minutes with our interactive Python notebook experience.β24Oct 25, 2023Updated 2 years ago
- β14Dec 7, 2023Updated 2 years ago
- This repository implements the idea of "caption upsampling" from DALL-E 3 with Zephyr-7B and gathers results with SDXL.β159Oct 25, 2023Updated 2 years ago
- Hands-on tutorial on adversarial examples π. With Streamlit app β€οΈ.β32Jun 17, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- MetaCLIP module for use with Autodistill.β21Dec 5, 2023Updated 2 years ago
- β719Mar 6, 2024Updated 2 years ago
- Code for this paper "HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts via HyperNetwork"β33Nov 29, 2023Updated 2 years ago
- [TMLR'24] This repository includes the official implementation our paper "FedConv: Enhancing Convolutional Neural Networks for Handling Dβ¦β25Apr 30, 2024Updated last year
- A single notebook for fine-tuning GPT-3.5 turboβ31Aug 16, 2024Updated last year
- faster ROS depth image registrationβ13Jun 5, 2018Updated 7 years ago
- β107Nov 1, 2025Updated 5 months ago
- β45May 20, 2025Updated 10 months ago
- This repo consists all my RL work and learningsβ12Dec 5, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- β22Aug 27, 2023Updated 2 years ago
- coloring terminal text with intensities (used for plotting probability, entropy with tokens)β12Oct 11, 2024Updated last year
- Perf monitoring CLI tool for Apple Siliconβ16Jan 1, 2024Updated 2 years ago
- LoRA-Ensemble: Efficient Uncertainty Modelling for Self-attention Networksβ56Mar 7, 2026Updated last month
- Extract valuable information from your project github Stars & Forks such as email, company, twitter and then explore it with streamlitπβ21Feb 8, 2024Updated 2 years ago
- β21Oct 2, 2022Updated 3 years ago
- batched lorasβ351Sep 6, 2023Updated 2 years ago
- β17Jan 2, 2024Updated 2 years ago
- Simple CogVLM client scriptβ14Dec 20, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits β’ AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Hutter Prize Submissionβ13Aug 9, 2021Updated 4 years ago
- I have created a dataset of Image-Text-Pairs by using the cosine similarity of the CLIP embeddings of the image & it's caption derrived fβ¦β16Apr 22, 2021Updated 4 years ago
- streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VLβ2,667Apr 6, 2026Updated last week
- β45Oct 13, 2023Updated 2 years ago
- A small repository containing the TeX code for the Succinct Proofs and Linear Algebra study session's slides and homeworkβ22Sep 10, 2025Updated 7 months ago
- β88Jan 10, 2024Updated 2 years ago
- VolSiM, a CNN-based metric to compute the similarity of 3D data from numerical simulationsβ15Oct 4, 2023Updated 2 years ago
- SAM4SS: Tailoring SAM and SAM2 for Semantic Segmentationβ11Jul 31, 2024Updated last year
- LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skillsβ766Feb 1, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean β’ AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Memory-Based Instance-Level Adaptation for Cross-Domain Object Detectionβ15Jul 11, 2024Updated last year
- papers.dayβ93Dec 15, 2023Updated 2 years ago
- Python Server for C3 AI app. A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) withβ¦β24Jan 7, 2024Updated 2 years ago
- AI agent using GPT-4V(ision) capable of using a mouse/keyboard to interact with web UIβ1,064Dec 9, 2024Updated last year
- Multi-Modal Multi-Embodied Hivemind-like Iteration of RTX-2β15Jun 27, 2025Updated 9 months ago
- llama.cpp with BakLLaVA model describes what does it seeβ379Nov 8, 2023Updated 2 years ago
- an implementation of Self-Extend, to expand the context window via grouped attentionβ119Jan 7, 2024Updated 2 years ago