Unofficial implementation and experiments related to Set-of-Mark (SoM) ποΈ
β88Oct 20, 2023Updated 2 years ago
Alternatives and similar repositories for SoM
Users that are interested in SoM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [arXiv 2023] Set-of-Mark Prompting for GPT-4V and LMMsβ1,519Aug 19, 2024Updated last year
- Simplex Random Feature attention, in PyTorchβ76Oct 10, 2023Updated 2 years ago
- Start using computer vision in two minutes with our interactive Python notebook experience.β25Oct 25, 2023Updated 2 years ago
- Implements RNNPool and SoftPool for CNNs.β14Jan 29, 2021Updated 5 years ago
- β15Dec 7, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- This repository implements the idea of "caption upsampling" from DALL-E 3 with Zephyr-7B and gathers results with SDXL.β159Oct 25, 2023Updated 2 years ago
- Simple program to manually caption your images (or any other file types) so you can use them for AI trainingβ37Mar 20, 2023Updated 3 years ago
- How Good is Google Bard's Visual Understanding? An Empirical Study on Open Challengesβ30Sep 24, 2023Updated 2 years ago
- MetaCLIP module for use with Autodistill.β22Dec 5, 2023Updated 2 years ago
- Streamlit app presented to the Streamlit LLMs Hackathon September 23β15May 13, 2024Updated last year
- β719Mar 6, 2024Updated 2 years ago
- Code for this paper "HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts via HyperNetwork"β33Nov 29, 2023Updated 2 years ago
- [TMLR'24] This repository includes the official implementation our paper "FedConv: Enhancing Convolutional Neural Networks for Handling Dβ¦β25Apr 30, 2024Updated last year
- A single notebook for fine-tuning GPT-3.5 turboβ31Aug 16, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- faster ROS depth image registrationβ13Jun 5, 2018Updated 7 years ago
- β107Nov 1, 2025Updated 4 months ago
- A Goodreads implementation for Obsidian.β16May 21, 2021Updated 4 years ago
- [COLM-2024] List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMsβ146Aug 23, 2024Updated last year
- This repo consists all my RL work and learningsβ12Dec 5, 2021Updated 4 years ago
- β22Aug 27, 2023Updated 2 years ago
- coloring terminal text with intensities (used for plotting probability, entropy with tokens)β12Oct 11, 2024Updated last year
- Perf monitoring CLI tool for Apple Siliconβ16Jan 1, 2024Updated 2 years ago
- A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAE's trained by Joseph Bloom oβ¦β19Oct 4, 2024Updated last year
- DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- LoRA-Ensemble: Efficient Uncertainty Modelling for Self-attention Networksβ55Mar 7, 2026Updated 2 weeks ago
- Access your Ollama inference server running on your computer from anywhere. Set up with NextJS + Langchain JS LCEL + Ngrokβ26Feb 13, 2024Updated 2 years ago
- AcSecurity is a Python module designed to scan applications for common security vulnerabilities. It checks for hardcoded secrets, dependeβ¦β16Aug 29, 2025Updated 6 months ago
- batched lorasβ351Sep 6, 2023Updated 2 years ago
- streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VLβ2,659Mar 16, 2026Updated last week
- β45Oct 13, 2023Updated 2 years ago
- A small repository containing the TeX code for the Succinct Proofs and Linear Algebra study session's slides and homeworkβ23Sep 10, 2025Updated 6 months ago
- VolSiM, a CNN-based metric to compute the similarity of 3D data from numerical simulationsβ15Oct 4, 2023Updated 2 years ago
- β88Jan 10, 2024Updated 2 years ago
- End-to-end encrypted email - Proton Mail β’ AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- SAM4SS: Tailoring SAM and SAM2 for Semantic Segmentationβ11Jul 31, 2024Updated last year
- LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skillsβ766Feb 1, 2024Updated 2 years ago
- β39Updated this week
- β10Jun 17, 2022Updated 3 years ago
- Python Server for C3 AI app. A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) withβ¦β24Jan 7, 2024Updated 2 years ago
- AI agent using GPT-4V(ision) capable of using a mouse/keyboard to interact with web UIβ1,063Dec 9, 2024Updated last year
- Multi-Modal Multi-Embodied Hivemind-like Iteration of RTX-2β15Jun 27, 2025Updated 8 months ago