SkalskiP/SoM

Unofficial implementation and experiments related to Set-of-Mark (SoM) 👁️

☆88

Alternatives and similar repositories for SoM

Users who are interested in SoM are comparing it to the repositories listed below.

ANTONIOPSD / CaptionIMG
Simple program to manually caption your images (or any other file types) so you can use them for AI training
☆37 · Mar 20, 2023 · Updated 2 years ago
microsoft / SoM
[arXiv 2023] Set-of-Mark Prompting for GPT-4V and LMMs
☆1,517 · Aug 19, 2024 · Updated last year
notarussianteenager / srf-attention
Simplex Random Feature attention, in PyTorch
☆76 · Oct 10, 2023 · Updated 2 years ago
htqin / GoogleBard-VisUnderstand
How Good is Google Bard's Visual Understanding? An Empirical Study on Open Challenges
☆30 · Sep 24, 2023 · Updated 2 years ago
enricd / st_llms_arena
Streamlit app presented to the Streamlit LLMs Hackathon September 23
☆15 · May 13, 2024 · Updated last year
sayakpaul / caption-upsampling
This repository implements the idea of "caption upsampling" from DALL-E 3 with Zephyr-7B and gathers results with SDXL.
☆159 · Oct 25, 2023 · Updated 2 years ago
giangdip2410 / HyperRouter
Code for the paper "HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts via HyperNetwork"
☆33 · Nov 29, 2023 · Updated 2 years ago
Paulescu / adversarial-machine-learning
Hands-on tutorial on adversarial examples 😈. With Streamlit app ❤️.
☆31 · Jun 17, 2022 · Updated 3 years ago
autodistill / autodistill-metaclip
MetaCLIP module for use with Autodistill.
☆22 · Dec 5, 2023 · Updated 2 years ago
StanGirard / starfinder
Extract valuable information from your project's GitHub Stars & Forks, such as email, company, and Twitter, and then explore it with Streamlit 🌟
☆21 · Feb 8, 2024 · Updated 2 years ago
UCSC-VLAA / FedConv
[TMLR'24] This repository includes the official implementation of our paper "FedConv: Enhancing Convolutional Neural Networks for Handling D…
☆25 · Apr 30, 2024 · Updated last year
zhengxuJosh / SAM4SS
SAM4SS: Tailoring SAM and SAM2 for Semantic Segmentation
☆11 · Jul 31, 2024 · Updated last year
livekit-examples / rover-teleop
Basic rover demo from Raspberry Pi with remote teleop over LiveKit
☆15 · Jul 10, 2025 · Updated 7 months ago
michealroberts / samps
Modern, type-safe, zero-dependency Python library for serial port I/O access
☆23 · Dec 16, 2025 · Updated 2 months ago
Atlas3DSS / Simple_Chroma_Interface
A simple interface for Chroma: it takes in documents, embeds them into a DB, and allows you to query over them using GPT-3.5
☆10 · Dec 7, 2024 · Updated last year
SkunkworksAI / BakLLaVA
☆718 · Mar 6, 2024 · Updated 2 years ago
sayakpaul / Revisiting-Pooling-in-CNNs
Implements RNNPool and SoftPool for CNNs.
☆14 · Jan 29, 2021 · Updated 5 years ago
RyanLucas3 / poasterGPT
A single notebook for fine-tuning GPT-3.5 Turbo
☆31 · Aug 16, 2024 · Updated last year
NandaKishoreJoshi / Reinforcement_Lerning
This repo contains all my RL work and learnings
☆12 · Dec 5, 2021 · Updated 4 years ago
swairshah / Intensify
coloring terminal text with intensities (used for plotting probability, entropy with tokens)
☆12 · Oct 11, 2024 · Updated last year
interlynk-io / sbommv
SBOM Move - Automate build and transfer of SBOMs across systems
☆25 · Updated this week
tum-pbs / VOLSIM
VolSiM, a CNN-based metric to compute the similarity of 3D data from numerical simulations
☆15 · Oct 4, 2023 · Updated 2 years ago
at-aaims / forge
☆15 · Apr 21, 2025 · Updated 10 months ago
camenduru / ShareGPT4V-colab
☆30 · Dec 19, 2023 · Updated 2 years ago
sabetAI / BLoRA
batched loras
☆350 · Sep 6, 2023 · Updated 2 years ago
sdan / selfextend
an implementation of Self-Extend, to expand the context window via grouped attention
☆119 · Jan 7, 2024 · Updated 2 years ago
camenduru / MotionGPT-colab
☆17 · Jan 2, 2024 · Updated 2 years ago
kyegomez / HRTX
Multi-Modal Multi-Embodied Hivemind-like Iteration of RTX-2
☆15 · Jun 27, 2025 · Updated 8 months ago
Coding-Crashkurse / LangChain-Flask-Blog
☆13 · Jul 18, 2023 · Updated 2 years ago
gregor-ge / mBLIP
☆88 · Jan 10, 2024 · Updated 2 years ago
QuixiAI / dolphin-logger
☆106 · Nov 1, 2025 · Updated 4 months ago
zzxslp / SoM-LLaVA
[COLM-2024] List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs
☆145 · Aug 23, 2024 · Updated last year
camenduru / DemoFusion-colab
☆24 · Dec 10, 2023 · Updated 2 years ago
ivanfioravanti / asitop
Perf monitoring CLI tool for Apple Silicon
☆16 · Jan 1, 2024 · Updated 2 years ago
Delve-ERAV1 / Phi-2-Vision-Language
Pretraining and finetuning for visual instruction following with Mixture of Experts
☆16 · Jan 30, 2024 · Updated 2 years ago
camenduru / video-dubbing-colab
☆16 · Sep 30, 2023 · Updated 2 years ago
AIAnytime / SaaS-Startup-using-Generative-AI
A SaaS Startup using Generative AI. This is a code bug fixer SaaS built using Azure OpenAI, Stripe, SQLite, and web technologies.
☆16 · Sep 22, 2023 · Updated 2 years ago
MCR-PEFT / Ex-MCR
☆45 · May 20, 2025 · Updated 9 months ago
yaohui-wyh / ctoc
Count Tokens of Code (forked from gocloc)
☆44 · Aug 19, 2024 · Updated last year