Unofficial implementation and experiments related to Set-of-Mark (SoM)
☆88 · Oct 20, 2023 · Updated 2 years ago
Alternatives and similar repositories for SoM
Users who are interested in SoM are comparing it to the repositories listed below
- Simple program to manually caption your images (or any other file types) so you can use them for AI training · ☆37 · Mar 20, 2023 · Updated 2 years ago
- [arXiv 2023] Set-of-Mark Prompting for GPT-4V and LMMs · ☆1,517 · Aug 19, 2024 · Updated last year
- Simplex Random Feature attention, in PyTorch · ☆76 · Oct 10, 2023 · Updated 2 years ago
- How Good is Google Bard's Visual Understanding? An Empirical Study on Open Challenges · ☆30 · Sep 24, 2023 · Updated 2 years ago
- Streamlit app presented to the Streamlit LLMs Hackathon September 23 · ☆15 · May 13, 2024 · Updated last year
- This repository implements the idea of "caption upsampling" from DALL-E 3 with Zephyr-7B and gathers results with SDXL. · ☆159 · Oct 25, 2023 · Updated 2 years ago
- Code for this paper "HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts via HyperNetwork" · ☆33 · Nov 29, 2023 · Updated 2 years ago
- Hands-on tutorial on adversarial examples. With Streamlit app ❤️. · ☆31 · Jun 17, 2022 · Updated 3 years ago
- MetaCLIP module for use with Autodistill. · ☆22 · Dec 5, 2023 · Updated 2 years ago
- Extract valuable information from your project github Stars & Forks such as email, company, twitter and then explore it with streamlit · ☆21 · Feb 8, 2024 · Updated 2 years ago
- [TMLR'24] This repository includes the official implementation of our paper "FedConv: Enhancing Convolutional Neural Networks for Handling D… · ☆25 · Apr 30, 2024 · Updated last year
- SAM4SS: Tailoring SAM and SAM2 for Semantic Segmentation · ☆11 · Jul 31, 2024 · Updated last year
- Basic rover demo from Raspberry Pi with remote teleop over LiveKit · ☆15 · Jul 10, 2025 · Updated 7 months ago
- Modern, type-safe, zero-dependency Python library for serial port I/O access · ☆23 · Dec 16, 2025 · Updated 2 months ago
- This is a simple interface for chroma - it takes in documents, embeds them into a DB and allows you to query over them using GPT 3.5 · ☆10 · Dec 7, 2024 · Updated last year
- ☆718 · Mar 6, 2024 · Updated 2 years ago
- Implements RNNPool and SoftPool for CNNs. · ☆14 · Jan 29, 2021 · Updated 5 years ago
- A single notebook for fine-tuning GPT-3.5 turbo · ☆31 · Aug 16, 2024 · Updated last year
- This repo contains all my RL work and learnings · ☆12 · Dec 5, 2021 · Updated 4 years ago
- coloring terminal text with intensities (used for plotting probability, entropy with tokens) · ☆12 · Oct 11, 2024 · Updated last year
- SBOM Move - Automate build and transfer of SBOMs across systems · ☆25 · Updated this week
- VolSiM, a CNN-based metric to compute the similarity of 3D data from numerical simulations · ☆15 · Oct 4, 2023 · Updated 2 years ago
- ☆15 · Apr 21, 2025 · Updated 10 months ago
- ☆30 · Dec 19, 2023 · Updated 2 years ago
- batched loras · ☆350 · Sep 6, 2023 · Updated 2 years ago
- an implementation of Self-Extend, to expand the context window via grouped attention · ☆119 · Jan 7, 2024 · Updated 2 years ago
- ☆17 · Jan 2, 2024 · Updated 2 years ago
- Multi-Modal Multi-Embodied Hivemind-like Iteration of RTX-2 · ☆15 · Jun 27, 2025 · Updated 8 months ago
- ☆13 · Jul 18, 2023 · Updated 2 years ago
- ☆88 · Jan 10, 2024 · Updated 2 years ago
- ☆106 · Nov 1, 2025 · Updated 4 months ago
- [COLM-2024] List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs · ☆145 · Aug 23, 2024 · Updated last year
- ☆24 · Dec 10, 2023 · Updated 2 years ago
- Perf monitoring CLI tool for Apple Silicon · ☆16 · Jan 1, 2024 · Updated 2 years ago
- Pretraining and finetuning for visual instruction following with Mixture of Experts · ☆16 · Jan 30, 2024 · Updated 2 years ago
- ☆16 · Sep 30, 2023 · Updated 2 years ago
- A SaaS Startup using Generative AI. This is a code bug fixer SaaS built using Azure OpenAI, Stripe, SQLite, and web technologies. · ☆16 · Sep 22, 2023 · Updated 2 years ago
- ☆45 · May 20, 2025 · Updated 9 months ago
- Count Tokens of Code (forked from gocloc) · ☆44 · Aug 19, 2024 · Updated last year