Unofficial implementation and experiments related to Set-of-Mark (SoM) ποΈ
β87Oct 20, 2023Updated 2 years ago
Alternatives and similar repositories for SoM
Users that are interested in SoM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [arXiv 2023] Set-of-Mark Prompting for GPT-4V and LMMsβ1,542Aug 19, 2024Updated last year
- Simplex Random Feature attention, in PyTorchβ76Oct 10, 2023Updated 2 years ago
- Start using computer vision in two minutes with our interactive Python notebook experience.β24Oct 25, 2023Updated 2 years ago
- Implements RNNPool and SoftPool for CNNs.β14Jan 29, 2021Updated 5 years ago
- This repository implements the idea of "caption upsampling" from DALL-E 3 with Zephyr-7B and gathers results with SDXL.β159Oct 25, 2023Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting β’ AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Simple program to manually caption your images (or any other file types) so you can use them for AI trainingβ37Mar 20, 2023Updated 3 years ago
- How Good is Google Bard's Visual Understanding? An Empirical Study on Open Challengesβ30Sep 24, 2023Updated 2 years ago
- MetaCLIP module for use with Autodistill.β21Dec 5, 2023Updated 2 years ago
- Streamlit app presented to the Streamlit LLMs Hackathon September 23β15May 13, 2024Updated 2 years ago
- β719Mar 6, 2024Updated 2 years ago
- Code for this paper "HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts via HyperNetwork"β33Nov 29, 2023Updated 2 years ago
- [TMLR'24] This repository includes the official implementation our paper "FedConv: Enhancing Convolutional Neural Networks for Handling Dβ¦β25Apr 30, 2024Updated 2 years ago
- A single notebook for fine-tuning GPT-3.5 turboβ31Aug 16, 2024Updated last year
- faster ROS depth image registrationβ13Jun 5, 2018Updated 8 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- β107Nov 1, 2025Updated 7 months ago
- β44May 20, 2025Updated last year
- [COLM-2024] List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMsβ145Aug 23, 2024Updated last year
- β22Aug 27, 2023Updated 2 years ago
- coloring terminal text with intensities (used for plotting probability, entropy with tokens)β12Oct 11, 2024Updated last year
- A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAE's trained by Joseph Bloom oβ¦β19Oct 4, 2024Updated last year
- Perf monitoring CLI tool for Apple Siliconβ16Jan 1, 2024Updated 2 years ago
- LoRA-Ensemble: Efficient Uncertainty Modelling for Self-attention Networksβ57Mar 7, 2026Updated 3 months ago
- Access your Ollama inference server running on your computer from anywhere. Set up with NextJS + Langchain JS LCEL + Ngrokβ27Feb 13, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- β13Oct 15, 2024Updated last year
- AcSecurity is a Python module designed to scan applications for common security vulnerabilities. It checks for hardcoded secrets, dependeβ¦β16Aug 29, 2025Updated 9 months ago
- β21Oct 2, 2022Updated 3 years ago
- β17Jan 2, 2024Updated 2 years ago
- Simple CogVLM client scriptβ13Dec 20, 2023Updated 2 years ago
- I have created a dataset of Image-Text-Pairs by using the cosine similarity of the CLIP embeddings of the image & it's caption derrived fβ¦β16Apr 22, 2021Updated 5 years ago
- streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VLβ2,679Jun 8, 2026Updated last week
- β45Oct 13, 2023Updated 2 years ago
- β88Jan 10, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skillsβ767Feb 1, 2024Updated 2 years ago
- Memory-Based Instance-Level Adaptation for Cross-Domain Object Detectionβ15Jul 11, 2024Updated last year
- β46Mar 31, 2026Updated 2 months ago
- β10Jun 17, 2022Updated 3 years ago
- β12Apr 24, 2024Updated 2 years ago
- papers.dayβ93Dec 15, 2023Updated 2 years ago
- A PyTorch implementation of Mixupβ14Jun 2, 2018Updated 8 years ago