SkalskiP / SoM
Unofficial implementation and experiments related to Set-of-Mark (SoM)
★77 · Updated last year
Related projects
Alternatives and complementary repositories for SoM
- Cerule - A Tiny Mighty Vision Model ★67 · Updated 2 months ago
- Framework agnostic computer vision inference. Run 1000+ models by changing only one line of code. Supports models from transformers, timm… ★119 · Updated this week
- Enhancement in Multimodal Representation Learning. ★39 · Updated 8 months ago
- Summarize any arXiv paper with ease ★60 · Updated last year
- ★62 · Updated last month
- An automated tool for discovering insights from research paper corpora ★135 · Updated 5 months ago
- ★52 · Updated 2 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio… ★77 · Updated 5 months ago
- Extract information, summarize, ask questions, and search videos using OpenAI's Vision API ★61 · Updated last year
- ★81 · Updated last month
- ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing ★65 · Updated 6 months ago
- ★30 · Updated 11 months ago
- Fast Real-time Object Detection with High-Res Output https://x.com/_akhaliq/status/1840213012818329826 ★52 · Updated last month
- Code for Paper: Harnessing Webpage UIs for Text-Rich Visual Understanding ★38 · Updated last month
- ★35 · Updated last year
- ★29 · Updated 11 months ago
- Run PaliGemma in real time ★122 · Updated 6 months ago
- Gradio UI for a Cog API ★64 · Updated 7 months ago
- ★104 · Updated 8 months ago
- The Next Generation Multi-Modality Superintelligence ★70 · Updated 2 months ago
- ★62 · Updated 4 months ago
- ★59 · Updated 5 months ago
- Finetune any model on HF in less than 30 seconds ★56 · Updated last week
- Using multiple LLMs for ensemble forecasting ★16 · Updated 10 months ago
- ★54 · Updated 10 months ago
- ★59 · Updated last month
- ★57 · Updated 11 months ago
- ★16 · Updated last month
- Routing on Random Forest (RoRF) ★84 · Updated last month