roboflow / cog-vlm-clientLinks

Simple CogVLM client script

☆14

Alternatives and similar repositories for cog-vlm-client

Users that are interested in cog-vlm-client are comparing it to the libraries listed below

Sorting:

LAION-AI / Desktop-BUD-E_V1.0
BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…
☆21Updated 9 months ago
camenduru / MiniGPT-v2-colab
☆29Updated last year
sanchit-gandhi / notebooks
A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog).
☆45Updated last year
kyegomez / Qwen-VL
My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't rel…
☆12Updated last year
roboflow / inference-client
☆14Updated last year
autodistill / autodistill-grounded-edgesam
EdgeSAM model for use with Autodistill.
☆27Updated last year
autodistill / autodistill-florence-2
Use Florence 2 to auto-label data for use in training fine-tuned object detection models.
☆65Updated 11 months ago
SkalskiP / SoM
Unofficial implementation and experiments related to Set-of-Mark (SoM) 👁️
☆87Updated last year
lancedb / yoloexplorer
YOLOExplorer : Iterate on your YOLO / CV datasets using SQL, Vector semantic search, and more within seconds
☆133Updated this week
diicellman / dynamite-dogs
BH hackathon
☆14Updated last year
iulia-b10 / multilingual-embedding-models
☆20Updated last year
camenduru / MoE-LLaVA-jupyter
☆16Updated last year
roboflow / cvevals
Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model…
☆36Updated last year
capjamesg / sam-gpt4v
Use Grounding DINO, Segment Anything, and GPT-4V to label images with segmentation masks for use in training smaller, fine-tuned models.
☆66Updated last year
tensoic / Cerule
Cerule - A Tiny Mighty Vision Model
☆66Updated 11 months ago
adithya-s-k / YoloGemma
Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…
☆82Updated last year
vrizawahyu22 / juggling_counting
☆60Updated last year
The-Swarm-Corporation / AgentParse
AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…
☆14Updated 2 weeks ago
deep-diver / Vid2Persona
This project breathes life into video characters by using AI to describe their personality and then chat with you as them.
☆47Updated last year
kyegomez / Finetuning-Suite
Finetune any model on HF in less than 30 seconds
☆57Updated 2 weeks ago
huggingface / pixparse
Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data
☆21Updated last year
dusty-nv / NanoDB
Zero-copy multimodal vector DB with CUDA and CLIP/SigLIP
☆61Updated 3 months ago
4dh / GRDN
GRDN.AI app for garden optimization
☆70Updated last year
neuralmagic / examples
Notebooks using the Neural Magic libraries 📓
☆40Updated last year
camenduru / YoloWorld-EfficientSAM-jupyter
☆46Updated last year
herrera-luis / vision-core-ai
Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.
☆46Updated last year
SkalskiP / YOLO-World
Real-Time Open-Vocabulary Object Detection
☆13Updated last year
autodistill / autodistill-gpt-4v
GPT-4V(ision) module for use with Autodistill.
☆26Updated last year
aniketmaurya / fastserve-ai
Machine Learning Serving focused on GenAI with simplicity as the top priority.
☆59Updated 3 weeks ago
13331112522 / v-rag
Visual RAG using less than 300 lines of code.
☆28Updated last year