tapanBabbar9 / computer-vision
Experiments with CV
☆25Updated last month
Alternatives and similar repositories for computer-vision:
Users that are interested in computer-vision are comparing it to the libraries listed below
- ☆20Updated 3 months ago
- Google's Gemini implemented with GPT-4 Vision, Whisper and Resemble AI☆27Updated last year
- Tool4AI: A model agnostic, LLM friendly router for tool/function call☆14Updated 6 months ago
- [WIP] AI Try-On plugin for Chrome☆27Updated 11 months ago
- This repository will guide you to create your Images via Stable Diffusion using a Smart Virtual Assistant like Google Assistant using Ope…☆36Updated 2 years ago
- Jockey is a conversational video agent.☆73Updated 3 weeks ago
- AI narrator☆15Updated last year
- Chat with Phi 3.5/3 Vision LLMs. Phi-3.5-vision is a lightweight, state-of-the-art open multimodal model built upon datasets which includ…☆33Updated last month
- Chrome extension that interacts with content using Groq☆41Updated last month
- 🧠 Mem4AI: A LLM Friendly memory management library.☆18Updated 3 months ago
- A mono-repo to house the various supported Transport options to be used with Pipecat's client-js package☆13Updated last week
- ☆22Updated 4 months ago
- The purpose of the "Meta Agent with More Agents" project is to dynamically solve complex queries by breaking them down into smaller tasks…☆22Updated last month
- ☆12Updated 2 months ago
- 🎥➡️📝 Hermes: Blazing-fast video transcription powered by AI gods! Transcribe 6.5 minutes of video in just 1 second using Groq's LPU. Ch…☆74Updated 5 months ago
- An JS web client for connecting to Pipecat bots with voice and vision☆43Updated 2 months ago
- Small Multimodal Vision Model "Imp-v1-3b" trained using Phi-2 and Siglip.☆16Updated last year
- Hybrid-RAG is a hybrid Retrieval-Augmented Generation (RAG) model that leverages BERT for retrieving relevant documents and GPT-2 for gen…☆24Updated 2 weeks ago
- Educational framework exploring ergonomic, lightweight multi-agent orchestration. Modified to use local Ollama endpoint☆48Updated 4 months ago
- This project is a **proof of concept** that aims to replicate the reasoning capabilities of OpenAI's newly released O1 model.☆85Updated 3 weeks ago
- Easily convert HuggingFace models to GGUF-format for llama.cpp☆21Updated 6 months ago
- Build your own RAG and run it locally on your laptop: ColBERT + DSPy + Streamlit☆56Updated 11 months ago
- GroqChat: Local ChatGPT-like environment in your browser using best open model LLama 3.1 Series on the Grow fastest inference engine.☆79Updated 6 months ago
- Web Interface for Vision Language Models Including InternVLM2☆17Updated 6 months ago
- Simple CogVLM client script☆14Updated last year
- ☆46Updated last year
- Okra, your all in one personal AI assistant☆14Updated 8 months ago