HugoLB0 / gpt-4-enhanced
OpenAI GPT-4 assistant, combined with the power of YoloV8 realtime object detection, Whisper speech recognition, text to speech and google browsing feature.
☆16Updated 10 months ago
Related projects ⓘ
Alternatives and complementary repositories for gpt-4-enhanced
- Google Gemini Vision Web application with Speech and Text☆45Updated 9 months ago
- Real-Time Open-Vocabulary Object Detection☆13Updated 9 months ago
- Object segmentation in collaboration with Segment Anyting Model and Yolov8☆23Updated last year
- ☆42Updated last year
- Real-time lane and car detection system using YOLOv8 and OpenCV, with distance estimation for vehicles. Ideal for autonomous driving and …☆17Updated last month
- Using a single image and just 10 seconds of sample audio, our project enables you to create a video where it appears as if you're speakin…☆27Updated last year
- A RAG based Generative AI Attorney fed with Indian Penal Code data. Developed using Streamlit, LangChain and TogetherAI API.☆43Updated 6 months ago
- Using open-source LLM Llama2 by Meta on local CPU inference for document question-and-answer☆15Updated last year
- autoAnnoter its a tool to auto annotate data using a exisiting models☆43Updated 3 months ago
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode…☆10Updated 3 months ago
- This repository is about an APP to help lawyers to process law documents and suit cases using AI Agents trained with OpenAI and others LL…☆16Updated last year
- A project using YoloV8 to detect License Plates☆11Updated last year
- Implementation on Custom Dataset☆60Updated last year
- CrewAI AgentOps: Monitor your AI Agents☆13Updated 4 months ago
- Synthetic Data Generation using LLM via Argilla, Distilabel, ChatGPT, etc.☆20Updated 5 months ago
- Agentic RAG using Crew AI.☆19Updated 4 months ago
- This repo explains the custom object detection training using Yolov8.☆18Updated last year
- People Counter using YOLOv8 and Object Tracking |People Counting (Entering & Leaving)☆12Updated last year
- ☆15Updated 6 months ago
- Summarise YouTube videos and save time!☆57Updated 9 months ago
- Use Grounding DINO, Segment Anything, and GPT-4V to label images with segmentation masks for use in training smaller, fine-tuned models.☆65Updated 11 months ago
- Eye exploration☆22Updated this week
- Accurately locating each head's position in the crowd scenes is a crucial task in the field of crowd analysis. However, traditional densi…☆20Updated 8 months ago
- YOLOv8 object detection, tracking, image segmentation and pose estimation app using Ultralytics API (for detection, segmentation and pose…☆70Updated 10 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆77Updated 5 months ago
- This repo is a packaged version of the Yolov9 model.☆84Updated 2 weeks ago
- Using GPT-3 and Carrot (GPT-3 for computer vision) to create detailed descriptions of images.☆14Updated 2 years ago
- Multimodal AI App using Llava 7B and Gradio.☆37Updated 6 months ago
- Realtime voice assistant powered by Groq's whisper API, Groq's Llama and ElevenLabs text-to-speech☆29Updated 4 months ago
- Chatbot with a 3D avatar that can answer interview questions in your behalf. It can speak and understand English, German and Albanian. Ba…☆19Updated 5 months ago