HugoLB0 / gpt-4-enhancedLinks
OpenAI GPT-4 assistant, combined with the power of YoloV8 realtime object detection, Whisper speech recognition, text to speech and google browsing feature.
☆17Updated last year
Alternatives and similar repositories for gpt-4-enhanced
Users that are interested in gpt-4-enhanced are comparing it to the libraries listed below
Sorting:
- A project using YoloV8 to detect License Plates☆12Updated 2 years ago
- Chatbot with a 3D avatar that can answer interview questions in your behalf. It can speak and understand English, German and Albanian. Ba…☆40Updated last month
- Your customized AI assistant - Personal assistants on any hardware! With llama.cpp, whisper.cpp, ggml, LLaMA-v2.☆120Updated 2 years ago
- ☆17Updated last year
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode…☆12Updated last year
- My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't rel…☆12Updated last year
- Analyze traffic flow with YOLOv8 and ByteTrack: Vehicle detection and tracking, speed estimation, path outlining, and direction analysis …☆19Updated last year
- autoAnnoter its a tool to auto annotate data using a exisiting models☆45Updated last year
- YOLOv8 object detection, tracking, image segmentation and pose estimation app using Ultralytics API (for detection, segmentation and pose…☆79Updated 2 years ago
- Self-hosted AI voice agent☆124Updated last year
- LipSyncr is a lip reading web app based on the LipNet model that can lip read videos.☆76Updated 2 years ago
- AI_Video_Shorts_Creator is a python-based tool that uses OpenAI's GPT-4 power to automatically analyze videos, extract the most interesti…☆18Updated 2 years ago
- Using a single image and just 10 seconds of sample audio, our project enables you to create a video where it appears as if you're speakin…☆40Updated 2 years ago
- ☆11Updated 2 years ago
- Detecting facial emotions using YOLOv8 model and deep learning techniques☆10Updated last year
- Talking head video AI generator☆81Updated last year
- This repository is about an APP to help lawyers to process law documents and suit cases using AI Agents trained with OpenAI and others LL…☆18Updated 2 years ago
- Transform unstructured documents into actionable, structured data with enterprise-grade precision and reliability, ready for large-scale …☆19Updated 2 months ago
- Performing a RAG (Retrieval Augmented Generation) assessment using voice-to-voice query resolution. Provide the file containing the queri…☆44Updated last year
- 🤖 Open-source LLM server (OpenAI, Ollama, Groq, Anthropic) with support for HTTP, Streaming, Agents, RAG (Deprecated check out Orchestra…☆32Updated 7 months ago
- Vehicle speed estimation using YOLOv8☆32Updated last year
- This repo explains the custom object detection training using Yolov8.☆17Updated 2 years ago
- Multimodal AI App using Llava 7B and Gradio.☆39Updated last year
- This is a GUI application that integrates YOLOv8 object recognition with OpenAI's GPT-3 language generation model.☆36Updated 2 years ago
- Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information☆30Updated last year
- Harness the power of NVIDIA technologies and LangChain to create dynamic avatars from live speech, integrating RIVA ASR and TTS with Audi…☆95Updated last year
- This model is very useful to detecting cars, buses, and trucks in a video.☆31Updated 5 months ago
- Luann (fka TypeAgent) allows you to create many LLM based agent(Various types of agent,scale up)☆23Updated last month
- This repo is a packaged version of the Yolov9 model.☆87Updated last month
- A comprehensive tool for processing and analyzing video footage, producing detailed insights into gameplay and player performance enhanci…☆149Updated 11 months ago