haseeb-heaven / gemini-vision-proLinks
Google Gemini Vision Web application with Speech and Text
☆45Updated last year
Alternatives and similar repositories for gemini-vision-pro
Users that are interested in gemini-vision-pro are comparing it to the libraries listed below
Sorting:
- Voice Craft is a desktop AI assistance tool designed to help people with disabilities operate a computer using their voice. This tool can…☆16Updated 2 years ago
- Example use cases for the GPT-4 Vision API☆18Updated last year
- AI Agents with Google's Gemini Pro and Gemini Pro Vision Models☆27Updated last year
- 🤖 Your Personalised AI Chat Companion With 50+ Avatars Over 10+ Categories - Powered by OpenAI's GPT-3 / ChatGPT-3.5 Turbo / GPT-4, Goog…☆52Updated 10 months ago
- Early Alpha Release: Chat with Your Image - Leveraging GPT-4 Vision and Function Calls for AI-Powered Image Analysis and Description☆76Updated last year
- ☆89Updated last year
- Generate resume summary and cover letter using the help of crewAI☆22Updated last year
- A modify of AutoGPT to AutoCluade. Use the 100k api.☆43Updated last year
- Python Streamlit web app utilizing OpenAI (GPT4) and LangChain LLM tools with access to Wikipedia, DuckDuckgo Search, and a ChromaDB with…☆72Updated last year
- talking to AI, in voice☆33Updated last year
- ☆47Updated last year
- ☆12Updated last year
- Create podcasts about any subject using ChatGPT and ElevenLabs☆30Updated last year
- Minimal example of using the Superagent.sh SDK with NextJS☆45Updated 2 years ago
- A simple playground Web UI for using the Gemini Pro Vision and Gemini Pro AI models with Next.js☆85Updated last year
- AI leetcode interviewer that assesses tech applicants. Built on Langchain and OpenAI APIs. Recruiter-focused and tracks progress and subm…☆14Updated 2 years ago
- Modern AI chatbot supporting multiple LLMs. Switch between Gemini, Mistral, Llama, Claude and ChatGPT.☆56Updated 3 months ago
- AI Agent Demo Using GPT Function Calling☆12Updated last year
- Building Apps with GPT-4-turbo with vision API and Databutton☆25Updated last year
- ☆97Updated last year
- Upload personal docs and Chat with your PDF files with this GPT4-powered app. Built with LangChain, Pinecone Vector Database, deployed on…☆38Updated 6 months ago
- Talking Santa-GPT with Speech Recognition☆13Updated last year
- Web app enabling users to either record or upload audio files. Then utilizing OpenAI API (Whisper, GPT4) generates transcriptions, summar…☆65Updated last year
- ☆23Updated last year
- Google's Gemini implemented with GPT-4 Vision, Whisper and Resemble AI☆26Updated last year
- Crawl Websites to Markdown.☆38Updated 9 months ago
- A tutorial about cloning gosameday.com☆29Updated last year
- An intellligent AI assistant that can do anything!☆54Updated last year
- Audio to summary with openAI Whisper & GPT 3.5/4 using streamlit☆61Updated last year
- A simple chat app with vision using Next.js, Vercel AI SDK, and GPT-4V.☆13Updated last year