BasedHardware / OpenGlass
Turn any glasses into AI-powered smart glasses
☆3,346Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for OpenGlass
- AI wearables☆3,688Updated this week
- Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Llama-3, Langchain, OpenAI, Upstash, Brave & Serper☆4,663Updated last month
- An open source AI wearable device that captures what you say and hear in the real world and then transcribes and stores it on your own se…☆2,951Updated 7 months ago
- High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.☆4,840Updated 3 months ago
- Inference and training library for high-quality TTS models.☆4,663Updated 3 weeks ago
- PDF to Markdown with vision models☆6,519Updated this week
- Accepted as [NeurIPS 2024] Spotlight Presentation Paper☆5,977Updated last month
- Brand new TTS solution☆14,611Updated last week
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/☆7,006Updated this week
- Clapper.app, a video synthesizer and sequencer designed for the age of AI cinema☆2,071Updated this week
- Open source Claude Artifacts – built with Llama 3.1 405B☆3,583Updated last week
- tiny vision language model☆5,798Updated this week
- rewind.ai x cursor.com = your AI assistant that has all the context. 24/7 screen & voice recording for the age of super intelligence. get…☆9,010Updated this week
- A framework for Claude Opus to intelligently orchestrate subagents.☆4,160Updated 4 months ago
- V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.☆2,258Updated last week
- AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation☆4,659Updated 4 months ago
- Zero-Shot Speech Editing and Text-to-Speech in the Wild☆7,649Updated 4 months ago
- 🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning☆7,574Updated this week
- Automate browser-based workflows with LLMs and Computer Vision☆10,526Updated this week
- A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.☆6,917Updated last week
- 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)☆5,199Updated 2 weeks ago
- An AI-powered search engine with a generative UI☆6,316Updated 2 weeks ago
- Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junio…☆7,680Updated this week
- ☆4,818Updated 3 months ago
- Private & local AI personal knowledge management app for high entropy people.☆7,185Updated this week
- Speech To Speech: an effort for an open-sourced and modular GPT4-o☆3,553Updated 3 weeks ago
- Real time interactive streaming digital human☆3,955Updated this week
- Build real-time multimodal AI applications 🤖🎙️📹☆4,032Updated this week
- [CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型☆6,079Updated this week
- AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.☆5,150Updated 3 months ago