philfung / awesome-computer-useLinks
Curated resources about automated GUI computer-use via LLMs. Highly opinionated, focus is on quality vs quantity.
☆23Updated 10 months ago
Alternatives and similar repositories for awesome-computer-use
Users that are interested in awesome-computer-use are comparing it to the libraries listed below
Sorting:
- Voice agent using LiveKit (orchestration), Cartesia (TTS), OpenAI (LLM), and Deepgram (STT)☆17Updated 3 weeks ago
- Probably one of the lightest native RAG + Agent apps out there,experience the power of Agent-powered models and Agent-driven knowledge ba…☆26Updated 4 months ago
- Transform unstructured documents into actionable, structured data with enterprise-grade precision and reliability, ready for large-scale …☆19Updated this week
- Glyphs, acting as collaboratively defined symbols linking related concepts, add a layer of multidimensional semantic richness to user-AI …☆52Updated 8 months ago
- ☆54Updated this week
- Advanced Coding AI Assistant that uses a Gradio interface to stream coding related responses. ChatRAG supports local and API inference an…☆22Updated 5 months ago
- InterfaceAgent: a versatile framework designed to create system and interface agents capable of managing mobile and desktop applications …☆113Updated last year
- A swarm of LLM agents that will help you test, document, and productionize your code!☆16Updated last week
- Official Repo for The Paper "Talk Structurally, Act Hierarchically: A Collaborative Framework for LLM Multi-Agent Systems"☆57Updated 7 months ago
- A Python library to orchestrate LLMs in a neural network-inspired structure☆50Updated last year
- GPT-4 Level Conversational QA Trained In a Few Hours☆65Updated last year
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆88Updated last month
- Use this code to access pipeline to Gemini from inside notebookLM☆32Updated last year
- A framework for hosting and scaling AI agents.☆38Updated 10 months ago
- Modified Beam Search with periodical restart☆12Updated last year
- LLM based agents with proactive interactions, long-term memory, external tool integration, and local deployment capabilities.☆105Updated 2 months ago
- ☆57Updated 8 months ago
- 🎥➡️📝 Hermes: Blazing-fast video transcription powered by AI gods! Transcribe 6.5 minutes of video in just 1 second using Groq's LPU. Ch…☆80Updated last year
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…☆69Updated last year
- CogNetX is an advanced, multimodal neural network architecture inspired by human cognition. It integrates speech, vision, and video proce…☆17Updated this week
- make your own NotebookLM clone with OpenAI + ElevenLabs + Cartesia☆38Updated 11 months ago
- Open Sourced NoteBookLM☆58Updated last year
- ☆21Updated 11 months ago
- [WIP] AI Try-On plugin for Chrome☆27Updated last year
- Web Interface for Vision Language Models Including InternVLM2☆23Updated last year
- A pure MLX-based training pipeline for fine-tuning LLMs using GRPO on Apple Silicon.☆44Updated 8 months ago
- ☆22Updated last year
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆22Updated 10 months ago
- This project is a **proof of concept** that aims to replicate the reasoning capabilities of OpenAI's newly released O1 model.☆90Updated 8 months ago
- MCP Server implementation for Claude☆26Updated 10 months ago