philfung / awesome-computer-useLinks
Curated resources about automated GUI computer-use via LLMs. Highly opinionated, focus is on quality vs quantity.
☆23Updated last week
Alternatives and similar repositories for awesome-computer-use
Users that are interested in awesome-computer-use are comparing it to the libraries listed below
Sorting:
- Transform unstructured documents into actionable, structured data with enterprise-grade precision and reliability, ready for large-scale …☆20Updated 3 months ago
- Voice agent using LiveKit (orchestration), Cartesia (TTS), OpenAI (LLM), and Deepgram (STT)☆20Updated 2 months ago
- A swarm of LLM agents that will help you test, document, and productionize your code!☆16Updated last week
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆18Updated 3 months ago
- Open Sourced NoteBookLM☆61Updated last year
- A Python package to dynamically load functions for OpenAI Assistant☆54Updated 2 years ago
- A daemon that makes a desktop OS accessible to AI agents☆38Updated 7 months ago
- make your own NotebookLM clone with OpenAI + ElevenLabs + Cartesia☆39Updated last year
- Very minimal (and stateless) agent framework☆44Updated last year
- A couple scripts to grab stats from email☆43Updated last year
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆38Updated last year
- ☆30Updated last year
- This is a FastAPI based LLM server. Load multiple LLM models (MLX or llama.cpp) simultaneously using multiprocessing.☆15Updated last month
- Use this code to access pipeline to Gemini from inside notebookLM☆34Updated last year
- Gradio chat interface for FastMLX☆12Updated last year
- [WIP] AI Try-On plugin for Chrome☆28Updated last year
- ☆57Updated this week
- AI Agent capable of automating various tasks using MCP☆40Updated 9 months ago
- 🎥➡️📝 Hermes: Blazing-fast video transcription powered by AI gods! Transcribe 6.5 minutes of video in just 1 second using Groq's LPU. Ch…☆80Updated last year
- A framework for hosting and scaling AI agents.☆39Updated last year
- ☆42Updated last year
- CogNetX is an advanced, multimodal neural network architecture inspired by human cognition. It integrates speech, vision, and video proce…☆19Updated last week
- Probably one of the lightest native RAG + Agent apps out there,experience the power of Agent-powered models and Agent-driven knowledge ba…☆32Updated 7 months ago
- GPT-4 Level Conversational QA Trained In a Few Hours☆65Updated last year
- MCP Server implementation for Claude☆26Updated last year
- Web Interface for Vision Language Models Including InternVLM2☆25Updated last year
- Opensource chat app that uses Exa's API for web search and OpenAI o3-mini☆43Updated 7 months ago
- Example implementation of Iteration of Tought - Gives a star if you like the project☆41Updated last year
- Call another MCP client from your MCP client. Offload context windows, delegate tasks, split between models☆30Updated 11 months ago
- an auto coder which automatically fixes errors and improves the code from simple user prompt☆37Updated last year