Ryaang / Web-page-Screenshot-Segmentation
Automatically split long webpage screenshots into chunks for input into models with shorter contexts. 自动将长网页截图进行区块分割,用于输入上下文较短的模型
☆17Updated 5 months ago
Alternatives and similar repositories for Web-page-Screenshot-Segmentation:
Users that are interested in Web-page-Screenshot-Segmentation are comparing it to the libraries listed below
- Unofficial Pytorch implementation of Dom-LM paper.☆33Updated 2 years ago
- Mark web pages for use with vision-language models☆30Updated 2 months ago
- Open Agent Computer Interface☆58Updated 4 months ago
- A full-stack framework for building AI workflows☆65Updated last month
- A demo that shows how to build a semantic search experience with Typesense's vector search feature and Instantsearch.js☆26Updated last year
- ☆26Updated last year
- Chat interface that searches the web for you real-time☆89Updated 5 months ago
- A collection of cookbooks to help developers get started quickly with the Firecrawl API.☆43Updated last month
- Yet Another Web Extraction SDK☆33Updated this week
- CLI to set up and deploy MCP Servers to Cloudflare Workers in seconds. Just write TypeScript functions to make Cursor MCP tools.☆27Updated last month
- A library to extract the main content from html. Developed for information on LLM and for feeding data into LangChain and LlamaIndex.☆35Updated 10 months ago
- Framework to evaluate LLM generated ReactJS code.☆54Updated last year
- Self-hosted version of Microsoft's OmniParser Image-to-text model☆56Updated 3 months ago
- A shared set of "plug and play" functions that can be invoked by OpenAI function calling mechanism.☆10Updated 8 months ago
- Jina Reader MCP Server☆31Updated 3 months ago
- Build event-driven workflows with python async functions☆33Updated 6 months ago
- ScreenQA dataset was introduced in the "ScreenQA: Large-Scale Question-Answer Pairs over Mobile App Screenshots" paper. It contains ~86K …☆107Updated last month
- GPT Librarian understands all your docs☆12Updated last year
- A function to do all☆36Updated 11 months ago
- Supervised fine-tuning of Google's open-source Gemma-2B model to optimize writing Python code☆21Updated last year
- LLM-ready data connectors☆74Updated 10 months ago
- Collection of Materials on AI Agents☆33Updated last month
- Discover existing open source projects 10x faster using AI search. This project leverages Vercel AI SDK, OpenAI & Tavily REST API to anal…☆65Updated 8 months ago
- Create a GPT chatbot for any GitHub repo in just 30 seconds☆38Updated 9 months ago
- A Next.js version of Claude Aritfacts , inspired by llamacoder☆21Updated 6 months ago
- Easy to deploy.A cloud service for python code interpreter sandbox for Code-Interpreter.☆50Updated last year
- Probably one of the lightest native RAG + Agent apps out there,experience the power of Agent-powered models and Agent-driven knowledge ba…☆22Updated 2 weeks ago
- Code interpreter support for o1☆32Updated 6 months ago
- GUI Grounding for Professional High-Resolution Computer Use☆149Updated last month
- ☆22Updated 8 months ago