iMeanAI / WebCanvas
Connect agents to live web environments evaluation.
☆224Updated 3 weeks ago
Alternatives and similar repositories for WebCanvas:
Users that are interested in WebCanvas are comparing it to the libraries listed below
- Toolkit for Prompt Compression☆243Updated 3 months ago
- [ACL 2024] RA-ISF: Learning to Answer and Understand from Retrieval Augmentation via Iterative Self-Feedback.☆187Updated 4 months ago
- Tool-Planner: Dynamic Solution Tree Planning for Large Language Model with Tool Clustering☆102Updated 6 months ago
- [🏆Outstanding Paper Award at ACL 2024] MMToM-QA: Multimodal Theory of Mind Question Answering☆134Updated 2 months ago
- An awesome repository & A comprehensive survey on interpretability of LLM attention heads.☆293Updated this week
- ☆116Updated 7 months ago
- The official repository of our survey paper: "Towards a Unified View of Preference Learning for Large Language Models: A Survey"☆151Updated 2 months ago
- ☆208Updated last month
- ☆335Updated 2 months ago
- Prompt Learning using Metaheuristics☆135Updated 11 months ago
- GPT-4 level function calling models for real-world tool using use cases☆228Updated 3 months ago
- xFinder: Robust and Pinpoint Answer Extraction for Large Language Models☆148Updated last week
- A framework offers an OS simulator within a Python Code Interface for AI Agents☆53Updated this week
- MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation Experts☆163Updated 3 months ago
- Code for ACL 2024 paper "TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space"☆139Updated 9 months ago
- Multi-agent to generate LangGPT prompts.☆110Updated last week
- [COLING 2025] Official code of the paper "The Dark Side of Function Calling: Pathways to Jailbreaking Large Language Models"☆46Updated 3 weeks ago
- A package for parsing PDFs and analyzing their content using LLMs.☆256Updated 5 months ago
- Llama-github is an open-source Python library that empowers LLM Chatbots, AI Agents, and Auto-dev Solutions to conduct Agentic RAG from a…☆268Updated last month
- Video QA Assistant based on LLMs with frame convolution☆205Updated last year
- MoH: Multi-Head Attention as Mixture-of-Head Attention☆189Updated 2 months ago
- LLaVA-Mini is a unified large multimodal model (LMM) that can support the understanding of images, high-resolution images, and videos in …☆258Updated this week
- Building a comprehensive and handy list of papers for GUI agents☆163Updated last week
- [ACL 2024] AUTOACT: Automatic Agent Learning from Scratch for QA via Self-Planning☆203Updated this week
- ☆177Updated last year
- ☆375Updated 5 months ago
- ☆37Updated 5 months ago
- Building Intelligent Applications: The Powerful Combination of LangChain, Neo4j, and GraphRAG☆46Updated last month
- [NeurIPS'24] "Membership Inference Attacks against Fine-tuned Large Language Models via Self-prompt Calibration"☆159Updated 2 weeks ago
- Pytorch Implementation of "Sinkhorn Distance Minimization for Knowledge Distillation", COLING 2024 and TNNLS 2024☆119Updated last month