Official Implementation of "ToolSafe: Enhancing Tool Invocation Safety of LLM-based Agents via Proactive Step-level Guardrail and Feedback"
☆34 · Jan 23, 2026 · Updated last month
Alternatives and similar repositories for ToolSafe
Users who are interested in ToolSafe are comparing it to the libraries listed below.
- The evaluation code for A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5 ☆50 · Jan 18, 2026 · Updated last month
- Residual Context Diffusion (RCD): Repurposing discarded signals as structured priors for high-performance reasoning in dLLMs. ☆54 · Feb 11, 2026 · Updated 2 weeks ago
- ☆14 · Dec 18, 2024 · Updated last year
- The first open-domain closed-loop revisited benchmark for evaluating memory consistency and action control in world models. ☆41 · Feb 10, 2026 · Updated 2 weeks ago
- A documentation system that captures not just what you built, but why, how, and what you learned. Designed for human-LLM collaboration. ☆31 · Jan 13, 2026 · Updated last month
- ☆44 · Feb 13, 2026 · Updated 2 weeks ago
- The official implementation of "EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis". ☆95 · Feb 12, 2026 · Updated 2 weeks ago
- Official repository for ToolScope: An Agentic Framework for Vision-Guided and Long-Horizon Tool Use ☆28 · Nov 4, 2025 · Updated 3 months ago
- Measuring RAG solutions throughput and latency ☆19 · Jul 23, 2024 · Updated last year
- Official implementation of "PyVision-RL: Forging Open Agentic Vision Models via RL." ☆61 · Updated this week
- "Omni-R1: Towards the Unified Generative Paradigm for Multimodal Reasoning" ☆51 · Jan 28, 2026 · Updated 3 weeks ago
- A Knowledge-grounded framework for Autonomous ML/AI Program Synthesis and Optimization ☆74 · Feb 20, 2026 · Updated last week
- TBD ☆40 · Feb 3, 2026 · Updated 3 weeks ago
- On demand communication ☆32 · Feb 12, 2026 · Updated 2 weeks ago
- Resources and paper list for 'Scaling Environments for Agents'. This repository accompanies our survey on how environments contribute to … ☆61 · Jan 28, 2026 · Updated last month
- Official implementation of Log-linear Sparse Attention (LLSA). ☆58 · Feb 2, 2026 · Updated 3 weeks ago
- A highly contextualized retrieval system integrating Large Language Models (LLMs), embeddings, and a dynamic agent-driven framework. Supp… ☆27 · Sep 24, 2025 · Updated 5 months ago
- Medical SAM3: A Foundation Model for Universal Prompt-Driven Medical Image Segmentation ☆94 · Jan 20, 2026 · Updated last month
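- 📚 [Updating] AI-Driven Enterprise Security: Architecture, Methodology, and Practice: a hands-on guide to AI-driven enterprise security, covering security architecture design, methodology frameworks, and engineering practice. It systematically proposes the AISecOps methodology framework, which takes AI… ☆83 · Jan 31, 2026 · Updated 3 weeks ago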
- Vstream - Video Analytics pipeline with Hardware based accelerations (dev - stage) ☆10 · Feb 2, 2024 · Updated 2 years ago
- sora2 free watermark remover ☆767 · Feb 20, 2026 · Updated last week
- [ICML 2025] Official resources of "KBQA-o1: Agentic Knowledge Base Question Answering with Monte Carlo Tree Search". ☆34 · Dec 6, 2025 · Updated 2 months ago
- Multi-step AI agents powered by Gemini 2.0 and the LangGraph framework. These agents orchestrate complex workflows and enhance their reas… ☆10 · Dec 19, 2024 · Updated last year
- ASTRA is an end-to-end system for synthesizing agentic trajectories and rule-verifiable environments for SFT and RL training, developed b… ☆114 · Jan 30, 2026 · Updated 3 weeks ago
- Dr. MAS is an end-to-end RL training framework for multi-agent LLM systems, supporting the co-training of multiple (heterogeneous) LLMs. ☆89 · Feb 11, 2026 · Updated 2 weeks ago
- ☆43 · Feb 9, 2026 · Updated 2 weeks ago
- AI-native knowledge kernel for human/agent collaboration. Use it as a Knowledge Base, Wiki, Annotator, Research Tool, or Agentic Memory. ☆29 · Updated this week
- A Multi-Session and Multi-Therapy Benchmark for High-Realism AI Psychological Counselor ☆29 · Jan 13, 2026 · Updated last month
- Software to enable data-rich collaboration from high-resolution display walls to your laptop ☆16 · Feb 19, 2026 · Updated last week
- Martingale posterior neural networks for fast sequential decision making @ Neurips 2025 ☆23 · Nov 13, 2025 · Updated 3 months ago
- MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head (ICLR 2026) ☆123 · Feb 6, 2026 · Updated 3 weeks ago
- ☆13 · Oct 21, 2024 · Updated last year
- ☆24 · Dec 19, 2025 · Updated 2 months ago
- [CVPR 2026] Official repo for "VideoSSR: Video Self-Supervised Reinforcement Learning" ☆32 · Nov 11, 2025 · Updated 3 months ago
- ☆14 · May 1, 2023 · Updated 2 years ago
- Benchmark evaluating ocean forecasting systems against reference datasets and observations. ☆24 · Feb 20, 2026 · Updated last week
- ☆16 · Jan 16, 2025 · Updated last year
- 🚀 100% local RAG system with one-command setup. Your data never leaves your server. AGPL-3.0 ☆29 · Updated this week
- An implementation of MSSRM method ☆11 · Mar 23, 2023 · Updated 2 years ago