Official Implementation of "ToolSafe: Enhancing Tool Invocation Safety of LLM-based Agents via Proactive Step-level Guardrail and Feedback"
☆56Mar 25, 2026Updated last month
Alternatives and similar repositories for ToolSafe
Users that are interested in ToolSafe are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆82May 2, 2026Updated 2 weeks ago
- [CVPR 2026] Official repo for "EVATok: Adaptive Length Video Tokenization for Efficient Visual Autoregressive Generation"☆58Mar 13, 2026Updated 2 months ago
- Official implementation of FRAPPE: Infusing World Modeling into Generalist Policies via Multiple Future Representation Alignment☆45Mar 24, 2026Updated last month
- [ICLR'26] MARSHAL: Incentivizing Multi-Agent Reasoning via Self-Play with Strategic LLMs☆47Apr 17, 2026Updated last month
- The evaluation code for A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5☆53Jan 18, 2026Updated 4 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Residual Context Diffusion (RCD): Repurposing discarded signals as structured priors for high-performance reasoning in dLLMs.☆56Mar 12, 2026Updated 2 months ago
- ☆14Dec 18, 2024Updated last year
- daVinci-Agency: Unlocking Long-Horizon Agency Data-Efficiently☆39Feb 4, 2026Updated 3 months ago
- The official implementation of "EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis".☆143Feb 12, 2026Updated 3 months ago
- The code for paper "Rethinking LLM-as-a-Judge: Representation-as-a-Judge with Small Language Models via Semantic Capacity Asymmetry", acc…☆71Feb 3, 2026Updated 3 months ago
- [ArXiv 26] The official repository of "ArtHOI: Articulated Human-Object Interaction Synthesis by 4D Reconstruction from Video Priors".☆35Mar 5, 2026Updated 2 months ago
- TBD☆56Mar 13, 2026Updated 2 months ago
- [ACL 2026 Findings] "Omni-R1: Towards the Unified Generative Paradigm for Multimodal Reasoning"☆62Jan 28, 2026Updated 3 months ago
- Adversarial Attack for Pre-trained Code Models☆10Jul 19, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- In-Context Reinforcement Learning for Tool Use in Large Language Models☆47Mar 26, 2026Updated last month
- Open Ended Medical Reinforcement Learning☆54Mar 15, 2026Updated 2 months ago
- ☆22Jun 16, 2025Updated 11 months ago
- ☆19Aug 3, 2024Updated last year
- ☆36Jan 30, 2026Updated 3 months ago
- SG-Bench: Evaluating LLM Safety Generalization Across Diverse Tasks and Prompt Types☆25Nov 29, 2024Updated last year
- Official implementation of "PyVision-RL: Forging Open Agentic Vision Models via RL."☆66Feb 25, 2026Updated 2 months ago
- ☆44Mar 23, 2026Updated last month
- [AAAI 2024] History Matters: Temporal Knowledge Editing in Large Language Model☆14Dec 17, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Awesome Long-CoT Data☆20Mar 26, 2025Updated last year
- Medical SAM3: A Foundation Model for Universal Prompt-Driven Medical Image Segmentation☆162Jan 20, 2026Updated 4 months ago
- Official repository for ToolScope: An Agentic Framework for Vision-Guided and Long-Horizon Tool Use☆30Nov 4, 2025Updated 6 months ago
- The Code and Script of "David's Slingshot: A Strategic Coordination Framework of Small LLMs Matches Large LLMs in Data Synthesis"☆34Jun 13, 2025Updated 11 months ago
- ☆13Sep 26, 2025Updated 7 months ago
- ☆14Feb 26, 2024Updated 2 years ago
- Green-VLA: Staged Vision-Language-Action Model for Generalist Robots☆131Mar 5, 2026Updated 2 months ago
- ☆69Feb 6, 2026Updated 3 months ago
- MTU-Bench: A Multi-granularity Tool-Use Benchmark for Large Language Models☆60Jul 24, 2025Updated 9 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Measuring RAG solutions throughput and latency☆20Jul 23, 2024Updated last year
- 🧨 TradeTrap: Are LLM-based Trading Agents Truly Reliable and Faithful?☆75Nov 27, 2025Updated 5 months ago
- The first open-domain closed-loop revisited benchmark for evaluating memory consistency and action control in world models.☆53Feb 10, 2026Updated 3 months ago
- ☆56Mar 18, 2026Updated 2 months ago
- Resources and paper list for 'Scaling Environments for Agents'. This repository accompanies our survey on how environments contribute to …☆67Jan 28, 2026Updated 3 months ago
- Repository for our paper "DeepEdit: Knowledge Editing as Decoding with Constraints". https://arxiv.org/abs/2401.10471☆21Jun 19, 2024Updated last year
- Repo for the paper "Meta SecAlign: A Secure Foundation LLM Against Prompt Injection Attacks".☆67Updated this week