☆26Nov 19, 2025Updated 6 months ago
Alternatives and similar repositories for WildVisualizer
Users that are interested in WildVisualizer are comparing it to the libraries listed below.
- ☆51Apr 4, 2025Updated last year
- ☆12Jun 5, 2024Updated last year
- ☆18Mar 5, 2017Updated 9 years ago
- A Python wrapper for the ROUGE summarization evaluation package☆14Aug 9, 2017Updated 8 years ago
- Open-sourced evaluation suite from the Monitoring Monitorability paper☆75Apr 22, 2026Updated 3 weeks ago
- [ACL 2026] From Word to World: Can Large Language Models be Implicit Text-based World Models?☆61Apr 13, 2026Updated last month
- Auditing agents for fine-tuning safety☆21Oct 21, 2025Updated 6 months ago
- Code and Data for ManyModalQA: Modality Disambiguation and QA over Diverse Inputs☆18Mar 2, 2020Updated 6 years ago
- Example formalization of Game Theoretic concepts in Lean☆27Feb 14, 2025Updated last year
- ☆15Dec 5, 2019Updated 6 years ago
- ☆19Mar 25, 2025Updated last year
- ☆14Mar 13, 2021Updated 5 years ago
- [ICLR 2025] On Evaluating the Durability of Safeguards for Open-Weight LLMs☆13Jun 20, 2025Updated 10 months ago
- Official Repository for Paper "BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Compet…☆18Aug 28, 2024Updated last year
- A long-horizon, sparse-reward math environment for reinforcement learning. Official code repo for "What makes Math problems hard for rein…☆35Aug 11, 2025Updated 9 months ago
- [Technical Report] Official PyTorch implementation code for realizing the technical part of Phantom of Latent representing equipped with …☆64Oct 9, 2024Updated last year
- Code for evaluating AI systems on the MASK honesty benchmark.☆22Mar 6, 2025Updated last year
- To Think or Not to Think: Exploring the Unthinking Vulnerability in Large Reasoning Models☆33May 21, 2025Updated 11 months ago
- ☆30Feb 11, 2022Updated 4 years ago
- Official code and data repository of MathChat: Benchmarking Mathematical Reasoning and Instruction Following in Multi-Turn Inte…☆22Jun 3, 2024Updated last year
- Code for the API, workload execution, and agents underlying the LLMail-Inject Adaptive Prompt Injection Challenge☆23Apr 9, 2026Updated last month
- Official implementation of "Beyond Theorem Proving: Formulation, Framework and Benchmark for Formal Problem-Solving"☆29May 8, 2025Updated last year
- Computer Environments Elicit General Agentic Intelligence in LLMs☆229Apr 11, 2026Updated last month
- ☆45Dec 8, 2022Updated 3 years ago
- ☆18Apr 15, 2024Updated 2 years ago
- Example agents for the Dreadnode platform☆33Dec 19, 2025Updated 5 months ago
- [AAAI'26 Oral] Official Implementation of STAR-1: Safer Alignment of Reasoning LLMs with 1K Data☆34Apr 7, 2025Updated last year
- ☆18Mar 30, 2025Updated last year
- A Qt5 app that plots timestamped MQTT data – status: unfinished alpha software.☆10May 7, 2022Updated 4 years ago
- [COLING 2025] Official repo of paper: "Not Aligned" is Not "Malicious": Being Careful about Hallucinations of Large Language Models' Jail…☆12Jul 26, 2024Updated last year
- Collections of Actions for Custom GPTs (some created by Captain Action)☆11Jan 7, 2024Updated 2 years ago
- Scripts for medium posts☆26Oct 28, 2019Updated 6 years ago
- ☆33Updated this week
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆35Apr 17, 2025Updated last year
- Independent robustness evaluation of Improving Alignment and Robustness with Short Circuiting☆17Apr 15, 2025Updated last year
- Rubik ESP32 esp-idf Device driver library.☆12Jul 3, 2021Updated 4 years ago
- The original Shared Recurrent Memory Transformer implementation☆36Jul 11, 2025Updated 10 months ago
- Repository of <FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models>☆76Jan 8, 2026Updated 4 months ago
- Verilog code for a low power RFID chip that will communicate with I2C sensors.☆13Apr 18, 2014Updated 12 years ago