ShaoShuai0605/Misevolution

Official repo of "Your Agent May Misevolve: Emergent Risks in Self-evolving LLM Agents"

☆90

Alternatives and similar repositories for Misevolution

Users who are interested in Misevolution are comparing it to the repositories listed below.

AI45Lab / DeepScan
Diagnostic Framework for LLMs and MLLMs
☆39 · Mar 2, 2026 · Updated 4 months ago
AI45Lab / DeepSafe
All-in-One Safety Evaluation Framework
☆51 · Jul 15, 2026 · Updated last week
Nebularaid2000 / rethink_sft_generalization
Repo for paper "Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability"
☆108 · Apr 23, 2026 · Updated 3 months ago
QingyuLiu / Agentic-Upward-Deception
This repo is the official implementation of “Are Your Agents Upward Deceivers?”. The paper has been accepted to ICML 2026.
☆24 · Dec 15, 2025 · Updated 7 months ago
paraynaud / MTH8408-Hiv24
☆15 · Mar 25, 2024 · Updated 2 years ago
LiYu0524 / ATbench
ATBench: A Diverse and Realistic Agent Trajectory Benchmark for Safety Evaluation and Diagnosis
☆33 · Jul 10, 2026 · Updated 2 weeks ago
akhileshthite / zipify-tunes
Convert any playlist CSVs into MP3 files with metadata, bring back your MP3 player!
☆20 · Jul 4, 2026 · Updated 3 weeks ago
tensake / litehook
Lightweight social media monitoring tool built with Rust
☆17 · Jun 10, 2026 · Updated last month
yjyddq / RiOSWorld
[NeurIPS 2025] Official repository of RiOSWorld: Benchmarking the Risk of Multimodal Computer-Use Agents
☆123 · Dec 2, 2025 · Updated 7 months ago
yjyddq / EOSER-ASS-RL
Official Repository of "Taming Masked Diffusion Language Models via Consistency Trajectory Reinforcement Learning with Fewer Decoding Ste…
☆28 · Mar 9, 2026 · Updated 4 months ago
JackXing875 / NeneBot
Ayachi Nene is the cutest in the world!
☆16 · Jul 14, 2026 · Updated last week
Belyenochi / openclaw-edd
Evaluation-Driven Development for OpenClaw agents — mine golden cases from real sessions, catch regressions before they ship.
☆18 · Mar 17, 2026 · Updated 4 months ago
ChnQ / TracingLLM
☆30 · May 22, 2024 · Updated 2 years ago
vishnu97770 / VELOTYPE
Adaptive AI-powered typing practice system that analyzes repeated user mistakes and generates personalized corrective tasks using FastAPI…
☆15 · Jun 6, 2026 · Updated last month
changwxx / ShanghaiTech-poster-template
A self-made NeurIPS poster template, infused with the unique design style of ShanghaiTech.
☆18 · Dec 26, 2023 · Updated 2 years ago
jaredrummler / consoul
A beautiful terminal-based AI chat interface built with Textual and LangChain
☆15 · Jan 7, 2026 · Updated 6 months ago
ErikZ719 / CoTA
[ICLR 26] Context Tokens are Anchors: Understanding the Repeat Curse in dMLLMs from an Information Flow Perspective
☆16 · Mar 6, 2026 · Updated 4 months ago
AI45Lab / AgentDoG
A Diagnostic Guardrail Framework for AI Agent Safety and Security
☆669 · Jun 8, 2026 · Updated last month
Jivoronix / blockchain-data-validator
☆15 · Jan 31, 2025 · Updated last year
dannysun85 / ClawX
Completely refactored on top of ClawX, with brand-new features added!
☆16 · Feb 23, 2026 · Updated 5 months ago
timjuenemann / wikipedia-mcp
Wikipedia MCP Server written in TypeScript
☆15 · Apr 18, 2025 · Updated last year
mattkang0 / weatpy
A python implementation of wego
☆15 · Oct 15, 2016 · Updated 9 years ago
omar-A-hassan / medsci-agent
Biomedical research agent with 28 MCP tools powered by MedGemma, TxGemma and OpenCode
☆18 · Mar 14, 2026 · Updated 4 months ago
tbrumue / heapo
HEAPO – An Open Dataset for Heat Pump Optimization with Smart Electricity Meter Data and On-Site Inspection Protocols
☆16 · Mar 26, 2025 · Updated last year
junminhong / awesome-agent-skills
A curated list of agent skills, resources, and tools for building customizable AI workflows (Claude Code, Codex, Kiro-CLI)
☆15 · Updated this week
AI45Lab / DEAN
☆11 · Oct 25, 2024 · Updated last year
lsm1103 / session-dashboard
Used to browse and monitor the historical session records of AI programming tools (Claude Code, Codex CLI, Cursor, Aider)
☆16 · Mar 18, 2026 · Updated 4 months ago
ycdfwzy / PL-MSCKF
☆16 · Jan 6, 2023 · Updated 3 years ago
acgessler / rust-persistent-kv
Persistent fault-tolerant key-value store in rust
☆16 · Mar 17, 2025 · Updated last year
pretty66 / fastcar
PHP long connection proxy, eliminates short links and reduces request latency
☆20 · Oct 4, 2023 · Updated 2 years ago
aaron-ang / opthash-rs
Optimal open-addressing hash maps (Elastic Hashing & Funnel Hashing) in Rust, with Python bindings.
☆16 · Updated this week
andrioid / ublproxy
☆19 · Mar 6, 2026 · Updated 4 months ago
norfablabs / picle
Python Interactive Command Line Shells
☆15 · Jun 28, 2026 · Updated 3 weeks ago
FerrisMind / twill
Idiomatic Rust styling library inspired by Tailwind for Native GUI
☆17 · Jun 30, 2026 · Updated 3 weeks ago
Carkham / FedSD2C
(NeurIPS 2024) One-shot Federated Learning via Synthetic Distiller-Distillate Communication
☆20 · Mar 11, 2025 · Updated last year
romeomanoela / jery
Jery is a server monitoring and management application. It provides a clean and modern interface to track your server performance metri…
☆16 · Feb 19, 2026 · Updated 5 months ago
arhamkhnz / inspectcn
Chrome extension to inspect and extract shadcn-style theme tokens from any website, then bring them into your project.
☆16 · Apr 3, 2026 · Updated 3 months ago
kang-1-2-3 / OPAL
Code release for CoRL'25 paper "Opal: Visibility-aware lidar-to-openstreetmap place recognition via adaptive radial fusion"
☆15 · Mar 9, 2026 · Updated 4 months ago
new-sashok724 / windows.sh
My configuration and other files for QEMU to launch Windows VM with GPU passthrough (VFIO)
☆16 · Jun 19, 2022 · Updated 4 years ago