safety-research / assistant-axisView external linksLinks
The Assistant Axis is a direction in activation space that captures how "Assistant-like" a model's behavior is. Models can drift away from the Assistant during conversations—sometimes toward bizarre or harmful personas. This repo contains a pipeline for generating the Assistant Axis and notebooks for monitoring and steering with it.
☆69Jan 20, 2026Updated 3 weeks ago
Alternatives and similar repositories for assistant-axis
Users that are interested in assistant-axis are comparing it to the libraries listed below
Sorting:
- An open-source session replay tool for single-page applications that uses AI analysis, aggregated trends, and a RAG chatbot to help devel…☆11Jan 23, 2026Updated 3 weeks ago
- Test-Time Memory Framework: Control Hallucinations in Foundation Models☆11Nov 4, 2025Updated 3 months ago
- Convert source code to LLM ready knowledge base☆28Dec 30, 2025Updated last month
- Contact Page to send E-mails. Sends requests to Cloudflare Worker. Uses Google Invisible Captcha to reduce spam, Sentry to log errors & S…☆11Jan 9, 2023Updated 3 years ago
- e2e encrypted secret sharing via p2p exchanges & embedded vaults☆11Feb 1, 2026Updated last week
- A comprehensive guide for web developers to master DevOps practices in their workflow.☆10Nov 2, 2023Updated 2 years ago
- Official PyTorch Implementation of Federated Learning with Positive and Unlabeled Data☆10Aug 12, 2022Updated 3 years ago
- A Spotify 'Now Playing' screen designed for Raspberry Pi☆12Jun 17, 2025Updated 7 months ago
- Autodoc project, aimed to autonomously generating documentation for any code repository and store info about it in vectorstore to further…☆14Nov 19, 2025Updated 2 months ago
- ☆13Nov 2, 2021Updated 4 years ago
- Volvo P2 (S60) RTI retrofit with Android Auto, Carplay, Handsfree etc.☆17Sep 29, 2025Updated 4 months ago
- A Serverless Scheduler and Queue system built on top of cloudflare workers, D1 and Durable Objects to handle scale and schedule/queue mil…☆11Dec 15, 2024Updated last year
- Using AnyCable and Hanami to build an app to process Twilio Media streams☆13Nov 12, 2024Updated last year
- A fast, simple image media proxy written in Rust that runs on Cloudflare Workers.☆11Dec 10, 2024Updated last year
- A generative UI component framework☆17Nov 26, 2024Updated last year
- Discord bot for "button" roles, using Cloudflare Workers☆10Aug 13, 2025Updated 6 months ago
- A tutorial system for learning the shell/command line. For Windows and macOS/Linux.☆10Feb 7, 2021Updated 5 years ago
- A starter kit for building secure ai agents on Cloudflare with Auth0☆20Dec 4, 2025Updated 2 months ago
- Anonymous Chat Cloudflare Worker Telegram Bot☆11Jan 31, 2025Updated last year
- Repo for the testing-genai workshop☆13May 8, 2025Updated 9 months ago
- ☆16May 31, 2025Updated 8 months ago
- O Crawler Shein é um projeto de Web Scraping automatizado desenvolvido para extrair informações detalhadas de produtos no site Shein.☆13Apr 18, 2024Updated last year
- Measure the diversity of image descriptions, repository for our COLING 2018 paper.☆13Dec 29, 2019Updated 6 years ago
- Sample implementations of Google's Agent Development Kit (ADK) with various agent types, tool integrations, and orchestration patterns.☆18May 14, 2025Updated 9 months ago
- ☆15May 22, 2023Updated 2 years ago
- AI-powered flashcard generator built with React and Google Gemini . Create and customize quiz content seamlessly for an interactive learn…☆12Jan 4, 2026Updated last month
- The simplest roadmap of becoming a solid Software engineer☆14Feb 9, 2019Updated 7 years ago
- LLM model runway server☆13Sep 13, 2023Updated 2 years ago
- A serverless discord bot template that runs on Cloudflare Workers☆18Sep 17, 2025Updated 4 months ago
- [Retired] Prototype idea for a multi-threaded ecs☆14Mar 27, 2020Updated 5 years ago
- Ruby Next browser playground☆15Jan 13, 2026Updated last month
- LLM-based prototype for nexgen AutoML☆12Oct 9, 2025Updated 4 months ago
- Cloudflare Worker to analyze SEO of input website using Cloudflare browser rendering and Llama-3.2 hosted on Workers AI☆11Jan 22, 2025Updated last year
- javascript undetected automation, based on playwright without the detection☆17Aug 26, 2025Updated 5 months ago
- UUIDs-as-a-Service to generate UUIDs. Powered by Cloudflare Workers☆13Dec 22, 2025Updated last month
- Simple example of stereo portal rendering in VR with Unity on Meta Quest.☆15Mar 24, 2024Updated last year
- The homepage for ConvSearch Dataset.☆14May 31, 2022Updated 3 years ago
- Messaging between C++ and Python using RabbitMQ☆11Aug 17, 2018Updated 7 years ago
- Disentangled Synthesis for Domain Adaptation☆13Nov 1, 2018Updated 7 years ago