OPTML-Group / Unlearn-TraceLinks
Unlearning Isn't Invisible: Detecting Unlearning Traces in LLMs from Model Outputs
☆19Updated this week
Alternatives and similar repositories for Unlearn-Trace
Users that are interested in Unlearn-Trace are comparing it to the libraries listed below
Sorting:
- IAttorney is an intelligent legal assistant built using Flask, LangChain, FAISS, OpenAI, and a RAG (Retrieval-Augmented Generation) pipel…☆11Updated 2 months ago
- ☆18Updated last month
- A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAE's trained by Joseph Bloom o…☆19Updated 8 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆22Updated 7 months ago
- ☆13Updated 6 months ago
- This is the official code for the paper "Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation"☆48Updated 4 months ago
- Example implementation of Iteration of Tought - Gives a star if you like the project☆41Updated 6 months ago
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1…☆14Updated last year
- A framework for hosting and scaling AI agents.☆35Updated 7 months ago
- XmodelLM☆39Updated 7 months ago
- Using modal.com to process FineWeb-edu data☆20Updated 2 months ago
- AgentFence is an open-source platform for automatically testing AI agent security. It identifies vulnerabilities such as prompt injection…☆15Updated 3 months ago
- ☆16Updated 3 months ago
- ☆77Updated 7 months ago
- Easiest way to build custom agents, in a no-code notion style editor, using simple macros.☆27Updated 7 months ago
- The repository for papaer "Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs"☆12Updated 6 months ago
- Make DSPy Agentic using protocol-first approach that support the Agent Protocols like MCP, A2A☆28Updated last month
- ☆63Updated last month
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆68Updated 3 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆31Updated 2 months ago
- ☆18Updated last year
- ☆10Updated 2 months ago
- ☆21Updated 7 months ago
- AI conflict resolution framework designed to work alongside existing AI orchestration tools☆24Updated 6 months ago
- adapt data to and from every format☆19Updated last week
- How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training☆36Updated 2 months ago
- ☆1Updated 11 months ago
- Testing paligemma2 finetuning on reasoning dataset☆18Updated 5 months ago
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles☆35Updated last month
- Fetch message history from discord for LLMs☆15Updated 3 weeks ago