jalbrethsen / double-agentLinks
☆12Updated 3 months ago
Alternatives and similar repositories for double-agent
Users that are interested in double-agent are comparing it to the libraries listed below
Sorting:
- ☆30Updated last year
- Information Processing Evaluation for Large Language Models☆31Updated 3 weeks ago
- Pivotal Token Search☆131Updated 4 months ago
- A powerful document processing tool that uses Google's Gemini AI to generate high-quality Thai language summaries from PDF and EPUB files…☆24Updated 6 months ago
- Repository for CoSAI Workstream 4, Secure Design Patterns for Agentic Systems☆26Updated last month
- The DPAB-α Benchmark☆30Updated 10 months ago
- A Python package for zero-shot text anonymization using Transformer-based NER models.☆72Updated 4 months ago
- ~ streaming agents☆74Updated last week
- Use LLMs for document ranking☆158Updated 7 months ago
- Benchmark that evaluates LLMs using 759 NYT Connections puzzles extended with extra trick words☆157Updated this week
- Securely run AI-generated code in stateful sandboxes that run forever.☆224Updated 7 months ago
- Hierarchical topic segmentation of meeting transcripts using embeddings and divisive clustering.☆54Updated last year
- Benchmark evaluating LLMs on their ability to create and resist disinformation. Includes comprehensive testing across major models (Claud…☆31Updated 7 months ago
- ☆162Updated 7 months ago
- Experimental tools to backdoor large language models by re-writing their system prompts at a raw parameter level. This allows you to pote…☆190Updated last month
- Nyxelf is a highly effective tool tailored for analyzing malicious Linux ELF binaries, offering comprehensive support for both static and…☆121Updated 3 weeks ago
- ☆35Updated last year
- A web application that converts speech to speech 100% private☆80Updated 5 months ago
- Chat strategies for LLMs☆108Updated last year
- Detecting Inconsistencies in Feature or Function Evaluations of Requirements☆68Updated last year
- Merliot Device Hub☆166Updated 5 months ago
- A character-level language diffusion model trained on Tiny Shakespeare☆330Updated this week
- This project collects GPU benchmarks from various cloud providers and compares them to fixed per token costs. Use our tool for efficient …☆222Updated 11 months ago
- Live-bending a foundation model’s output at neural network level.☆270Updated 7 months ago
- Your appetite for code + Claude's capabilities = Limitless creation. No experience required - just pure hunger! 🧠⚡💻☆58Updated 4 months ago
- A Python framework for building AI agent systems with robust task management in the form of a graph execution engine, inference capabilit…☆31Updated 5 months ago
- Ultra-lightweight AI Agent☆417Updated 2 months ago
- Pragmatic framework to build LLM Copilots☆63Updated 8 months ago
- Repo for the testing-genai workshop☆13Updated 6 months ago
- Multi-Agent Step Race Benchmark: Assessing LLM Collaboration and Deception Under Pressure. A multi-player “step-race” that challenges LLM…☆76Updated 2 months ago