lechmazur / deception
Benchmark evaluating LLMs on their ability to create and resist disinformation. Includes comprehensive testing across major models (Claude, GPT-4, Gemini, Llama, etc.) with standardized evaluation metrics.
☆13Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for deception
- ☆16Updated last month
- The original BabyAGI, updated with LiteLLM and no vector database reliance (csv instead)☆19Updated last month
- ☆40Updated 6 months ago
- A Python library to orchestrate LLMs in a neural network-inspired structure☆41Updated last month
- An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace.☆21Updated 4 months ago
- Using multiple LLMs for ensemble Forecasting☆16Updated 9 months ago
- Evolutionary Search for expert-level performance on any task with environmental feedback☆14Updated 9 months ago
- Using modal.com to process FineWeb-edu data☆19Updated 2 months ago
- Deploy your autonomous agents to production grade environments with 99% Uptime Guarantee, Infinite Scalability, and self-healing.☆27Updated this week
- look how they massacred my boy☆54Updated 3 weeks ago
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1…☆14Updated 9 months ago
- Build Web Datasets with Ease☆33Updated 4 months ago
- ☆31Updated 2 weeks ago
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.☆22Updated last month
- The next evolution of Agents☆45Updated this week
- never forget anything again! combine AI and intelligent tooling for a local knowledge base to track catalogue, annotate, and plan for you…☆32Updated 6 months ago
- Hallucinations (Confabulations) Document-Based Benchmark for RAG☆49Updated last week
- Tcurtsni: Reverse Instruction Chat, ever wonder what your LLM wants to ask you?☆20Updated 4 months ago
- Branch Out Your Conversations☆22Updated 2 weeks ago
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆55Updated last week
- Routing on Random Forest (RoRF)☆83Updated last month
- ☆46Updated 7 months ago
- ☆48Updated 5 months ago
- Structured outputs from DSPy and Jinja2☆14Updated last week
- Don't bug your friends with articles they'll never read. AI's have infinite attention, leverage them instead! Use the curation buddy to e…☆22Updated 6 months ago
- ☆31Updated last year
- utilities for loading and running text embeddings with onnx☆39Updated 3 months ago
- A simple library for working with Hugging Face models.☆15Updated 2 months ago
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.☆60Updated 5 months ago