honeyhiveai / realign
Realign is an evaluation and experimentation framework for AI applications.
☆12Updated last month
Related projects
Alternatives and complementary repositories for realign
- Thorn in a HaizeStack test for evaluating long-context adversarial robustness.☆26Updated 3 months ago
- Small, simple agent task environments for training and evaluation☆16Updated 3 weeks ago
- Fast-track AI apps to production with LLaMA 3, Mistral, and other top LLMs!☆17Updated 4 months ago
- Benchmark evaluating LLMs on their ability to create and resist disinformation. Includes comprehensive testing across major models (Claud…☆14Updated last month
- Efficient Dictionary Learning with Switch Sparse Autoencoders (SAEs)☆12Updated last month
- A collection of pre-built wrappers over common RAG systems like ChromaDB, Weaviate, Pinecone, and others!☆20Updated last week
- ☆37Updated this week
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆41Updated last month
- PII Masker is an open-source tool for protecting sensitive data by automatically detecting and masking PII using advanced AI, powered by …☆41Updated this week
- Streamlit app for recommending eval functions using prompt diffs☆25Updated 10 months ago
- A visual tool to interpret and understand PyTorch machine learning models☆15Updated 9 months ago
- A data-centric AI package for ML/AI. Get the best high-quality data for the best results. Discord: https://discord.gg/t6ADqBKrdZ☆63Updated last year
- Transform unstructured documents into actionable, structured data with enterprise-grade precision and reliability, ready for large-scale …☆11Updated last week
- Explore the use of DSPy for extracting features from PDFs 🔎☆33Updated 8 months ago
- ☆18Updated this week
- ☆26Updated 8 months ago
- A subset of jailbreaks automatically discovered by the Haize Labs haizing suite.☆86Updated 5 months ago
- A repository re-creating the PromptBreeder Evolutionary Algorithm from the DeepMind Paper in Python using LMQL as the backend.☆27Updated last year
- A Python library to orchestrate LLMs in a neural network-inspired structure☆41Updated last month
- ☆25Updated 2 months ago
- Writing Blog Posts with Generative Feedback Loops!☆43Updated 8 months ago
- ☆38Updated 4 months ago
- ☆20Updated 9 months ago
- AI Evaluation Platform☆45Updated 2 weeks ago
- This repository implements DSPy programs for tasks in Indian languages☆11Updated 10 months ago
- Accompanying code and SEP dataset for the "Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?" paper.☆44Updated 5 months ago
- ☆11Updated last month
- ☆36Updated 3 months ago
- LLM reads a paper and produces a working prototype☆36Updated last week
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆20Updated 9 months ago