honeyhiveai / realign
Realign is a testing and simulation framework for AI applications.
☆16Updated 4 months ago
Alternatives and similar repositories for realign:
Users that are interested in realign are comparing it to the libraries listed below
- One Line To Build Zero-Data Classifiers in Minutes☆53Updated 7 months ago
- Writing Blog Posts with Generative Feedback Loops!☆47Updated last year
- Analysis on the cost of encoder based models☆11Updated 2 months ago
- Sphynx Hallucination Induction☆53Updated 2 months ago
- Thorn in a HaizeStack test for evaluating long-context adversarial robustness.☆26Updated 8 months ago
- Pre-train Static Word Embeddings☆56Updated 2 weeks ago
- Streamlit app for recommending eval functions using prompt diffs☆27Updated last year
- Estimate costs of complex LLM workflows in advance before spending money☆10Updated 2 months ago
- A toolkit implementing advanced methods to transfer models and model knowledge across tokenizers.☆15Updated this week
- ☆20Updated last year
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆39Updated 2 months ago
- Python library to use Pleias-RAG models☆27Updated this week
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆30Updated 7 months ago
- LLMs as Collaboratively Edited Knowledge Bases☆45Updated last year
- Explore the use of DSPy for extracting features from PDFs 🔎☆39Updated last year
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.☆47Updated 7 months ago
- Vector Database with support for late interaction and token level embeddings.☆54Updated 6 months ago
- Experimental sampler to make LLMs more creative☆31Updated last year
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆21Updated 5 months ago
- ☆48Updated last year
- ☆41Updated 2 months ago
- 💻 An open-source vibe-coding platform today. The next generation IDE tomorrow.☆19Updated this week
- ☆19Updated 6 months ago
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆38Updated last year
- Tools for formatting large language model prompts.☆13Updated last year
- A library for red-teaming LLM applications with LLMs.☆26Updated 6 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 9 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆33Updated last week
- A subset of jailbreaks automatically discovered by the Haize Labs haizing suite.☆89Updated 2 weeks ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆67Updated 5 months ago