leap-laboratories / PIZZA
An attribution library for LLMs
☆34Updated 2 months ago
Related projects ⓘ
Alternatives and complementary repositories for PIZZA
- ☆18Updated this week
- Using open source LLMs to build synthetic datasets for direct preference optimization☆40Updated 8 months ago
- NLP with Rust for Python 🦀🐍☆59Updated 5 months ago
- Tools to make language models a bit easier to use☆30Updated this week
- ☆48Updated last year
- Writing Blog Posts with Generative Feedback Loops!☆43Updated 8 months ago
- Using various instructor clients evaluating the quality and capabilities of extractions and reasoning.☆47Updated last month
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks.☆28Updated 9 months ago
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)☆17Updated 8 months ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆13Updated 8 months ago
- Small, simple agent task environments for training and evaluation☆16Updated 3 weeks ago
- Verbosity control for AI agents☆59Updated 5 months ago
- Explore the use of DSPy for extracting features from PDFs 🔎☆33Updated 8 months ago
- ☆24Updated last year
- ☆36Updated 3 months ago
- Chat Markup Language conversation library☆54Updated 10 months ago
- ☆41Updated 2 weeks ago
- PyTorch implementation for MRL☆18Updated 9 months ago
- Tool to apply Legal Matter Specification Standard (LMSS) to documents☆11Updated 3 months ago
- Using modal.com to process FineWeb-edu data☆19Updated 2 months ago
- Training code for Sparse Autoencoders on Embedding models☆33Updated 3 weeks ago
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments☆63Updated last month
- Codebase accompanying the Summary of a Haystack paper.☆72Updated 2 months ago
- 🔔🧠 Easily experiment with popular language agents across diverse reasoning/decision-making benchmarks!☆47Updated this week
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆77Updated 8 months ago
- Functional Benchmarks and the Reasoning Gap☆78Updated last month
- ☆75Updated 5 months ago
- Automatic Evals for Instruction-Tuned Models☆45Updated this week