egozverev / Should-It-Be-Executed-Or-Processed
Accompanying code and SEP dataset for the "Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?" paper.
☆46Updated 7 months ago
Alternatives and similar repositories for Should-It-Be-Executed-Or-Processed:
Users that are interested in Should-It-Be-Executed-Or-Processed are comparing it to the libraries listed below
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles☆20Updated 2 weeks ago
- Using modal.com to process FineWeb-edu data☆19Updated last month
- Thorn in a HaizeStack test for evaluating long-context adversarial robustness.☆26Updated 5 months ago
- utilities for loading and running text embeddings with onnx☆42Updated 5 months ago
- ☆46Updated 2 months ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆23Updated 2 months ago
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆47Updated last month
- LLMs as Collaboratively Edited Knowledge Bases☆43Updated 10 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆24Updated 10 months ago
- Functional Benchmarks and the Reasoning Gap☆82Updated 3 months ago
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks.☆31Updated 11 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆62Updated 2 months ago
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."☆63Updated last year
- Data preparation code for CrystalCoder 7B LLM☆43Updated 8 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆74Updated 2 months ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆99Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimization☆49Updated 10 months ago
- ☆56Updated 3 months ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆79Updated 10 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆21Updated last month
- ☆20Updated 2 months ago
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆68Updated last month
- ☆48Updated last year
- One Line To Build Zero-Data Classifiers in Minutes☆33Updated 3 months ago
- Routing on Random Forest (RoRF)☆98Updated 3 months ago
- ☆67Updated 5 months ago
- Very minimal (and stateless) agent framework☆41Updated this week
- Chat Markup Language conversation library☆55Updated last year
- Lightweight tools for quick and easy LLM demo's☆26Updated 3 months ago