shreyashankar / spade-experimentsLinks
Experiments to assess SPADE on different LLM pipelines.
☆17Updated last year
Alternatives and similar repositories for spade-experiments
Users that are interested in spade-experiments are comparing it to the libraries listed below
Sorting:
- Code for the paper: CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models☆24Updated 3 months ago
- ☆29Updated 2 years ago
- [EMNLP 2024 Main] Virtual Personas for Language Models via an Anthology of Backstories☆29Updated 7 months ago
- Finding semantically meaningful and accurate prompts.☆47Updated last year
- Lottery Ticket Adaptation☆39Updated 7 months ago
- [NeurIPS 2024] Evaluation harness for SWT-Bench, a benchmark for evaluating LLM repository-level test-generation☆51Updated last month
- ☆23Updated 2 years ago
- ☆22Updated last month
- ☆18Updated last month
- Code repository for the paper - "AdANNS: A Framework for Adaptive Semantic Search"☆65Updated last year
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆43Updated last year
- Code accompanying the paper "A Language Model's Guide Through Latent Space". It contains functionality for training and using concept vec…☆21Updated last year
- A library for squeakily cleaning and filtering language datasets.☆47Updated 2 years ago
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆30Updated 9 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆32Updated 3 months ago
- Measuring and Controlling Persona Drift in Language Model Dialogs☆17Updated last year
- ☆27Updated 2 weeks ago
- Aioli: A unified optimization framework for language model data mixing☆27Updated 6 months ago
- Python package for generating datasets to evaluate reasoning and retrieval of large language models☆18Updated 2 weeks ago
- ☆19Updated last week
- Understanding the correlation between different LLM benchmarks☆29Updated last year
- The repository contains generative AI analytics platform application code.☆26Updated 2 months ago
- This repo contains code for the paper: "Can Foundation Models Help Us Achieve Perfect Secrecy?"☆24Updated 2 years ago
- ☆16Updated last week
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆43Updated last year
- Astraios: Parameter-Efficient Instruction Tuning Code Language Models☆58Updated last year
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta☆13Updated 8 months ago
- ☆30Updated 8 months ago
- Advanced Reasoning Benchmark Dataset for LLMs☆47Updated last year
- Entailment self-training☆25Updated 2 years ago