shreyashankar / spade-experimentsLinks
Experiments to assess SPADE on different LLM pipelines.
☆17Updated last year
Alternatives and similar repositories for spade-experiments
Users that are interested in spade-experiments are comparing it to the libraries listed below
Sorting:
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆44Updated last year
- Code for the paper: CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models☆28Updated 6 months ago
- Finding semantically meaningful and accurate prompts.☆48Updated last year
- ☆19Updated 2 months ago
- Mixing Language Models with Self-Verification and Meta-Verification☆109Updated 9 months ago
- ReLM is a Regular Expression engine for Language Models☆106Updated 2 years ago
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …☆41Updated last year
- ReBase: Training Task Experts through Retrieval Based Distillation☆29Updated 8 months ago
- ☆29Updated 2 months ago
- ☆55Updated 11 months ago
- ☆47Updated 6 months ago
- ☆78Updated 6 months ago
- Understanding the correlation between different LLM benchmarks☆29Updated last year
- Aioli: A unified optimization framework for language model data mixing☆27Updated 8 months ago
- Python package for generating datasets to evaluate reasoning and retrieval of large language models☆19Updated 2 weeks ago
- [FORGE 2025] Graph-based method for end-to-end code completion with context awareness on repository☆66Updated last year
- ☆40Updated 3 months ago
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆43Updated last week
- Advanced Reasoning Benchmark Dataset for LLMs☆46Updated last year
- [EMNLP 2024 Main] Virtual Personas for Language Models via an Anthology of Backstories☆31Updated 10 months ago
- ☆45Updated 2 months ago
- [NeurIPS 2024] Evaluation harness for SWT-Bench, a benchmark for evaluating LLM repository-level test-generation☆56Updated 3 weeks ago
- A repository for research on medium sized language models.☆78Updated last year
- Lottery Ticket Adaptation☆40Updated 10 months ago
- ☆26Updated last year
- Entailment self-training☆25Updated 2 years ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆34Updated 5 months ago
- Code repository for the paper - "AdANNS: A Framework for Adaptive Semantic Search"☆65Updated 2 years ago
- ☆28Updated 6 months ago
- ☆23Updated 2 years ago