shreyashankar / spade-experiments
Experiments to assess SPADE on different LLM pipelines.
☆17 Updated last year
Alternatives and similar repositories for spade-experiments
Users who are interested in spade-experiments are comparing it to the repositories listed below.
- ☆18 Updated 3 weeks ago
- Official Repo for InSTA: Towards Internet-Scale Training For Agents ☆42 Updated this week
- Code for the paper: CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models ☆22 Updated 2 months ago
- [EMNLP 2024 Main] Virtual Personas for Language Models via an Anthology of Backstories ☆28 Updated 7 months ago
- Finding semantically meaningful and accurate prompts. ☆46 Updated last year
- ☆29 Updated 2 years ago
- [NeurIPS 2024] Evaluation harness for SWT-Bench, a benchmark for evaluating LLM repository-level test-generation ☆50 Updated 3 weeks ago
- Efficient and Scalable Estimation of Tool Representations in Vector Space ☆23 Updated 9 months ago
- Python package for generating datasets to evaluate reasoning and retrieval of large language models ☆18 Updated this week
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification ☆11 Updated last year
- ☆27 Updated this week
- ☆40 Updated 11 months ago
- ☆23 Updated last year
- ☆27 Updated 5 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔ ☆32 Updated 2 months ago
- Learning to Retrieve by Trying - Source code for Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval ☆39 Updated 7 months ago
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer ☆43 Updated last year
- Aioli: A unified optimization framework for language model data mixing ☆27 Updated 5 months ago
- ☆34 Updated this week
- ☆15 Updated 2 months ago
- Understanding the correlation between different LLM benchmarks ☆29 Updated last year
- Implementation of Hyena Hierarchy in JAX ☆10 Updated 2 years ago
- ☆26 Updated 2 months ago
- Compression for Foundation Models ☆32 Updated 3 months ago
- Training and Benchmarking LLMs for Code Preference. ☆33 Updated 7 months ago
- [ICLR'25] "Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers" ☆23 Updated 2 months ago
- Self-host LLMs with LMDeploy and BentoML ☆20 Updated 2 weeks ago
- The repository contains generative AI analytics platform application code. ☆26 Updated last month
- ☆51 Updated 7 months ago
- ☆14 Updated last month