shreyashankar / spade-experiments
Experiments to assess SPADE on different LLM pipelines.
☆16Updated 10 months ago
Alternatives and similar repositories for spade-experiments:
Users that are interested in spade-experiments are comparing it to the libraries listed below
- [NeurIPS 2024] Evaluation harness for SWT-Bench, a benchmark for evaluating LLM repository-level test-generation☆35Updated this week
- [EMNLP 2024 Main] Virtual Personas for Language Models via an Anthology of Backstories☆24Updated 3 months ago
- The repository contains code for Adaptive Data Optimization☆20Updated 2 months ago
- Code for the paper: CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models☆13Updated last month
- ☆9Updated last year
- ☆25Updated last month
- Official PyTorch implementation of "Neural Relation Graph: A Unified Framework for Identifying Label Noise and Outlier Data" (NeurIPS'23)☆15Updated last year
- ☆28Updated last year
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆40Updated 10 months ago
- Open sourced backend for Martian's LLM Inference Provider Leaderboard☆17Updated 6 months ago
- Implementation of Hyena Hierarchy in JAX☆10Updated last year
- Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs"☆21Updated 5 months ago
- Official code release for the paper Coder Reviewer Reranking for Code Generation.☆42Updated 2 years ago
- NeurIPS 2024 tutorial on LLM Inference☆39Updated 2 months ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Updated last year
- [SIGIR 2024 (Demo)] CoSearchAgent: A Lightweight Collborative Search Agent with Large Language Models☆22Updated last year
- ☆18Updated 3 weeks ago
- ☆26Updated last year
- ☆22Updated 3 months ago
- Efficient and Scalable Estimation of Tool Representations in Vector Space☆18Updated 5 months ago
- Astraios: Parameter-Efficient Instruction Tuning Code Language Models☆57Updated 10 months ago
- A testbed for agents and environments that can automatically improve models through data generation.☆18Updated 2 months ago
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆29Updated 5 months ago
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆48Updated 2 months ago
- Code repository for the paper - "AdANNS: A Framework for Adaptive Semantic Search"☆63Updated last year
- Repository for Skill Set Optimization☆12Updated 6 months ago
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆40Updated 8 months ago