shreyashankar / spade-experiments
Experiments to assess SPADE on different LLM pipelines.
☆16Updated 7 months ago
Related projects ⓘ
Alternatives and complementary repositories for spade-experiments
- Compression for Foundation Models☆19Updated 2 weeks ago
- [EMNLP 2024 Main] Virtual Personas for Language Models via an Anthology of Backstories☆14Updated 4 months ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Updated last year
- Repository for Sparse Finetuning of LLMs via modified version of the MosaicML llmfoundry☆38Updated 9 months ago
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System☆91Updated 4 months ago
- Implementation of Hyena Hierarchy in JAX☆10Updated last year
- ☆28Updated last year
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …☆36Updated last year
- ☆30Updated last month
- This is the repository holding code and data for "FrugalML: How to Use ML Prediction APIs More Accurately and Cheaply".☆31Updated 3 years ago
- ☆39Updated 9 months ago
- Efficient Dictionary Learning with Switch Sparse Autoencoders (SAEs)☆13Updated 3 weeks ago
- ☆9Updated 10 months ago
- ☆21Updated 11 months ago
- ☆35Updated last week
- Minimum Description Length probing for neural network representations☆16Updated last week
- Code repository for the public reproduction of the language modelling experiments on "MatFormer: Nested Transformer for Elastic Inference…☆18Updated 11 months ago
- AskIt: Unified programming interface for programming with LLMs (GPT-3.5, GPT-4, Gemini, Claude, Cohere, Llama 2)☆75Updated 4 months ago
- Understanding the correlation between different LLM benchmarks☆29Updated 9 months ago
- Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs"☆18Updated 2 months ago
- The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models"☆56Updated 3 weeks ago
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆38Updated 2 weeks ago
- ☆19Updated last year
- r2e: turn any github repository into a programming agent environment☆87Updated last week
- The official repo for "LLoCo: Learning Long Contexts Offline"☆110Updated 4 months ago
- ☆41Updated this week
- ☆26Updated 4 months ago
- ☆38Updated this week
- ☆14Updated this week
- JORA: JAX Tensor-Parallel LoRA Library (ACL 2024)☆29Updated 6 months ago