withmartian / leaderboard-backend
Open sourced backend for Martian's LLM Inference Provider Leaderboard
☆18Updated 9 months ago
Alternatives and similar repositories for leaderboard-backend
Users who are interested in leaderboard-backend are comparing it to the repositories listed below.
- Index of URLs to pdf files all over the internet and scripts☆23Updated 2 years ago
- [EMNLP 2023 Industry Track] A simple prompting approach that enables the LLMs to run inference in batches.☆72Updated last year
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆23Updated last week
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆43Updated last year
- Code for NeurIPS LLM Efficiency Challenge☆58Updated last year
- ☆64Updated last year
- ☆28Updated 4 months ago
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…☆34Updated last year
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆33Updated 3 weeks ago
- Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO…☆55Updated this week
- Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"☆42Updated 6 months ago
- Large Scale Distributed Model Training strategy with Colossal AI and Lightning AI☆57Updated last year
- Model implementation for the contextual embeddings project☆26Updated this week
- Benchmark for machine learning model online serving (LLM, embedding, Stable-Diffusion, Whisper)☆28Updated last year
- Adversarial Training and SFT for Bot Safety Models☆40Updated 2 years ago
- Repository for CPU Kernel Generation for LLM Inference☆26Updated last year
- DPO, but faster 🚀☆42Updated 6 months ago
- Repository for Sparse Finetuning of LLMs via modified version of the MosaicML llmfoundry☆42Updated last year
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given…☆14Updated last year
- ☆27Updated 4 years ago
- [COLM 2024] Early Weight Averaging meets High Learning Rates for LLM Pre-training☆16Updated 7 months ago
- A dashboard for exploring timm learning rate schedulers☆19Updated 6 months ago
- Tools for content datamining and NLP at scale☆43Updated 11 months ago
- [SIGIR 2024 (Demo)] CoSearchAgent: A Lightweight Collaborative Search Agent with Large Language Models☆24Updated last year
- 🌏 Modular retrievers for zero-shot multilingual IR.☆27Updated last year
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆37Updated last year
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆48Updated last year
- The backend behind the LLM-Perf Leaderboard☆10Updated last year
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Updated last year