withmartian / leaderboard-backend
Open sourced backend for Martian's LLM Inference Provider Leaderboard
☆15Updated last month
Related projects: ⓘ
- Index of URLs to pdf files all over the internet and scripts☆20Updated last year
- ☆22Updated 3 months ago
- ☆18Updated this week
- 🌏 Modular retrievers for zero-shot multilingual IR.☆26Updated 6 months ago
- Repository for Sparse Finetuning of LLMs via modified version of the MosaicML llmfoundry☆36Updated 8 months ago
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.☆34Updated 3 weeks ago
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆26Updated last year
- [ACL 2024] RelayAttention for Efficient Large Language Model Serving with Long System Prompts☆29Updated 6 months ago
- ☆42Updated 3 weeks ago
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning☆54Updated last month
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Updated last year
- A place to store reusable transformer components of my own creation or found on the interwebs☆43Updated 3 weeks ago
- ☆45Updated 7 months ago
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given…☆14Updated 11 months ago
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆32Updated last year
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…☆33Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated 6 months ago
- Truly flash T5 realization!☆48Updated 4 months ago
- Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO…☆48Updated last week
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆42Updated 10 months ago
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning☆33Updated last year
- Make triton easier☆39Updated 3 months ago
- Visualize multi-model embedding spaces. The first goal is to quickly get a lay of the land of any embedding space. Then be able to scroll…☆24Updated 4 months ago
- Cascade Speculative Drafting☆23Updated 5 months ago
- [EMNLP 2023 Industry Track] A simple prompting approach that enables the LLMs to run inference in batches.☆65Updated 6 months ago
- Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"☆42Updated last week
- ☆12Updated last year
- ☆13Updated 3 months ago
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…☆22Updated 5 months ago
- ☆60Updated 5 months ago