withmartian / leaderboard-backend
Open sourced backend for Martian's LLM Inference Provider Leaderboard
☆17Updated 8 months ago
Alternatives and similar repositories for leaderboard-backend:
Users that are interested in leaderboard-backend are comparing it to the libraries listed below
- A collection of reproducible inference engine benchmarks☆24Updated this week
- [EMNLP 2023 Industry Track] A simple prompting approach that enables the LLMs to run inference in batches.☆73Updated last year
- Repository for Sparse Finetuning of LLMs via modified version of the MosaicML llmfoundry☆40Updated last year
- ☆47Updated 7 months ago
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Updated last year
- Index of URLs to pdf files all over the internet and scripts☆23Updated last year
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…☆34Updated last year
- Tools for content datamining and NLP at scale☆43Updated 10 months ago
- Repository for CPU Kernel Generation for LLM Inference☆26Updated last year
- Simple repository for training small reasoning models☆12Updated 2 months ago
- Adversarial Training and SFT for Bot Safety Models☆39Updated 2 years ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Updated last year
- DPO, but faster 🚀☆40Updated 4 months ago
- Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO…☆55Updated this week
- ☆49Updated last year
- ☆64Updated last year
- ☆27Updated last month
- Make triton easier☆47Updated 10 months ago
- Code for NeurIPS LLM Efficiency Challenge☆57Updated last year
- ☆33Updated 10 months ago
- A place to store reusable transformer components of my own creation or found on the interwebs☆49Updated last week
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆42Updated last year
- Repository for Skill Set Optimization☆12Updated 8 months ago
- ☆14Updated 6 months ago
- ☆28Updated 5 months ago
- Advanced Reasoning Benchmark Dataset for LLMs☆45Updated last year
- ☆28Updated last year
- Astraios: Parameter-Efficient Instruction Tuning Code Language Models☆57Updated last year
- ☆38Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆34Updated 4 months ago