IlyasMoutawwakil / llm-perf-backend
The backend behind the LLM-Perf Leaderboard
β10Updated 6 months ago
Related projects β
Alternatives and complementary repositories for llm-perf-backend
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.β32Updated this week
- π€ Trade any tensors over the networkβ30Updated last year
- Machine Learning Serving focused on GenAI with simplicity as the top priority.β57Updated 4 months ago
- Streamlit app for recommending eval functions using prompt diffsβ25Updated 10 months ago
- β24Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β48Updated 4 months ago
- β22Updated last year
- β21Updated last week
- Check for data drift between two OpenAI multi-turn chat jsonl files.β36Updated 7 months ago
- Using modal.com to process FineWeb-edu dataβ19Updated 2 months ago
- Lite weight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created byβ¦β27Updated 2 months ago
- a pipeline for using api calls to agnostically convert unstructured data into structured training dataβ28Updated 2 months ago
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and teβ¦β42Updated 10 months ago
- LLM reads a paper and produce a working prototypeβ36Updated last week
- Code for NeurIPS LLM Efficiency Challengeβ54Updated 7 months ago
- C++ inference wrappers for running blazing fast embedding services on your favourite serverless like AWS Lambda. By Prithivi Da, PRs welcβ¦β20Updated 8 months ago
- experiments with inference on llamaβ105Updated 5 months ago
- Writing Blog Posts with Generative Feedback Loops!β43Updated 8 months ago
- β18Updated this week
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorerβ37Updated 7 months ago
- β41Updated 2 weeks ago
- Build Agentic workflows with function callingβ20Updated this week
- Public reports detailing responses to sets of prompts by Large Language Models.β26Updated last year
- Tools to make language models a bit easier to useβ30Updated this week
- Self-host LLMs with vLLM and BentoMLβ74Updated last week
- Repository containing awesome resources regarding Hugging Face tooling.β43Updated 10 months ago
- This repository contains code for cleaning your training data of benchmark data to help combat data snooping.β25Updated last year
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.β29Updated 6 months ago