philschmid / text-generation-inference-tests
☆20Updated 8 months ago
Related projects: ⓘ
- ☆75Updated 3 weeks ago
- Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs☆128Updated this week
- Writing Blog Posts with Generative Feedback Loops!☆41Updated 6 months ago
- Just a bunch of benchmark logs for different LLMs☆112Updated last month
- ☆71Updated 3 months ago
- ☆31Updated 2 months ago
- Lite weight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by…☆27Updated 3 weeks ago
- ☆43Updated 3 weeks ago
- ☆48Updated 11 months ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆81Updated last year
- Self-host LLMs with vLLM and BentoML☆62Updated this week
- ☆24Updated last year
- ☆15Updated 3 months ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆73Updated 6 months ago
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models access…☆113Updated 7 months ago
- Voyage AI Official Python Library☆37Updated 3 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆93Updated 5 months ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆99Updated 8 months ago
- Check for data drift between two OpenAI multi-turn chat jsonl files.☆33Updated 5 months ago
- Vector Database with support for late interaction and token level embeddings.☆51Updated last week
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆48Updated 2 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆58Updated 2 weeks ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆67Updated 2 months ago
- ☆64Updated 3 months ago
- Tutorial for building LLM router☆145Updated 2 months ago
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆26Updated last year
- This is the repo for the container that holds the models for the text2vec-transformers module☆38Updated 3 weeks ago
- experiments with inference on llama☆106Updated 3 months ago
- A framework for evaluating function calls made by LLMs☆34Updated last month
- Google TPU optimizations for transformers models☆62Updated this week