CentraleSupelec / aristote-dispatcherLinks
🔀 Deployement of LLM at a large scale using VLLM server for inference
☆25Updated 2 weeks ago
Alternatives and similar repositories for aristote-dispatcher
Users that are interested in aristote-dispatcher are comparing it to the libraries listed below
Sorting:
- Interroger à l'aveugle deux modèles de langage conversationnels sur des tâches exprimées en français et comparer les résultats.☆41Updated this week
- Backend ressources for Albert. Albert is a conversational agent that uses official French data sources to answer administrative agents qu…☆120Updated 3 weeks ago
- Optimus is a flexible and scalable framework built to train language models efficiently across diverse hardware configurations, including…☆66Updated 2 months ago
- Deployment a light and full OpenAI API for production with vLLM to support /v1/embeddings with all embeddings models.☆42Updated last year
- NLP with Rust for Python 🦀🐍☆64Updated 3 months ago
- 🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.☆137Updated last year
- PyLate efficient inference engine☆64Updated last month
- Voyage AI Official Python Library☆71Updated last month
- Your buddy in the (L)LM space.☆64Updated 11 months ago
- Small python package to measure OCR quality and other related metrics.☆25Updated last year
- ☆138Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆33Updated 3 months ago
- Tracking instruction-tuned LLM openness. Paper: Liesenfeld, Andreas, Alianda Lopez, and Mark Dingemanse. 2023. “Opening up ChatGPT: Track…☆119Updated 5 months ago
- Experimental wasm32-unknown-wasi runtime for Python code execution☆37Updated 9 months ago
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K …☆83Updated 8 months ago
- ☆58Updated 3 months ago
- Granite 3.1 Language Models☆120Updated 2 months ago
- IBM development fork of https://github.com/huggingface/text-generation-inference☆61Updated 3 months ago
- Python library to use Pleias-RAG models☆61Updated 4 months ago
- Model implementation for the contextual embeddings project☆35Updated 3 months ago
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models access…☆114Updated last year
- 🤗 Optimum ONNX: Export your model to ONNX and run inference with ONNX Runtime☆38Updated this week
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆67Updated 10 months ago
- Pre-train Static Word Embeddings☆85Updated this week
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆111Updated 4 months ago
- ☆42Updated 4 months ago
- JavaScript library to explain any machine learning models anywhere!☆65Updated 2 years ago
- Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research☆223Updated this week
- API de recherche et de consultation de la plateforme JUDILIBRE.☆21Updated this week
- ☆35Updated last week