CentraleSupelec / aristote-dispatcherLinks
π Deployement of LLM at a large scale using VLLM server for inference
β26Updated this week
Alternatives and similar repositories for aristote-dispatcher
Users that are interested in aristote-dispatcher are comparing it to the libraries listed below
Sorting:
- Backend ressources for Albert. Albert is a conversational agent that uses official French data sources to answer administrative agents quβ¦β121Updated this week
- Tracking instruction-tuned LLM openness. Paper: Liesenfeld, Andreas, Alianda Lopez, and Mark Dingemanse. 2023. βOpening up ChatGPT: Trackβ¦β119Updated 5 months ago
- Deployment a light and full OpenAI API for production with vLLM to support /v1/embeddings with all embeddings models.β42Updated last year
- β137Updated last year
- β41Updated 3 months ago
- Granite 3.1 Language Modelsβ117Updated last month
- β16Updated last month
- π· Build compute kernelsβ93Updated this week
- πΉοΈ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.β137Updated last year
- Optimus is a flexible and scalable framework built to train language models efficiently across diverse hardware configurations, includingβ¦β66Updated last month
- NLP with Rust for Python π¦πβ64Updated 2 months ago
- ClearML Fractional GPU - Run multiple containers on the same GPU with driver level memory limitation β¨ and compute time-slicingβ79Updated last year
- AI Energy Score: Initiative to establish comparable energy efficiency ratings for AI models.β30Updated 4 months ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.β33Updated 3 months ago
- FRP Forkβ175Updated 4 months ago
- A browser extension (for now only Chrome) that estimates the environmental impact of your AI interactions.β13Updated last month
- β124Updated 9 months ago
- The official evaluation suite and dynamic data release for MixEval.β11Updated 10 months ago
- Voyage AI Official Python Libraryβ66Updated 2 weeks ago
- Tune efficiently any LLM model from HuggingFace using distributed training (multiple GPU) and DeepSpeed. Uses Ray AIR to orchestrate the β¦β59Updated 2 years ago
- A prompting libraryβ172Updated last month
- API de recherche et de consultation de la plateforme JUDILIBRE.β19Updated this week
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models accessβ¦β114Updated last year
- β63Updated 4 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Modelsβ111Updated 4 months ago
- ForestFlow is a policy-driven Machine Learning Model Server. It is an LF AI Foundation incubation project.