CentraleSupelec / aristote-dispatcher
🔀 Deployment of LLMs at large scale using a vLLM server for inference
☆26Updated 3 weeks ago
Alternatives and similar repositories for aristote-dispatcher
Users that are interested in aristote-dispatcher are comparing it to the libraries listed below
- Blindly query two conversational language models on tasks expressed in French and compare the results.☆45Updated this week
 - Backend resources for Albert. Albert is a conversational agent that uses official French data sources to answer administrative agents qu…☆120Updated 2 months ago
 - Tracking instruction-tuned LLM openness. Paper: Liesenfeld, Andreas, Alianda Lopez, and Mark Dingemanse. 2023. “Opening up ChatGPT: Track…☆119Updated 7 months ago
 - Observatoire des Médias sur l'Ecologie (Media Observatory on Ecology)☆35Updated this week
 - 🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.☆139Updated last year
 - ClearML Fractional GPU - Run multiple containers on the same GPU with driver level memory limitation ✨ and compute time-slicing☆82Updated last year
 - Deploy a light and full OpenAI API for production with vLLM to support /v1/embeddings with all embedding models.☆44Updated last year
 - AI Energy Score: Initiative to establish comparable energy efficiency ratings for AI models.☆31Updated 3 weeks ago
 - A toolkit for exhaustively modeling the environmental impact of digital services.☆13Updated this week
 - Optimus is a flexible and scalable framework built to train language models efficiently across diverse hardware configurations, including…☆68Updated 4 months ago
 - ☆139Updated last year
 - This project defines a JSON ontology standard describing a power consumption measure in a given software/hardware context, notably in …☆14Updated 2 weeks ago
 - Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆111Updated 6 months ago
 - A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆33Updated last month
 - Inference engine for GLiNER models, in Rust☆75Updated last week
 - ☆102Updated last year
 - TitanML Takeoff Server is an optimization, compression and deployment platform that makes state-of-the-art machine learning models access…☆114Updated last year
 - Self-host LLMs with vLLM and BentoML☆153Updated this week
 - Your buddy in the (L)LM space.☆64Updated last year
 - NLP with Rust for Python 🦀🐍☆65Updated 5 months ago
 - A curated list of awesome Green AI resources and tools to assess and reduce the environmental impacts of using and deploying AI.☆91Updated 3 weeks ago
 - The official evaluation suite and dynamic data release for MixEval.☆11Updated last year
 - IBM development fork of https://github.com/huggingface/text-generation-inference☆61Updated last month
 - ☆124Updated last year
 - Granite 3.1 Language Models☆127Updated 4 months ago
 - Repository containing awesome resources regarding Hugging Face tooling.☆48Updated last year
 - 🌱 EcoLogits tracks the energy consumption and environmental footprint of using generative AI models through APIs.☆225Updated last week
 - ☆64Updated 7 months ago
 - ☆16Updated 4 months ago
 - experiments with inference on llama☆103Updated last year