CentraleSupelec / aristote-dispatcher
🔀 Deployment of LLMs at large scale using a vLLM server for inference
☆28Updated last month
Alternatives and similar repositories for aristote-dispatcher
Users who are interested in aristote-dispatcher are comparing it to the libraries listed below.
- Backend resources for Albert. Albert is a conversational agent that uses official French data sources to answer administrative agents qu…☆121Updated 5 months ago
- Blindly query two conversational language models on tasks expressed in French and compare the results.☆59Updated this week
- Tracking instruction-tuned LLM openness. Paper: Liesenfeld, Andreas, Alianda Lopez, and Mark Dingemanse. 2023. “Opening up ChatGPT: Track…☆119Updated 10 months ago
- ClearML Fractional GPU - Run multiple containers on the same GPU with driver level memory limitation ✨ and compute time-slicing☆88Updated 2 months ago
- ☆18Updated 6 months ago
- 🌱 EcoLogits tracks the energy consumption and environmental footprint of using generative AI models through APIs.☆248Updated last week
- This project defines a JSON ontology standard describing a power consumption measure in a given software/hardware context, notably in …☆15Updated 2 months ago
- An open-source, unified interface for running and managing self-hosted LLMs.☆119Updated last week
- Fully local, private and cross platform Speech-to-Text with LLM Post-processing☆329Updated this week
- Deploy a light and full OpenAI API for production with vLLM, supporting /v1/embeddings with all embedding models.☆44Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆32Updated 4 months ago
- ☆28Updated 10 months ago
- ☆140Updated last year
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models access…☆114Updated 2 years ago
- Bot for Tchap (the messaging app of the French State) using Albert, the French administration Artificial Intelligence agent☆15Updated last year
- Checks regexes for overlaps. Based on greenery by @qntm.☆56Updated last year
- 🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.☆138Updated last year
- Data models for Hugging Face tokenizers☆86Updated 2 weeks ago
- Experiments with inference on Llama☆103Updated last year
- Self-host LLMs with vLLM and BentoML☆167Updated last week
- ☆19Updated 4 years ago
- A curated list of awesome Green AI resources and tools to assess and reduce the environmental impacts of using and deploying AI.☆102Updated last month
- Experimental wasm32-unknown-wasi runtime for Python code execution☆40Updated last year
- AI Energy Score: Initiative to establish comparable energy efficiency ratings for AI models.☆35Updated last month
- Question Answering annotation platform☆90Updated last year
- Demo of the Design System de l'État (unofficial resource)☆34Updated 3 years ago
- 🚂 Fine-tune OpenAI models for text classification, question answering, and more☆17Updated 2 years ago
- NLP with Rust for Python 🦀🐍☆70Updated 8 months ago
- Model implementation for the contextual embeddings project☆40Updated 7 months ago
- Command Line Interface for Hugging Face Inference Endpoints☆65Updated last year