CentraleSupelec / aristote-dispatcher
🔀 Deployment of LLMs at large scale using a vLLM server for inference
☆27Updated last month
Alternatives and similar repositories for aristote-dispatcher
Users who are interested in aristote-dispatcher are comparing it to the libraries listed below.
- Blindly query two conversational language models on tasks expressed in French and compare the results.☆52Updated this week
- Backend resources for Albert. Albert is a conversational agent that uses official French data sources to answer administrative agents qu…☆120Updated 3 months ago
- Tracking instruction-tuned LLM openness. Paper: Liesenfeld, Andreas, Alianda Lopez, and Mark Dingemanse. 2023. “Opening up ChatGPT: Track…☆119Updated 8 months ago
- Optimus is a flexible and scalable framework built to train language models efficiently across diverse hardware configurations, including…☆67Updated 4 months ago
- Search and consultation API for the JUDILIBRE platform.☆22Updated last month
- NLP with Rust for Python 🦀🐍☆66Updated 6 months ago
- 🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.☆139Updated last year
- 🌱 EcoLogits tracks the energy consumption and environmental footprint of using generative AI models through APIs.☆228Updated last week
- Deploy a light and full OpenAI API for production with vLLM to support /v1/embeddings with all embedding models.☆44Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆33Updated 2 months ago
- Temporary repo to split the pseudo deliverable☆17Updated 5 years ago
- experiments with inference on llama☆103Updated last year
- Model implementation for the contextual embeddings project☆36Updated 5 months ago
- Sentence Embedding as a Service☆15Updated 4 months ago
- FRP Fork☆176Updated 7 months ago
- Datamodels for hugging face tokenizers☆86Updated this week
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.☆88Updated this week
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models access…☆114Updated last year
- ☆64Updated 7 months ago
- AI Energy Score: Initiative to establish comparable energy efficiency ratings for AI models.☆32Updated last week
- A repository of instructions in French to fine-tune LLMs☆17Updated 2 years ago
- ☆124Updated last year
- ☆138Updated 3 months ago
- ☆39Updated 3 years ago
- ☆43Updated last month
- ☆79Updated 2 weeks ago
- Deploy and Scale LLM-based applications☆26Updated 2 years ago
- ☆58Updated this week
- Granite 3.1 Language Models☆131Updated 4 months ago
- Repository containing awesome resources regarding Hugging Face tooling.☆48Updated last year