ModelCloud / Device-SMILinks
Self-contained Python lib with zero-dependencies that give you a unified device properties for gpu, cpu, and npu. No more calling separate tools such as nvidia-smi or /proc/cpuinfo and parsing it yourself.
☆12Updated last week
Alternatives and similar repositories for Device-SMI
Users that are interested in Device-SMI are comparing it to the libraries listed below
Sorting:
- ☆18Updated last year
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆59Updated 2 months ago
- DACVAE☆124Updated this week
- Google TPU optimizations for transformers models☆125Updated 10 months ago
- ☆14Updated 5 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆108Updated 9 months ago
- Luth is a state-of-the-art series of fine-tuned LLMs for French☆40Updated 2 months ago
- ☆138Updated 4 months ago
- ☆62Updated 5 months ago
- ☆68Updated last year
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆103Updated 11 months ago
- Kyutai with an "eye"☆230Updated 8 months ago
- High-throughput tensor loading for PyTorch☆211Updated 2 weeks ago
- GPT-4 Level Conversational QA Trained In a Few Hours☆66Updated last year
- Train LLM on Hugging Face infra☆67Updated last month
- Curriculum training of instruction-following LLMs with Unsloth☆14Updated this week
- ☆159Updated 8 months ago
- Lightweight continuous batching OpenAI compatibility using HuggingFace Transformers include T5 and Whisper.☆29Updated 9 months ago
- ☆117Updated last year
- vLLM adapter for a TGIS-compatible gRPC server.☆45Updated this week
- Utils for Unsloth https://github.com/unslothai/unsloth☆181Updated this week
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.☆88Updated last month
- Efficient non-uniform quantization with GPTQ for GGUF☆57Updated 3 months ago
- Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining.☆47Updated last month
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆78Updated last year
- ☆101Updated last year
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya☆123Updated 4 months ago
- Lego for GRPO☆30Updated 6 months ago
- ☆36Updated 4 months ago
- unsloth-5090-multiple☆60Updated 7 months ago