HabanaAI / Gaudi-solutionsLinks
Full End-to-End examples showing how to use First-gen Gaudi and Gaudi2 in common use cases
☆13Updated last year
Alternatives and similar repositories for Gaudi-solutions
Users that are interested in Gaudi-solutions are comparing it to the libraries listed below
Sorting:
- Cray-LM unified training and inference stack.☆22Updated last year
- Fine-tune an LLM to perform batch inference and online serving.☆117Updated 8 months ago
- Machine Learning Agility (MLAgility) benchmark and benchmarking tools☆40Updated 5 months ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆32Updated 4 months ago
- Build Agentic workflows with function calling using open LLMs☆28Updated 3 weeks ago
- Reference models for Intel(R) Gaudi(R) AI Accelerator☆170Updated 3 weeks ago
- Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)☆205Updated this week
- Seemless interface of using PyTOrch distributed with Jupyter notebooks☆57Updated 4 months ago
- Find the optimal model serving solution for 🤗 Hugging Face models 🚀☆45Updated 6 months ago
- experiments with inference on llama☆103Updated last year
- IBM development fork of https://github.com/huggingface/text-generation-inference☆63Updated 4 months ago
- A collection of all available inference solutions for the LLMs☆94Updated 10 months ago
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs☆93Updated this week
- SandLogic Lexicons☆20Updated 4 months ago
- Curated list of awesome material on optimization techniques to make artificial intelligence faster and more efficient 🚀☆119Updated 2 years ago
- This code repository contains the code used for my "Optimizing Memory Usage for Training LLMs and Vision Transformers in PyTorch" blog po…☆92Updated 2 years ago
- A tool for benchmarking LLMs on Modal☆45Updated 5 months ago
- Experimentation on google's gemma model☆16Updated last year
- 🤝 Trade any tensors over the network☆30Updated 2 years ago
- ScalarLM - a unified training and inference stack☆96Updated 2 months ago
- A complete PyTorch implementation of Google's Gemma3 270M language model, featuring sliding window attention, RoPE positional encoding, a…☆44Updated 4 months ago
- Large Language Model Text Generation Inference on Habana Gaudi☆34Updated 10 months ago
- Code for NeurIPS LLM Efficiency Challenge☆60Updated last year
- Machine Learning Serving focused on GenAI with simplicity as the top priority.☆59Updated 3 weeks ago
- Repository containing awesome resources regarding Hugging Face tooling.☆48Updated 2 years ago
- ML/DL Math and Method notes☆66Updated 2 years ago
- Intel Gaudi's Megatron DeepSpeed Large Language Models for training☆17Updated last year
- ☆67Updated 10 months ago
- PB-LLM: Partially Binarized Large Language Models☆157Updated 2 years ago
- Tutorials for running models on First-gen Gaudi and Gaudi2 for Training and Inference. The source files for the tutorials on https://dev…☆64Updated 4 months ago