HabanaAI / Gaudi-solutionsLinks
Full End-to-End examples showing how to use First-gen Gaudi and Gaudi2 in common use cases
☆12Updated 8 months ago
Alternatives and similar repositories for Gaudi-solutions
Users that are interested in Gaudi-solutions are comparing it to the libraries listed below
Sorting:
- Machine Learning Agility (MLAgility) benchmark and benchmarking tools☆39Updated last week
- SandLogic Lexicons☆19Updated last week
- Cray-LM unified training and inference stack.☆22Updated 6 months ago
- Build Agentic workflows with function calling using open LLMs☆28Updated this week
- Fine-tune an LLM to perform batch inference and online serving.☆112Updated 2 months ago
- Tutorials for running models on First-gen Gaudi and Gaudi2 for Training and Inference. The source files for the tutorials on https://dev…☆61Updated last week
- Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)☆191Updated this week
- ML/DL Math and Method notes☆62Updated last year
- The official evaluation suite and dynamic data release for MixEval.☆11Updated 10 months ago
- Find the optimal model serving solution for 🤗 Hugging Face models 🚀☆43Updated 2 weeks ago
- Repository containing awesome resources regarding Hugging Face tooling.☆47Updated last year
- PB-LLM: Partially Binarized Large Language Models☆153Updated last year
- IBM development fork of https://github.com/huggingface/text-generation-inference☆61Updated 2 months ago
- 🏋️ A unified multi-backend utility for benchmarking Transformers, Timm, PEFT, Diffusers and Sentence-Transformers with full support of O…☆307Updated 2 months ago
- Intel Gaudi's Megatron DeepSpeed Large Language Models for training☆13Updated 7 months ago
- MLFlow End to End Workshop at Chandigarh University☆11Updated 2 years ago
- Large Language Model Text Generation Inference on Habana Gaudi☆34Updated 4 months ago
- A collection of all available inference solutions for the LLMs☆91Updated 5 months ago
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs☆87Updated this week
- Multimodal AI workloads: batch inference, model training and online serving.☆22Updated 2 weeks ago
- Tutorial on how to convert machine learned models into ONNX☆16Updated 2 years ago
- Curated list of awesome material on optimization techniques to make artificial intelligence faster and more efficient 🚀☆118Updated last year
- 🤝 Trade any tensors over the network☆30Updated last year
- Notes on quantization in neural networks☆95Updated last year
- GenAI Studio is a low code platform to enable users to construct, evaluate, and benchmark GenAI applications. The platform also provide c…☆46Updated this week
- ☆14Updated 3 years ago
- A tool for benchmarking LLMs on Modal☆41Updated last week
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆33Updated 2 months ago
- Reference models for Intel(R) Gaudi(R) AI Accelerator☆167Updated last week
- Fun project: LLM powered RAG Discord Bot that works seamlessly on CPU☆32Updated last year