HabanaAI / Gaudi-solutionsLinks
Full End-to-End examples showing how to use First-gen Gaudi and Gaudi2 in common use cases
☆12Updated 10 months ago
Alternatives and similar repositories for Gaudi-solutions
Users that are interested in Gaudi-solutions are comparing it to the libraries listed below
Sorting:
- Cray-LM unified training and inference stack.☆22Updated 8 months ago
- Machine Learning Agility (MLAgility) benchmark and benchmarking tools☆40Updated 2 months ago
- 🤝 Trade any tensors over the network☆30Updated 2 years ago
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs☆90Updated last week
- Build Agentic workflows with function calling using open LLMs☆28Updated last week
- IBM development fork of https://github.com/huggingface/text-generation-inference☆61Updated last month
- Iterate fast on your RAG pipelines☆23Updated 3 months ago
- Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)☆200Updated this week
- SandLogic Lexicons☆19Updated last month
- Fine-tune an LLM to perform batch inference and online serving.☆112Updated 4 months ago
- Find the optimal model serving solution for 🤗 Hugging Face models 🚀☆44Updated 2 months ago
- ML/DL Math and Method notes☆64Updated last year
- Reference models for Intel(R) Gaudi(R) AI Accelerator☆165Updated 3 weeks ago
- The official evaluation suite and dynamic data release for MixEval.☆11Updated last year
- Seemless interface of using PyTOrch distributed with Jupyter notebooks☆50Updated last month
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆76Updated 10 months ago
- Aana SDK is a powerful framework for building AI enabled multimodal applications.☆52Updated last month
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆33Updated last month
- Vector Database with support for late interaction and token level embeddings.☆55Updated 3 months ago
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed☆36Updated 2 years ago
- MLFlow End to End Workshop at Chandigarh University☆11Updated 2 years ago
- Code for NeurIPS LLM Efficiency Challenge☆59Updated last year
- Tutorials for running models on First-gen Gaudi and Gaudi2 for Training and Inference. The source files for the tutorials on https://dev…☆61Updated last month
- Tutorial on how to convert machine learned models into ONNX☆16Updated 2 years ago
- Machine Learning Serving focused on GenAI with simplicity as the top priority.☆58Updated last week
- Large Language Model Text Generation Inference on Habana Gaudi☆34Updated 6 months ago
- 🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.☆139Updated last year
- Mixtral finetuning☆19Updated last year
- ScalarLM - a unified training and inference stack☆85Updated 2 weeks ago
- ☆49Updated 8 months ago