HabanaAI / Gaudi-solutions
Full End-to-End examples showing how to use First-gen Gaudi and Gaudi2 in common use cases
☆12Updated 9 months ago
Alternatives and similar repositories for Gaudi-solutions
Users who are interested in Gaudi-solutions are comparing it to the libraries listed below
- Machine Learning Agility (MLAgility) benchmark and benchmarking tools☆39Updated last month
- Fine-tune an LLM to perform batch inference and online serving.☆112Updated 3 months ago
- Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)☆197Updated this week
- Build Agentic workflows with function calling using open LLMs☆28Updated 3 weeks ago
- ☆64Updated 5 months ago
- Cray-LM unified training and inference stack.☆22Updated 7 months ago
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs☆89Updated this week
- ☆14Updated 3 years ago
- IBM development fork of https://github.com/huggingface/text-generation-inference☆61Updated last week
- The official evaluation suite and dynamic data release for MixEval.☆11Updated last year
- Reference models for Intel(R) Gaudi(R) AI Accelerator☆166Updated this week
- Aana SDK is a powerful framework for building AI enabled multimodal applications.☆52Updated last month
- Seamless interface for using PyTorch distributed with Jupyter notebooks☆50Updated last week
- 🤝 Trade any tensors over the network☆30Updated 2 years ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆33Updated last week
- A tool for benchmarking LLMs on Modal☆43Updated 3 weeks ago
- Code for NeurIPS LLM Efficiency Challenge☆59Updated last year
- Tutorials for running models on First-gen Gaudi and Gaudi2 for Training and Inference. The source files for the tutorials on https://dev…☆61Updated last week
- A collection of all available inference solutions for the LLMs☆91Updated 6 months ago
- Streamline data pipelines for AI. Process datasets across 1000s of machines, and optimize data for blazing fast model training.☆12Updated last year
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te…☆42Updated last year
- ☆49Updated 7 months ago
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models access…☆114Updated last year
- Fun project: LLM powered RAG Discord Bot that works seamlessly on CPU☆32Updated last year
- Visualize expert firing frequencies across sentences in the Mixtral MoE model☆18Updated last year
- Tutorial to get started with SkyPilot!☆58Updated last year
- Benchmark suite for LLMs from Fireworks.ai☆83Updated this week
- ☆19Updated last month
- SandLogic Lexicons☆19Updated 2 weeks ago
- ML/DL Math and Method notes☆63Updated last year