sambanova / tutorialsLinks
☆13Updated last year
Alternatives and similar repositories for tutorials
Users that are interested in tutorials are comparing it to the libraries listed below
Sorting:
- Intel Gaudi's Megatron DeepSpeed Large Language Models for training☆17Updated last year
- This repository contains the results and code for the MLPerf™ Training v4.0 benchmark.☆13Updated last year
- FMS Model Optimizer is a framework for developing reduced precision neural network models.☆20Updated 3 weeks ago
- ScalarLM - a unified training and inference stack☆96Updated 2 months ago
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆29Updated last week
- A framework for few-shot evaluation of autoregressive language models.☆12Updated 6 months ago
- Machine Learning Agility (MLAgility) benchmark and benchmarking tools☆40Updated 6 months ago
- IBM development fork of https://github.com/huggingface/text-generation-inference☆63Updated 4 months ago
- Training hybrid models for dummies.☆29Updated 3 months ago
- ☆15Updated 9 months ago
- Train, tune, and infer Bamba model☆138Updated 7 months ago
- Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task.☆15Updated last year
- AMD HPC Research Fund Cloud☆17Updated last week
- ☆15Updated 2 years ago
- A collection of all available inference solutions for the LLMs☆94Updated 11 months ago
- Adaptive Parallel PDF Parsing and Resource Scaling Engine☆62Updated last month
- LM engine is a library for pretraining/finetuning LLMs☆113Updated this week
- ☆20Updated this week
- 🚀 Collection of libraries used with fms-hf-tuning to accelerate fine-tuning and training of large models.☆13Updated last month
- Write a fast kernel and run it on Discord. See how you compare against the best!☆68Updated last week
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs☆93Updated this week
- Large Language Model Text Generation Inference on Habana Gaudi☆34Updated 10 months ago
- Estimating hardware and cloud costs of LLMs and transformer projects☆20Updated 2 weeks ago
- Conversational agents for engineering simulations with minimal human input using Microsoft AutoGen & GPT-4o.☆40Updated last year
- ☆20Updated last year
- Measuring Thinking Efficiency in Reasoning Models - Research Repository☆38Updated last month
- Aana SDK is a powerful framework for building AI enabled multimodal applications.☆55Updated 5 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆59Updated 3 months ago
- ☆60Updated this week
- Tasks and tutorials using Graphore's IPU with Hugging Face. Originally at https://github.com/gradient-ai/Graphcore-HuggingFace☆17Updated last year