☆17Mar 4, 2026Updated this week
Alternatives and similar repositories for gaudi-pytorch-bridge
Users that are interested in gaudi-pytorch-bridge are comparing it to the libraries listed below
Sorting:
- A high-throughput and memory-efficient inference and serving engine for LLMs☆85Updated this week
- SYCL* Templates for Linear Algebra (SYCL*TLA) - SYCL based CUTLASS implementation for Intel GPUs☆68Updated this week
- Reference implementation of models from Nyonic Model Factory☆12May 13, 2024Updated last year
- This repo contains the official code release of the Neural Experts paper, published in NeurIPS 2024.☆14Dec 3, 2024Updated last year
- PM Workshop China☆10Apr 11, 2019Updated 6 years ago
- Official PyTorch implementation of CD-MOE☆12Mar 29, 2025Updated 11 months ago
- Residual vector quantization for KV cache compression in large language model☆11Oct 22, 2024Updated last year
- ☆11Nov 2, 2017Updated 8 years ago
- [ICLR 2025 SynthData Workshop Spotlight] Empowering LLMs in Decision Games through Algorithmic Data Synthesis☆27Apr 27, 2025Updated 10 months ago
- Building LaTeX packages using Travis-CI☆12Dec 21, 2019Updated 6 years ago
- Clustered Compositional Embeddings☆11Oct 25, 2023Updated 2 years ago
- Official implementation of CytoSAE: Interpretable Cell Embeddings for Hematology☆22Jul 17, 2025Updated 7 months ago
- [COLM 2025: 1st Workshop on the Application of LLM Explainability to Reasoning and Planning] Latent Chain-of-Thought? Decoding the Depth-…☆17Oct 4, 2025Updated 5 months ago
- ☆13Apr 30, 2024Updated last year
- oneAPI - Data Parallel C++ course for students☆44Nov 4, 2024Updated last year
- Code accompanying the paper "A contrastive rule for meta-learning"☆13Oct 31, 2024Updated last year
- ☆21Oct 22, 2025Updated 4 months ago
- Domain-Agnostic Supervised Learning with Hyperdimensional Computing☆13Jun 14, 2024Updated last year
- ☆19Jan 28, 2026Updated last month
- A PyTorch native platform for training generative AI models☆15Nov 18, 2025Updated 3 months ago
- Intel® AI for Enterprise Inference optimizes AI inference services on Intel hardware using Kubernetes Orchestration. It automates LLM mod…☆36Updated this week
- ☆10Oct 28, 2024Updated last year
- ☆36Jan 13, 2026Updated last month
- Our EMNLP 2022 paper on VIP-Based Prompting for Parameter-Efficient Learning☆10Oct 22, 2022Updated 3 years ago
- Benchmark Suite Invocation Scripting☆11Mar 16, 2022Updated 3 years ago
- X-ANFIS: An Extensible and Cross-Learning ANFIS Framework for Machine Learning Tasks☆17Jun 7, 2025Updated 9 months ago
- Transformer Doctor: Diagnosing and Treating Vision Transformers☆11Jan 15, 2025Updated last year
- CUDA C simple application for Nvidia's GPU☆11Jun 7, 2022Updated 3 years ago
- ☆15Updated this week
- Curl: Private LLMs through Wavelet-Encoded Look-Up Tables☆16Apr 7, 2025Updated 11 months ago
- Differentiable Clustering with Perturbed Random Forests, NeurIPS2023☆13Oct 16, 2023Updated 2 years ago
- ☆16Mar 1, 2025Updated last year
- ☆12Oct 1, 2024Updated last year
- ☆13Aug 19, 2024Updated last year
- SPDK fork of nvme-cli. No longer supported - use standard nvme-cli with SPDK nvme CUSE instead. See https://spdk.io/doc/nvme.html#nvme_…☆15Apr 10, 2024Updated last year
- helm charts for deploying models with llm-d☆29Updated this week
- ☆16Dec 9, 2023Updated 2 years ago
- ☆13Oct 29, 2021Updated 4 years ago
- A stateful serverless demo app running on AWS Lambda, using Apache Flink Stateful Functions☆15Oct 13, 2020Updated 5 years ago