memryx / MemryX_eXamplesLinks
A collection of practical, end-to-end AI application examples accelerated by MemryX hardware and software solutions. This repository offers examples for real-time video inference, object detection, text generation, and more. Explore the code, contribute to the projects, and access detailed tutorials to maximize the potential of MemryX technolog…
☆83Updated this week
Alternatives and similar repositories for MemryX_eXamples
Users that are interested in MemryX_eXamples are comparing it to the libraries listed below
Sorting:
- Notes on quantization in neural networks☆117Updated 2 years ago
- A Plug-and-play Lightweight tool for the Inference Optimization of Deep Neural networks☆47Updated 3 months ago
- Tutorials for running models on First-gen Gaudi and Gaudi2 for Training and Inference. The source files for the tutorials on https://dev…☆64Updated 4 months ago
- ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization☆112Updated last year
- Full End-to-End examples showing how to use First-gen Gaudi and Gaudi2 in common use cases☆13Updated last year
- Some CUDA example code with READMEs.☆179Updated 2 months ago
- ☆95Updated this week
- QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX☆175Updated this week
- Converting a deep neural network to integer-only inference in native C via uniform quantization and the fixed-point representation.☆25Updated 4 years ago
- Pre-built components and code samples to help you build and deploy production-grade AI applications with the OpenVINO™ Toolkit from Intel☆201Updated last week
- First Open-Source Industry-Specific Model for Semiconductors☆393Updated 9 months ago
- RapidFire AI: Rapid AI Customization from RAG to Fine-Tuning☆138Updated this week
- Tiny ASIC implementation for "The Era of 1-bit LLMs All Large Language Models are in 1.58 Bits" matrix multiplication unit☆175Updated last year
- E2E AutoML Model Compression Package☆45Updated 10 months ago
- Machine Learning Agility (MLAgility) benchmark and benchmarking tools☆40Updated 6 months ago
- Qualcomm Cloud AI SDK (Platform and Apps) enable high performance deep learning inference on Qualcomm Cloud AI platforms delivering high …☆71Updated 2 months ago
- Code for HyperSeg and HyperSum☆16Updated 6 months ago
- Train, tune, and infer Bamba model☆137Updated 8 months ago
- 100 days of CUDA Challenge☆47Updated 6 months ago
- Model Compression Toolkit (MCT) is an open source project for neural network model optimization under efficient, constrained hardware. Th…☆432Updated this week
- Write a fast kernel and run it on Discord. See how you compare against the best!☆68Updated last week
- ☆24Updated last year
- This project is a native implementation of a RAG pipeline for Small Language Models tested on Android devices. The main goal was to fit t…☆100Updated last year
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆76Updated last year
- No-code CLI designed for accelerating ONNX workflows☆227Updated 7 months ago
- Code for paper "Analog Foundation Models"☆30Updated 4 months ago
- General Matrix Multiplication using NVIDIA Tensor Cores☆28Updated last year
- SandLogic Lexicons☆20Updated 4 months ago
- Lightweight Llama 3 8B Inference Engine in CUDA C☆53Updated 10 months ago
- Samples of good AI generated CUDA kernels☆99Updated 8 months ago