intel / Enterprise-RAG
Intel® AI for Enterprise RAG converts enterprise data into actionable insights with excellent TCO. It uses Intel Gaudi AI accelerators and Intel Xeon processors to streamline deployment.
☆19 Updated 11 months ago
Alternatives and similar repositories for Enterprise-RAG
Users who are interested in Enterprise-RAG are comparing it to the libraries listed below.
- A dynamic binary instrumentation tool for tracing and analyzing CUDA kernel instructions. ☆27 Updated last week
- AI Tensor Engine for ROCm ☆348 Updated this week
- Profiling Tools Interfaces for GPU (PTI for GPU) is a set of Getting Started Documentation and Tools Library to start performance analysi… ☆258 Updated 2 weeks ago
- ☆281 Updated last week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo. NOTE: develop branch is maintained as a read-only mirror ☆518 Updated this week
- Large Language Model Text Generation Inference on Habana Gaudi ☆34 Updated 10 months ago
- An Awesome list of oneAPI projects ☆158 Updated 5 months ago
- amdgpu example code in hip/asm ☆54 Updated this week
- SYCL* Templates for Linear Algebra (SYCL*TLA) - SYCL based CUTLASS implementation for Intel GPUs ☆66 Updated last week
- ☆61 Updated last year
- CUDA Kernel Benchmarking Library ☆806 Updated last week
- ☆53 Updated last week
- ☆137 Updated last week
- oneAPI Collective Communications Library (oneCCL) ☆254 Updated last week
- [DEPRECATED] Moved to ROCm/rocm-systems repo ☆410 Updated last week
- super repo for rocm systems projects ☆230 Updated this week
- oneAPI Level Zero Specification Headers and Loader ☆303 Updated this week
- ☆95 Updated last month
- A tool for generating information about the matrix multiplication instructions in AMD Radeon™ and AMD Instinct™ accelerators ☆125 Updated 2 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆85 Updated this week
- OpenAI Triton backend for Intel® GPUs ☆226 Updated this week
- Training material for Nsight developer tools ☆178 Updated last year
- OpenVINO Intel NPU Compiler ☆81 Updated last week
- CUDA/Metal accelerated language model inference ☆626 Updated 8 months ago
- Super fast FP32 matrix multiplication on RDNA3 ☆82 Updated 10 months ago
- Fast and Furious AMD Kernels ☆348 Updated 2 weeks ago
- SYCL implementation of Fused MLPs for Intel GPUs ☆51 Updated 2 months ago
- A profiler to disclose and quantify hardware features on GPUs. ☆175 Updated 3 years ago
- ☆20 Updated last week
- Derived from Nemes' gpuperftests ☆33 Updated last year