shreshthkapai / cuda_latency_benchmark
High-performance CUDA kernels for real-time, low-latency financial inference, optimized for both consumer and datacenter GPUs.
☆20 · Jul 25, 2025 · Updated 6 months ago
Alternatives and similar repositories for cuda_latency_benchmark
Users who are interested in cuda_latency_benchmark are comparing it to the repositories listed below.
- Linear Relational Embeddings (LREs) and Linear Relational Concepts (LRCs) for LLMs in PyTorch ☆10 · Aug 7, 2024 · Updated last year
- ☆12 · Jul 8, 2024 · Updated last year
- 🤖 Implementation of Self Normalizing Networks (SNN) in PyTorch. ☆12 · Jun 19, 2017 · Updated 8 years ago
- 1st Place Team Crane: @aswinkumar1999 @rathull @kyolebu ☆29 · Sep 8, 2025 · Updated 5 months ago
- A Benchmark for Multi-Stage Legal Case Documents Generation ☆14 · Feb 24, 2025 · Updated 11 months ago
- The course work repo for UoSurrey EEEM071 (2023 Spring) ☆11 · May 9, 2023 · Updated 2 years ago
- A Java-based framework for combinatorial test input generation, fault characterization and automated test execution. ☆11 · Jan 22, 2024 · Updated 2 years ago
- Python package for compressing floating-point PyTorch tensors ☆13 · Jul 22, 2024 · Updated last year
- Predicting the Stock Market - Can we do it? ☆10 · Jul 24, 2021 · Updated 4 years ago
- Minimal Transformer base in JAX. A single backbone for language modelling, diffusion, classification, etc... ☆14 · May 28, 2025 · Updated 8 months ago
- Learning to Skip the Middle Layers of Transformers ☆17 · Aug 7, 2025 · Updated 6 months ago
- Official repository for the ICCV 2021 (Oral) paper "(Just) A Spoonful of Refinements Helps the Registration Error Go Down" ☆11 · Dec 21, 2021 · Updated 4 years ago
- Image Segmentation using k-means, n-cuts and superpixels ☆11 · Mar 31, 2019 · Updated 6 years ago
- AI Security Newsletter - A monthly digest of AI security research, insights, reports, upcoming events, and tools & resources ☆22 · Feb 5, 2026 · Updated last week
- Materials for "Multi-property Steering of Large Language Models with Dynamic Activation Composition" ☆14 · Nov 22, 2024 · Updated last year
- API wrapper for local LLMs ☆12 · Apr 24, 2023 · Updated 2 years ago
- AI Agents using Crew AI ☆12 · Jun 16, 2024 · Updated last year
- machine learning specialization course 2 ☆12 · Dec 23, 2018 · Updated 7 years ago
- Confidence Regulation Neurons in Language Models (NeurIPS 2024) ☆15 · Feb 1, 2025 · Updated last year
- Project repo for the paper SILT: Self-supervised Lighting Transfer Using Implicit Image Decomposition ☆10 · Dec 17, 2021 · Updated 4 years ago
- 16 projects in the framework of Computer Vision algorithms: CNN, RNN, LSTM, F… ☆11 · Aug 24, 2020 · Updated 5 years ago
- Code for COMET: Cardinality Constrained Mixture of Experts with Trees and Local Search ☆11 · Jun 21, 2023 · Updated 2 years ago
- ☆16 · Apr 7, 2025 · Updated 10 months ago
- LightGaussian tailored for large-scale scene. Used by https://github.com/DekuLiuTesla/CityGaussian ☆12 · Oct 9, 2024 · Updated last year
- Code for the EACL 2024 paper: "Small Language Models Improve Giants by Rewriting Their Outputs" ☆12 · Apr 20, 2024 · Updated last year
- SDXL GPU cluster scripts ☆16 · Oct 28, 2023 · Updated 2 years ago
- pytorch implementation of "S3NET: GRAPH REPRESENTATIONAL NETWORK FOR SKETCH RECOGNITION" ☆10 · Oct 6, 2020 · Updated 5 years ago
- ☆21 · Jun 22, 2025 · Updated 7 months ago
- AI/ML/NLP and Computer Science Training ☆13 · Sep 20, 2018 · Updated 7 years ago
- ☆14 · Apr 11, 2017 · Updated 8 years ago
- ☆16 · Nov 23, 2023 · Updated 2 years ago
- 🧠 Workshop Notebook and assets for the Anthropic Hackathon ☆12 · Nov 4, 2023 · Updated 2 years ago
- A tiny easily hackable implementation of a feature dashboard. ☆15 · Oct 21, 2025 · Updated 3 months ago
- Tools for optimizing steering vectors in LLMs. ☆19 · Apr 10, 2025 · Updated 10 months ago
- ☆17 · Jan 19, 2025 · Updated last year
- Reasoning Activation in LLMs via Small Model Transfer (NeurIPS 2025) ☆21 · Oct 16, 2025 · Updated 3 months ago
- Latent Large Language Models ☆19 · Aug 24, 2024 · Updated last year
- Code for Paper ACL'25: FiDELIS: Faithful Reasoning of Large Language Model on Knowledge Graph Question Answering ☆18 · May 8, 2025 · Updated 9 months ago
- Code to go along with my AI agents youtube video ☆17 · Apr 5, 2024 · Updated last year