NSA Triton Kernels written with GPT5 and Opus 4.1
☆70 · Aug 12, 2025 · Updated 8 months ago
Alternatives and similar repositories for NSA-Test
Users that are interested in NSA-Test are comparing it to the libraries listed below.
- ☆38 · Aug 7, 2025 · Updated 8 months ago
- An llm wrapper for OpenAI ☆12 · Dec 14, 2024 · Updated last year
- L3 R3: AGM RISC-V +CPLD/FPGA MCU (AG32VH407/AG32VF407/AG32VF303) ☆13 · Nov 3, 2024 · Updated last year
- Entropy Based Sampling and Parallel CoT Decoding ☆17 · Oct 9, 2024 · Updated last year
- An introduction to LLM Sampling ☆80 · Dec 15, 2024 · Updated last year
- ☆12 · Jul 9, 2021 · Updated 4 years ago
- LLM4HWDesign Starting Toolkit ☆19 · Oct 4, 2024 · Updated last year
- This is sample code for Paho MQTT server with Python 2.7 ☆10 · Mar 29, 2016 · Updated 10 years ago
- smolLM with Entropix sampler on pytorch ☆149 · Oct 31, 2024 · Updated last year
- Persistent dense gemm for Hopper in `CuTeDSL` ☆15 · Aug 9, 2025 · Updated 8 months ago
- trio async MQTT client that wraps paho-mqtt ☆12 · Feb 8, 2021 · Updated 5 years ago
- An AI character interaction system with emotional modeling and advanced memory management ☆17 · Oct 26, 2024 · Updated last year
- ☆11 · Oct 13, 2023 · Updated 2 years ago
- Data Wrangling, Linear Models & other misc. Inferential Statistics. ☆14 · Jul 16, 2022 · Updated 3 years ago
- ☆20 · Mar 3, 2024 · Updated 2 years ago
- Hybrid Search (BM25 & Vector) with SQLite ☆32 · Aug 13, 2024 · Updated last year
- PyTorch implementation of the Flash Spectral Transform Unit. ☆22 · Sep 19, 2024 · Updated last year
- A lightweight code assistant with tool-using capabilities built on HuggingFace's smolagents. ☆41 · Jun 11, 2025 · Updated 10 months ago
- A collection of Compound Retrieval Systems implemented with DSPy and Weaviate. ☆96 · Apr 3, 2026 · Updated last week
- Deepseek-CoT ☆10 · Oct 6, 2024 · Updated last year
- Typescript parser combinator library ☆15 · Jan 9, 2026 · Updated 3 months ago
- ☆16 · Jan 18, 2025 · Updated last year
- DeeperGEMM: crazy optimized version ☆86 · May 5, 2025 · Updated 11 months ago
- smol models are fun too ☆93 · Nov 9, 2024 · Updated last year
- Lego for GRPO ☆30 · May 27, 2025 · Updated 10 months ago
- KsanaDiT: High-Performance DiT (Diffusion Transformer) Inference Framework for Video & Image Generation ☆48 · Mar 30, 2026 · Updated 2 weeks ago
- ☆18 · Dec 2, 2024 · Updated last year
- look how they massacred my boy ☆63 · Oct 16, 2024 · Updated last year
- Simple Transformer in Jax ☆143 · Jun 22, 2024 · Updated last year
- An auxiliary project analysis of the characteristics of KV in DiT Attention. ☆34 · Nov 29, 2024 · Updated last year
- ☆29 · Apr 6, 2026 · Updated last week
- Official repository for the paper Local Linear Attention: An Optimal Interpolation of Linear and Softmax Attention For Test-Time Regressi… ☆23 · Oct 1, 2025 · Updated 6 months ago
- Lock-free linked list ☆16 · Nov 10, 2012 · Updated 13 years ago
- Image caption and manage tool for AI training ☆11 · Jan 24, 2025 · Updated last year
- The Code & Paper for ACL 2023 paper "Enhancing Language Representation with Constructional Information for Natural Language Understanding… ☆20 · Jan 18, 2025 · Updated last year
- Git on a Durable Object ☆38 · Jan 1, 2026 · Updated 3 months ago
- Modify Entropy Based Sampling to work with Mac Silicon via MLX ☆49 · Nov 6, 2024 · Updated last year
- Some utility functions to help myself (and perhaps others) go faster with ML/AI work ☆46 · Feb 11, 2026 · Updated 2 months ago
- Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters ☆132 · Dec 3, 2024 · Updated last year