Here's all my Python/Numba (CUDA) code for the encoder block I made :)
☆76Apr 28, 2025Updated last year
Alternatives and similar repositories for Encoder-Block-in-CUDA
Users that are interested in Encoder-Block-in-CUDA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Transformer Architecture written with CUDA, C++ and LibTorch.☆10Jul 26, 2025Updated 11 months ago
- Low memory full parameter finetuning of LLMs☆54Jul 18, 2025Updated 11 months ago
- Quantum continuous variable operations simulated in TFQ.☆22Jul 14, 2023Updated 2 years ago
- Mixtral finetuning☆19Feb 2, 2024Updated 2 years ago
- AI assisted article writing project☆13Jan 31, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Debug as an Effect (DaaE)☆10Apr 22, 2025Updated last year
- ☆17May 15, 2024Updated 2 years ago
- Orchestrate sandboxed agents that run in the cloud while you work. Fully open source☆76Updated this week
- Recursive Self-Aggregation evals on ARC-AGI☆36Jan 26, 2026Updated 5 months ago
- ☆11May 2, 2022Updated 4 years ago
- coding CUDA everyday!☆76Feb 5, 2026Updated 4 months ago
- Codebase from our first release.☆58Feb 17, 2026Updated 4 months ago
- Repository to create traveling waves integrate special information through time☆58Aug 8, 2025Updated 10 months ago
- KITE (Knowledge-Intensive Task Evaluation) is an end-to-end benchmark for RAG pipelines☆23Aug 14, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- A full-stack document management and AI chat application that enables users to upload, manage, and chat with their documents using AI. Bu…☆16Aug 10, 2025Updated 10 months ago
- Expand -> Retrieve -> Rerank - simple method with strong results on BRIGHT benchmark☆22Aug 22, 2025Updated 10 months ago
- zer0dex is a local dual-layer memory pattern for AI agents: a compressed, human-readable markdown index plus a vector store queried autom…☆53Updated this week
- LLM query engine to retrieve augmented responses from json files.☆15Oct 12, 2023Updated 2 years ago
- Learnings and programs related to CUDA☆438Jun 29, 2025Updated last year
- BlockRank makes LLMs efficient and scalable for RAG and in-context ranking☆44Dec 12, 2025Updated 6 months ago
- Demo for dependent types + runtime code generation☆72Feb 18, 2025Updated last year
- Talk to your data. Instantly analyze, visualize, and transform☆22Oct 30, 2025Updated 8 months ago
- Exploration work on executing CUDA kernels on Apple Silicon (Metal-compatible code).☆37Jun 10, 2025Updated last year
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- 🌏 Teddy is a tiny but scalable http server based on Java NIO, inspired by netty.☆11Dec 26, 2019Updated 6 years ago
- HashIndex: LLM-optimized Document Indexing without vector search☆55Jan 24, 2026Updated 5 months ago
- something for paper agent☆11Dec 18, 2024Updated last year
- The Structure and Interpretation of Tensor Programs: The Hacker's Accelerated Introduction to Deep Learning and Deep Learning Systems☆80Jun 25, 2026Updated last week
- Roadmap for Data Science circle associated with CAT Reloaded.☆39May 1, 2025Updated last year
- If you can read ~200 lines of Python, you understand MCP.☆69Mar 12, 2026Updated 3 months ago
- Chess engine that uses neural network to decide on moves☆32Jan 6, 2023Updated 3 years ago
- Code execution runtime for the STAC Overflow: Map Floodwater from Radar Imagery competition☆12Sep 29, 2021Updated 4 years ago
- a simple CLI command that will create a template of a generic ML Project☆82Dec 6, 2025Updated 6 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Adds timm pretrained backbone to pytorch's FasterRcnn model☆12Jan 25, 2024Updated 2 years ago
- Exploring Applications of GRPO☆252Aug 25, 2025Updated 10 months ago
- RAG based agent with chDB(ClickHouse)☆23May 14, 2025Updated last year
- LLM-Powered Data Discovery System for Tabular Data☆32Apr 7, 2026Updated 2 months ago
- H-Net Dynamic Hierarchical Architecture☆81Sep 11, 2025Updated 9 months ago
- This sample shows how to use the oneAPI Video Processing Library (oneVPL) to perform a single and multi-source video decode and preproces…☆15Jun 15, 2023Updated 3 years ago
- A small experiment on assigning a processes threads a specific CPU and then blocking it with a high priority thread☆33Sep 24, 2025Updated 9 months ago