aniket-mish / cuda
everything i know about cuda and triton
☆13Updated 3 months ago
Alternatives and similar repositories for cuda
Users that are interested in cuda are comparing it to the libraries listed below
Sorting:
- Fine-tune an LLM to perform batch inference and online serving.☆110Updated last week
- This playlab encompasses a multitude of projects crafted through the utilization of Large Language Models, showcasing the versatility and…☆118Updated 3 weeks ago
- lancedb-myntra-fashion-search☆27Updated last year
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da☆103Updated last month
- Complete implementation of Llama2 with/without KV cache & inference 🚀☆46Updated 11 months ago
- Find your Twin Celebrity in Vector Space☆38Updated 4 months ago
- ☆80Updated 3 weeks ago
- ☆15Updated last year
- A template to kick-start your Python project ✨🚀☆51Updated 4 months ago
- ☆29Updated last year
- AI agent with RAG+ReAct on Indian Constitution & BNS☆64Updated 6 months ago
- Coding an LLM and its building blocks from scratch.☆34Updated last month
- ☆89Updated last month
- ☆34Updated last week
- GenAI Experimentation☆58Updated 2 weeks ago
- ☆89Updated last year
- 💻 Decoding ML articles hub: Hands-on articles with code on production-grade ML☆129Updated 2 months ago
- 📚 Tutorial on building a modern search app for Amazon e-commerce products leveraging tabular semantic search and natural language querie…☆64Updated 2 weeks ago
- Optimized Large Language Models for Financial Applications – Efficient, Scalable, and Domain-Specific AI for Finance.☆47Updated last month
- zero-to-lightning☆30Updated last year
- ☆61Updated 7 months ago
- Make Llama 3.1 8B talk in Rick Sanchez’s style☆115Updated 3 months ago
- ☆15Updated 9 months ago
- Just enough Kubernetes for you to fly☆187Updated last month
- repo of paper implementations☆19Updated 2 months ago
- "LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!"☆27Updated 2 weeks ago
- A collection of MCP servers.☆19Updated last month
- Starter pack for NeurIPS LLM Efficiency Challenge 2023.☆124Updated last year
- building a Large Language Model (LLM) from scratch.☆31Updated 3 months ago
- Running load tests on a FastAPI application using Locust☆15Updated last month