☆14Apr 26, 2024Updated 2 years ago
Alternatives and similar repositories for llama2-from-scratch
Users that are interested in llama2-from-scratch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆21Jun 4, 2024Updated 2 years ago
- A tool allowing students of Coursera's Heterogeneous Parallel Programming to work on homework using a machine without a CUDA GPU.☆11Mar 11, 2015Updated 11 years ago
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆48Jul 25, 2023Updated 2 years ago
- Rust bindings for SPDK☆12Mar 5, 2020Updated 6 years ago
- Empowering Tomorrow Together: Your Community-Powered AI Platform☆14Aug 19, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- 小彭老师推出 SyCL 2020 课程(施工中,日后会在直播中放出)☆15Sep 3, 2023Updated 2 years ago
- A fast implementation of Leiserchess AI for MIT 6.172`16 http://scrimmage.csail.mit.edu/☆12Dec 22, 2016Updated 9 years ago
- ☆10Nov 14, 2023Updated 2 years ago
- Asynchronous Rust bindings for SPDK.☆18Nov 1, 2022Updated 3 years ago
- CUDA_C编程权威指南示例代码☆13Mar 22, 2023Updated 3 years ago
- Mini CCL - A lightweight collective communication library☆32Jan 2, 2026Updated 5 months ago
- [TOG 2024] BlockFusion: Expandable 3D Scene Generation using Latent Tri-plane Extrapolation☆16Jun 14, 2024Updated last year
- an implementation of parallel skills like amp, ddp, pp, tp for learning purposes☆14Nov 18, 2023Updated 2 years ago
- Simply drag and drop your PDF files into Preve to get started. Ask Preve questions about your document. Get Summaries, key points, specif…☆11Apr 9, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Rust FTL + WebRTC live streaming software.☆13Mar 12, 2022Updated 4 years ago
- This is the official implementation of the voxel-based humanoid locomotion in "Gallant: Voxel Grid-based Humanoid Locomotion and Local-na…☆69Apr 24, 2026Updated last month
- A string_view implementation that can remember if it was a c-string once☆20Nov 16, 2020Updated 5 years ago
- a Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization in pure C.☆24Jul 6, 2024Updated last year
- Reverse engineered Twitter's API☆12Nov 28, 2023Updated 2 years ago
- Brax + Pufferlib + CARBS for gpu-accelerated robotics RL☆12Jun 12, 2025Updated 11 months ago
- SMASH is a hardware-software cooperative mechanism that enables highly-efficient indexing and storage of sparse matrices. The key idea of…☆19May 17, 2020Updated 6 years ago
- Optimizing diffusion for production-ready speeds☆40Jan 10, 2026Updated 4 months ago
- Lab 5 project of MIT-6.5940, deploying LLaMA2-7B-chat on one's laptop with TinyChatEngine.☆18Dec 1, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Expand -> Retrieve -> Rerank - simple method with strong results on BRIGHT benchmark☆22Aug 22, 2025Updated 9 months ago
- ☆19Apr 6, 2024Updated 2 years ago
- WhisperMesh is an advanced chatbot that integrates voice and text interactions, delivering personalized responses through LLM models and …☆16Apr 23, 2025Updated last year
- learning & making kernels in cuda / triton☆22Aug 24, 2025Updated 9 months ago
- 模型压缩的小白入门教程☆22Jul 7, 2024Updated last year
- DGEMM on KNL, achieve 75% MKL☆19May 19, 2022Updated 4 years ago
- Dump page tables on various OSes and analyze them☆31Jan 15, 2016Updated 10 years ago
- Simple Recipe Works: Vision-Language-Action Models are Natural Continual Learners with Reinforcement Learning☆56Mar 16, 2026Updated 2 months ago
- Generative AI app for Lost and Found belonggins using Open AI clip-vit-large to create image embeddings and search them using Natural Lan…☆10Jul 15, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Efficient implementations of Merge Sort and Bitonic Sort algorithms using CUDA for GPU parallel processing, resulting in accelerated sort…☆22Jul 27, 2023Updated 2 years ago
- Developing a high-precision legal expert LLM application called Contract Advisor RAG. The project's goal is to create a Retrieval Augment…☆16Apr 10, 2024Updated 2 years ago
- Multimodal RAG using LlamaIndex, Qdrant, llama.cpp for document QA with local VisonLLM and embedding models☆20Nov 8, 2024Updated last year
- CPU Memory Compiler and Parallel programing☆26Nov 18, 2024Updated last year
- Alpha-Zero Connect Four NN trained via self play☆27Mar 7, 2025Updated last year
- This is the official repo of Text Summarizer Streamlit App video from AI Anytime YouTube channel.☆16Mar 21, 2024Updated 2 years ago
- Programming "Machine Learning in action" with python3.7.☆23Dec 9, 2019Updated 6 years ago