A simple implementation of Llama 1 and 2. The Llama architecture and all of its components are built from scratch in PyTorch, including GQA (Grouped Query Attention), RoPE (Rotary Positional Embeddings), RMS Norm, the FeedForward block, the Encoder (as this is only for running inference with the model), and SwiGLU (activation function).
☆13 · May 6, 2024 · Updated last year
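Two of the components named in the description, RMS Norm and SwiGLU, are small enough to sketch directly. The following is a minimal illustration in plain Python (not PyTorch, so it runs without dependencies) and is not taken from the repository; the function names `rms_norm`, `silu`, and `swiglu` and the default `eps` value are assumptions made for this example.

```python
import math


def rms_norm(x, weight, eps=1e-6):
    """Scale a vector by the reciprocal of its root mean square.

    x and weight are equal-length lists of floats; weight plays the role
    of the learned per-feature gain. eps is an assumed stabilising constant.
    """
    mean_sq = sum(v * v for v in x) / len(x)
    scale = 1.0 / math.sqrt(mean_sq + eps)
    return [w * v * scale for v, w in zip(x, weight)]


def silu(z):
    """SiLU (a.k.a. Swish) activation: z * sigmoid(z)."""
    return z / (1.0 + math.exp(-z))


def swiglu(gate, up):
    """SwiGLU gating: element-wise SiLU(gate) * up.

    In a full feed-forward block, gate and up would be two separate linear
    projections of the same input, and the result would pass through a third
    projection. Only the gating step is shown here.
    """
    return [silu(g) * u for g, u in zip(gate, up)]


if __name__ == "__main__":
    hidden = [3.0, 4.0]
    normed = rms_norm(hidden, weight=[1.0, 1.0])
    print(normed)
    print(swiglu(gate=normed, up=[0.5, -0.5]))
```

Unlike LayerNorm, RMS Norm does not subtract the mean, so it needs only one reduction over the feature dimension.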
Alternatives and similar repositories for llama-inference
Users that are interested in llama-inference are comparing it to the libraries listed below
- A curated collection of prompts for Grok Imagine by xAI ☆23 · Oct 19, 2025 · Updated 4 months ago
- Unlock the groundbreaking advances of deep learning with this extensively revised new edition of the bestselling original. Learn directly… ☆10 · Feb 27, 2023 · Updated 3 years ago
- PyTorch library to accelerate super-resolution research ☆11 · Jun 23, 2024 · Updated last year
- SYN flood implementation using Boost.Asio ☆12 · Nov 20, 2014 · Updated 11 years ago
- Neural Network Execution Service ☆11 · Oct 3, 2023 · Updated 2 years ago
- yolosegment2labelme - a Python package that allows you to convert YOLO segmentation prediction results to LabelMe and anylabeling JSON fo… ☆10 · May 8, 2024 · Updated last year
- RAG Based LLM Chatbot Built using Open Source Stack (Llama 3.2 Model, BGE Embeddings, and Qdrant running locally within a Docker Containe… ☆15 · Jan 9, 2025 · Updated last year
- ☆12 · Dec 14, 2024 · Updated last year
- Another Unofficial Python Wrapper for Coinbase Pro ☆10 · Oct 28, 2022 · Updated 3 years ago
- ☆14 · Apr 14, 2021 · Updated 4 years ago
- Utility of SMI (Secondary Memory Interface) of Raspberry Pi ☆12 · Apr 29, 2017 · Updated 8 years ago
- MATLAB wrapper for LightGBM ☆14 · Jul 30, 2017 · Updated 8 years ago
- ☆13 · Sep 12, 2024 · Updated last year
- How to build Text-to-Image app using stable diffusion via hugging face ☆10 · May 28, 2023 · Updated 2 years ago
- Code Repository for Blog - How to Productionize Large Language Models (LLMs) ☆12 · Mar 27, 2024 · Updated last year
- Multi-Agent AI App from Scratch in python without any dependency of framework ☆15 · Jan 7, 2025 · Updated last year
- This Streamlit application creates an interactive Data Visualization Assistant that can understand Natural Language Queries and generate … ☆17 · Jan 13, 2025 · Updated last year
- Python client for Jikan.moe, MyAnimeList unofficial API with good intentions. ☆14 · Dec 20, 2022 · Updated 3 years ago
- Files used for the evaluation of uiCA ☆18 · Dec 14, 2022 · Updated 3 years ago
- A barely barebone NumPy implementation of Hierarchical Temporal Memory. ☆11 · Mar 26, 2023 · Updated 2 years ago
- Example code for the book 《GPT-4, ChatGPT, 라마인덱스, 랭체인을 활용한 인공지능 프로그래밍》 (AI Programming with GPT-4, ChatGPT, LlamaIndex, and LangChain) ☆10 · Jan 16, 2024 · Updated 2 years ago
- Simple CTC implementation for PyTorch ☆14 · Oct 25, 2017 · Updated 8 years ago
- This repo implements Video generation model using Latent Diffusion Transformers(Latte) in PyTorch and provides training and inference cod… ☆17 · Jan 6, 2025 · Updated last year
- A PyTorch implementation of Vector Quantized Variational Autoencoder (VQ-VAE) with EMA updates, pretrained encoder, and K-means initializ… ☆21 · Dec 31, 2024 · Updated last year
- Environment equipped with reinforcement learning algorithms to train agents to play tic-tac-toe. ☆13 · Mar 4, 2023 · Updated 3 years ago
- An open-source translation agent designed to enhance the quality of text translations by leveraging large language models ☆21 · Aug 12, 2025 · Updated 6 months ago
- Benchmark Generator for Global Routing ☆13 · Jul 18, 2019 · Updated 6 years ago
- This is a repository for sharing the dataset used for testing in Seahorse DB. ☆38 · Dec 8, 2025 · Updated 3 months ago
- An HTTP Server for FPGAs ☆16 · Sep 26, 2023 · Updated 2 years ago
- Fine-tuning large language models (LLMs) is crucial for enhancing performance across domain-specific task applications. This comprehensiv… ☆12 · Sep 19, 2024 · Updated last year
- Simple program that allows an Arduino to be re-programmed to emulate a USB keyboard that reads input from a serial connection. ☆12 · Jan 17, 2025 · Updated last year
- DEPRECATED: research attempt to build e2e task oriented chatbot optimized over conversational data and content of DB (single table) ☆11 · Sep 28, 2016 · Updated 9 years ago
- Graph Homomorphism Convolution (ICML'20) ☆12 · Jul 6, 2023 · Updated 2 years ago
- Notes from Reinforcement Learning Specialisation ☆10 · Jul 6, 2021 · Updated 4 years ago
- A replication of the paper "Adaptive Mixtures of Local Experts" applied to the CIFAR-10 image classification dataset. ☆12 · Mar 19, 2021 · Updated 4 years ago
- Deep learning in time series analysis ☆13 · May 21, 2018 · Updated 7 years ago
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀 ☆20 · May 20, 2025 · Updated 9 months ago
- Algorithm study using python day by day ☆13 · Apr 9, 2017 · Updated 8 years ago
- Machine Learning algorithms implementation in Python from scratch. ☆11 · Feb 10, 2019 · Updated 7 years ago