ThinamXx / Meta-llama
Complete implementation of Llama2 with/without KV cache & inference π
β47Updated 10 months ago
Alternatives and similar repositories for Meta-llama:
Users that are interested in Meta-llama are comparing it to the libraries listed below
- Fine-tune an LLM to perform batch inference and online serving.β104Updated this week
- A set of scripts and notebooks on LLM finetunning and dataset creationβ105Updated 6 months ago
- Building GPT ...β17Updated 4 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β49Updated 9 months ago
- Prune transformer layersβ68Updated 10 months ago
- This repository contains the code for dataset curation and finetuning of instruct variant of the Bilingual OpenHathi model. The resultinβ¦β23Updated last year
- Direct Preference Optimization Implementationβ16Updated last year
- RAGs: Simple implementations of Retrieval Augmented Generation (RAG) Systemsβ100Updated 2 months ago
- Notebooks for fine tuning pali gemmaβ98Updated 3 months ago
- A collection of hand on notebook for LLMs practitionerβ47Updated 2 months ago
- Set of scripts to finetune LLMsβ37Updated last year
- This playlab encompasses a multitude of projects crafted through the utilization of Large Language Models, showcasing the versatility andβ¦β111Updated this week
- zero-to-lightningβ29Updated 11 months ago
- minimal GRPO implementation from scratchβ72Updated 3 weeks ago
- LLM_library is a comprehensive repository serves as a one-stop resource hands-on code, insightful summaries.β69Updated last year
- Starter pack for NeurIPS LLM Efficiency Challenge 2023.β124Updated last year
- I learn about and explain quantizationβ26Updated 11 months ago
- Unofficial implementation of https://arxiv.org/pdf/2407.14679β44Updated 7 months ago
- Collection of autoregressive model implementationβ85Updated last month
- Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024β284Updated last month
- Lightweight demos for finetuning LLMs. Powered by π€ transformers and open-source datasets.β73Updated 5 months ago
- Quantization of LLMs and benchmarking.β10Updated last year
- β22Updated 6 months ago
- Repo for ML Models built from scratch such as Self-Attention, Linear +Logistic Regression, PCA, LDA. CNN, LSTM, Neural Networks using Nuβ¦β47Updated 2 months ago
- ML/DL Math and Method notesβ60Updated last year
- A template to kick-start your Python project β¨πβ51Updated 3 months ago
- Distributed training (multi-node) of a Transformer modelβ64Updated last year
- A miniture AI training framework for PyTorchβ40Updated 2 months ago
- β77Updated 10 months ago
- Making of cuda kernelβ14Updated last week