ThinamXx / Meta-llamaLinks
Complete implementation of Llama2 with/without KV cache & inference π
β46Updated last year
Alternatives and similar repositories for Meta-llama
Users that are interested in Meta-llama are comparing it to the libraries listed below
Sorting:
- β39Updated last month
- Building GPT ...β18Updated 6 months ago
- A set of scripts and notebooks on LLM finetunning and dataset creationβ111Updated 8 months ago
- Prune transformer layersβ69Updated last year
- Fine-tune an LLM to perform batch inference and online serving.β112Updated 3 weeks ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β49Updated 11 months ago
- Set of scripts to finetune LLMsβ37Updated last year
- Starter pack for NeurIPS LLM Efficiency Challenge 2023.β124Updated last year
- Direct Preference Optimization Implementationβ16Updated last year
- A collection of fine-tuning notebooks!β27Updated last year
- Just some stuff for Interview questions, books, annotated paper, notes, cheat sheets etc etc related to ML,AI, Deep Learning and Data Scβ¦β115Updated last month
- This repository contains the code for dataset curation and finetuning of instruct variant of the Bilingual OpenHathi model. The resultinβ¦β23Updated last year
- zero-to-lightningβ29Updated last year
- A collection of hand on notebook for LLMs practitionerβ48Updated 5 months ago
- Various installation guides for Large Language Modelsβ70Updated 2 months ago
- Fine tune Gemma 3 on an object detection taskβ57Updated this week
- Making of cuda kernelβ16Updated 3 weeks ago
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 linesβ197Updated last year
- Distributed training (multi-node) of a Transformer modelβ71Updated last year
- β143Updated 11 months ago
- RAGs: Simple implementations of Retrieval Augmented Generation (RAG) Systemsβ116Updated 5 months ago
- Notebooks for fine tuning pali gemmaβ109Updated 2 months ago
- Unofficial implementation of https://arxiv.org/pdf/2407.14679β45Updated 9 months ago
- I learn about and explain quantizationβ26Updated last year
- β77Updated last year
- Large Language Model (LLM) Inference API and Chatbotβ126Updated last year
- everything i know about cuda and tritonβ13Updated 4 months ago
- ML/DL Math and Method notesβ61Updated last year
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Daβ105Updated 2 months ago
- Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024β311Updated last month