ThinamXx / Meta-llamaLinks
Complete implementation of Llama2 with/without KV cache & inference π
β46Updated last year
Alternatives and similar repositories for Meta-llama
Users that are interested in Meta-llama are comparing it to the libraries listed below
Sorting:
- Building GPT ...β17Updated 6 months ago
- β35Updated last week
- A set of scripts and notebooks on LLM finetunning and dataset creationβ111Updated 8 months ago
- Fine-tune an LLM to perform batch inference and online serving.β111Updated 3 weeks ago
- Fine tune Gemma 3 on an object detection taskβ43Updated this week
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β49Updated 10 months ago
- RAGs: Simple implementations of Retrieval Augmented Generation (RAG) Systemsβ105Updated 4 months ago
- Prune transformer layersβ69Updated last year
- I learn about and explain quantizationβ26Updated last year
- Starter pack for NeurIPS LLM Efficiency Challenge 2023.β122Updated last year
- A collection of hand on notebook for LLMs practitionerβ47Updated 4 months ago
- Set of scripts to finetune LLMsβ37Updated last year
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 linesβ196Updated last year
- Distributed training (multi-node) of a Transformer modelβ68Updated last year
- Repository containing awesome resources regarding Hugging Face tooling.β47Updated last year
- Notebooks for fine tuning pali gemmaβ107Updated last month
- This repository contains the code for dataset curation and finetuning of instruct variant of the Bilingual OpenHathi model. The resultinβ¦β23Updated last year
- Just some stuff for Interview questions, books, annotated paper, notes, cheat sheets etc etc related to ML,AI, Deep Learning and Data Scβ¦β114Updated last month
- Large Language Model (LLM) Inference API and Chatbotβ125Updated last year
- β123Updated 7 months ago
- A repository to unravel the language of GPUs, making their kernel conversations easy to understandβ184Updated last week
- Various installation guides for Large Language Modelsβ68Updated last month
- β46Updated 2 months ago
- Direct Preference Optimization Implementationβ16Updated last year
- β77Updated 11 months ago
- β23Updated last year
- β15Updated last year
- minimal GRPO implementation from scratchβ90Updated 2 months ago
- Deep Learning for Computer Visionβ55Updated 11 months ago
- Reference implementation of Mistral AI 7B v0.1 model.β29Updated last year