ThinamXx / Meta-llama
Complete implementation of Llama2 with/without KV cache & inference
☆47 · Updated 11 months ago
Alternatives and similar repositories for Meta-llama:
Users who are interested in Meta-llama are comparing it to the repositories listed below.
- Building GPT ... ☆17 · Updated 5 months ago
- A set of scripts and notebooks on LLM fine-tuning and dataset creation ☆107 · Updated 7 months ago
- Prune transformer layers ☆69 · Updated 11 months ago
- Fine-tune an LLM to perform batch inference and online serving. ☆110 · Updated this week
- Starter pack for NeurIPS LLM Efficiency Challenge 2023. ☆124 · Updated last year
- RAGs: Simple implementations of Retrieval Augmented Generation (RAG) Systems ☆104 · Updated 3 months ago
- Making of cuda kernel ☆15 · Updated 2 weeks ago
- Set of scripts to finetune LLMs ☆37 · Updated last year
- A collection of hands-on notebooks for LLM practitioners ☆47 · Updated 3 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆49 · Updated 9 months ago
- Direct Preference Optimization Implementation ☆16 · Updated last year
- minimal GRPO implementation from scratch ☆87 · Updated last month
- ☆143 · Updated 9 months ago
- Various installation guides for Large Language Models ☆69 · Updated last week
- Notebooks for fine tuning pali gemma ☆100 · Updated 3 weeks ago
- Distributed training (multi-node) of a Transformer model ☆65 · Updated last year
- A collection of fine-tuning notebooks! ☆27 · Updated last year
- A template to kick-start your Python project ✨ ☆51 · Updated 4 months ago
- This playlab encompasses a multitude of projects crafted through the utilization of Large Language Models, showcasing the versatility and… ☆118 · Updated 2 weeks ago
- This repository contains the code for dataset curation and finetuning of instruct variant of the Bilingual OpenHathi model. The resultin… ☆23 · Updated last year
- zero-to-lightning ☆30 · Updated last year
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da ☆102 · Updated last month
- Mistral + Haystack: build RAG pipelines that rock ☆103 · Updated last year
- A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various use… ☆115 · Updated last week
- Reference implementation of Mistral AI 7B v0.1 model. ☆28 · Updated last year
- A repository containing general tutorials I'd like to share with the world. ☆43 · Updated 2 weeks ago
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand ☆180 · Updated last week
- Repository containing awesome resources regarding Hugging Face tooling. ☆47 · Updated last year
- Repo for ML models built from scratch, such as Self-Attention, Linear + Logistic Regression, PCA, LDA, CNN, LSTM, Neural Networks using Nu… ☆47 · Updated 3 months ago
- This is the official repository for Peacock: A Family of Arabic Multimodal Large Language Models and Benchmarks. ☆25 · Updated 4 months ago