akanyaani / miniLLAMALinks
A simplified LLAMA implementation for training and inference tasks.
☆33Updated 7 months ago
Alternatives and similar repositories for miniLLAMA
Users that are interested in miniLLAMA are comparing it to the libraries listed below
Sorting:
- GGUF Quantization of any LLM.☆41Updated last year
- This repository contains the code for dataset curation and finetuning of instruct variant of the Bilingual OpenHathi model. The resultin…☆23Updated 2 years ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆232Updated last year
- Maybe the new state of the art vision model? we'll see 🤷♂️☆171Updated 2 years ago
- Learn the building blocks of how to build gpt-oss from scratch☆113Updated 4 months ago
- Fine-tuning LLMs using QLoRA☆269Updated last year
- a simplified version of Meta's Llama 3 model to be used for learning☆44Updated last year
- Train your own small bitnet model☆77Updated last year
- Utils for Unsloth https://github.com/unslothai/unsloth☆191Updated this week
- A lightweight evaluation suite tailored specifically for assessing Indic LLMs across a diverse range of tasks☆39Updated last year
- A set of scripts and notebooks on LLM finetunning and dataset creation☆116Updated last year
- Reference implementation of Mistral AI 7B v0.1 model.☆28Updated 2 years ago
- Merge Transformers language models by use of gradient parameters.☆213Updated last year
- LLM Workshop by Sourab Mangrulkar☆401Updated last year
- Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.☆197Updated last year
- InsightSolver: Colab notebooks for exploring and solving operational issues using deep learning, machine learning, and related models.☆102Updated last year
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆77Updated last year
- Scripts for text classification with llama and bert☆32Updated 6 months ago
- Various installation guides for Large Language Models☆77Updated 9 months ago
- ☆127Updated 10 months ago
- Google TPU optimizations for transformers models☆134Updated 2 weeks ago
- LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's architecture in a simpler manner.☆200Updated last year
- nanogpt turned into a chat model☆81Updated 2 years ago
- Combining ViT and GPT-2 for image captioning. Trained on MS-COCO. The model was implemented mostly from scratch.☆48Updated 2 years ago
- LLM-Training-API: Including Embeddings & ReRankers, mergekit, LaserRMT☆27Updated last year
- Toolkit for attaching, training, saving and loading of new heads for transformer models☆294Updated 11 months ago
- 🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.☆138Updated last year
- Testing LLM reasoning abilities with family relationship quizzes.☆63Updated last year
- a LLM cookbook, for building your own from scratch, all the way from gathering data to training a model☆168Updated last year
- One click templates for inferencing Language Models☆228Updated 2 months ago