VishnuPJ / MalayaLLM-Gemma2-9BLinks
☆13Updated last year
Alternatives and similar repositories for MalayaLLM-Gemma2-9B
Users that are interested in MalayaLLM-Gemma2-9B are comparing it to the libraries listed below
Sorting:
- gpt-2 from scratch in mlx☆402Updated last year
- Automatically evaluate your LLMs in Google Colab☆664Updated last year
- Training LLMs with QLoRA + FSDP☆1,528Updated last year
- [ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling☆923Updated last week
- Accelerate your Hugging Face Transformers 7.6-9x. Native to Hugging Face and PyTorch.☆685Updated last year
- ☆446Updated last year
- The Tensor (or Array)☆452Updated last year
- The PyTorch implementation of Generative Pre-trained Transformers (GPTs) using Kolmogorov-Arnold Networks (KANs) for language modeling☆724Updated 11 months ago
- Official repository for the paper "Grokfast: Accelerated Grokking by Amplifying Slow Gradients"☆564Updated last year
- UNet diffusion model in pure CUDA☆651Updated last year
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch☆386Updated last month
- Deep learning for dummies. All the practical details and useful utilities that go into working with real models.☆819Updated 3 months ago
- LLM (Large Language Model) FineTuning☆563Updated 7 months ago
- PyTorch compiler that accelerates training and inference. Get built-in optimizations for performance, memory, parallelism, and easily wri…☆1,420Updated this week
- Following master Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish☆172Updated last year
- Starter pack for NeurIPS LLM Efficiency Challenge 2023.☆126Updated 2 years ago
- Computer Vision and Machine Learning Jupyter Notebooks for Educational Purposes☆80Updated this week
- A benchmark to evaluate language models on questions I've previously asked them to solve.☆1,034Updated 6 months ago
- List of resources, libraries and more for developers who would like to build with open-source machine learning off-the-shelf☆198Updated last year
- ☆687Updated 6 months ago
- A New Tamil Large Language Model (LLM) Based on Llama 2☆315Updated last year
- Fine-tune mistral-7B on 3090s, a100s, h100s☆714Updated 2 years ago
- llama3.np is a pure NumPy implementation for Llama 3 model.☆991Updated 6 months ago
- System 2 Reasoning Link Collection☆855Updated 7 months ago
- ☆2,065Updated this week
- A set of scripts and notebooks on LLM finetunning and dataset creation☆110Updated last year
- Llama from scratch, or How to implement a paper without crying☆580Updated last year
- This repository contains the code for dataset curation and finetuning of instruct variant of the Bilingual OpenHathi model. The resultin…☆23Updated last year
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆231Updated last year
- ☆864Updated last year