uygarkurt / BERT-PyTorchLinks
☆17Updated last year
Alternatives and similar repositories for BERT-PyTorch
Users that are interested in BERT-PyTorch are comparing it to the libraries listed below
Sorting:
- LORA: Low-Rank Adaptation of Large Language Models implemented using PyTorch☆122Updated 2 years ago
- Tutorial for how to build BERT from scratch☆102Updated last year
- Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.☆197Updated last year
- a simplified version of Meta's Llama 3 model to be used for learning☆44Updated last year
- ☆82Updated last year
- Distributed training (multi-node) of a Transformer model☆93Updated last year
- Combining ViT and GPT-2 for image captioning. Trained on MS-COCO. The model was implemented mostly from scratch.☆48Updated 2 years ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆77Updated last year
- This repository contains an implementation of the LLaMA 2 (Large Language Model Meta AI) model, a Generative Pretrained Transformer (GPT)…☆74Updated 2 years ago
- LLM_library is a comprehensive repository serves as a one-stop resource hands-on code, insightful summaries.☆69Updated 2 years ago
- nanogpt turned into a chat model☆81Updated 2 years ago
- Fine-Tuning Llama3-8B LLM in a multi-GPU environment using DeepSpeed☆19Updated last year
- Fine-tuning large language models (LLMs) is crucial for enhancing performance across domain-specific task applications. This comprehensiv…☆12Updated last year
- A set of scripts and notebooks on LLM finetunning and dataset creation☆116Updated last year
- Reference implementation of Mistral AI 7B v0.1 model.☆28Updated 2 years ago
- Complete implementation of Llama2 with/without KV cache & inference 🚀☆49Updated last year
- ☆46Updated 8 months ago
- Fine-tune ModernBERT with custom tokenizers, curriculum learning, and next-gen optimizers.☆74Updated 3 weeks ago
- Supplementary material for "Understanding Parameter-Efficient Finetuning of Large Language Models: From Prefix Tuning to Adapters"☆47Updated 2 years ago
- Composition of Multimodal Language Models From Scratch☆15Updated last year
- LoRA and DoRA from Scratch Implementations☆215Updated last year
- Fine-tuning Open-Source LLMs for Adaptive Machine Translation☆90Updated 6 months ago
- LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's architecture in a simpler manner.☆200Updated last year
- Playground for Transformers☆53Updated 2 years ago
- ☆16Updated 2 years ago
- Set of scripts to finetune LLMs☆38Updated last year
- ☆30Updated last year
- RAG Based LLM Chatbot Built using Open Source Stack (Llama 3.2 Model, BGE Embeddings, and Qdrant running locally within a Docker Containe…☆15Updated last year
- This repository contains the code for dataset curation and finetuning of instruct variant of the Bilingual OpenHathi model. The resultin…☆23Updated 2 years ago
- Prune transformer layers☆74Updated last year