uygarkurt / BERT-PyTorch
☆17 · Updated last year
Alternatives and similar repositories for BERT-PyTorch
Users who are interested in BERT-PyTorch are comparing it to the libraries listed below.
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets. ☆78 · Updated last year
- Tutorial for how to build BERT from scratch ☆101 · Updated last year
- LORA: Low-Rank Adaptation of Large Language Models implemented using PyTorch ☆119 · Updated 2 years ago
- nanogpt turned into a chat model ☆80 · Updated 2 years ago
- LLM_library is a comprehensive repository that serves as a one-stop resource for hands-on code and insightful summaries. ☆69 · Updated 2 years ago
- A set of scripts and notebooks on LLM fine-tuning and dataset creation ☆113 · Updated last year
- Distributed training (multi-node) of a Transformer model ☆90 · Updated last year
- ☆20 · Updated 4 years ago
- BERT explained from scratch ☆16 · Updated 2 years ago
- Fine-tuning Open-Source LLMs for Adaptive Machine Translation ☆90 · Updated 6 months ago
- Prune transformer layers ☆74 · Updated last year
- This repository contains an implementation of the LLaMA 2 (Large Language Model Meta AI) model, a Generative Pretrained Transformer (GPT)… ☆74 · Updated 2 years ago
- Unofficial implementation of https://arxiv.org/pdf/2407.14679 ☆53 · Updated last year
- Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture. ☆195 · Updated last year
- Training and Fine-tuning an LLM in Python and PyTorch. ☆43 · Updated 2 years ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M… ☆244 · Updated last year
- ViDeBERTa: A powerful pre-trained language model for Vietnamese, EACL 2023 ☆58 · Updated 2 years ago
- Fine-tuning large language models (LLMs) is crucial for enhancing performance across domain-specific task applications. This comprehensiv… ☆12 · Updated last year
- Combining ViT and GPT-2 for image captioning. Trained on MS-COCO. The model was implemented mostly from scratch. ☆48 · Updated 2 years ago
- ☆42 · Updated last year
- Fine-Tuning Llama3-8B LLM in a multi-GPU environment using DeepSpeed ☆18 · Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers. ☆32 · Updated 3 months ago
- ☆27 · Updated last year
- Small and Efficient Mathematical Reasoning LLMs ☆73 · Updated last year
- Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback ☆96 · Updated 2 years ago
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning" ☆55 · Updated last year
- LLaMA 3 is one of the most promising open-source models after Mistral; we will recreate its architecture in a simpler manner. ☆195 · Updated last year
- Playground for Transformers ☆53 · Updated 2 years ago
- Several types of attention modules written in PyTorch for learning purposes ☆52 · Updated last week
- ☆78 · Updated 2 years ago