uygarkurt / BERT-PyTorchLinks
☆17Updated 7 months ago
Alternatives and similar repositories for BERT-PyTorch
Users that are interested in BERT-PyTorch are comparing it to the libraries listed below
Sorting:
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆78Updated 10 months ago
- LLM_library is a comprehensive repository serves as a one-stop resource hands-on code, insightful summaries.☆69Updated last year
- LORA: Low-Rank Adaptation of Large Language Models implemented using PyTorch☆113Updated 2 years ago
- ☆84Updated last year
- Tutorial for how to build BERT from scratch☆98Updated last year
- minimal scripts for 24GB VRAM GPUs. training, inference, whatever☆41Updated 2 months ago
- Distributed training (multi-node) of a Transformer model☆80Updated last year
- Prune transformer layers☆69Updated last year
- This repository contains an implementation of the LLaMA 2 (Large Language Model Meta AI) model, a Generative Pretrained Transformer (GPT)…☆70Updated last year
- Fine-tune ModernBERT on a large Dataset with Custom Tokenizer Training☆67Updated 6 months ago
- Library to facilitate pruning of LLMs based on context☆32Updated last year
- MathPrompter Implementation: This repository hosts an implementation based on the 'MathPrompter: Mathematical Reasoning Using Large Langu…☆13Updated 4 months ago
- Scripts for fine-tuning Llama2 via SFT and DPO.☆203Updated 2 years ago
- Collection of autoregressive model implementation☆86Updated 4 months ago
- Fine-tuning Open-Source LLMs for Adaptive Machine Translation☆85Updated last month
- MAFAND-MT☆57Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆33Updated 3 months ago
- Fine-tuning large language models (LLMs) is crucial for enhancing performance across domain-specific task applications. This comprehensiv…☆12Updated 11 months ago
- ☆16Updated last year
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss.☆135Updated last year
- A set of scripts and notebooks on LLM finetunning and dataset creation☆110Updated 11 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated last year
- nanogpt turned into a chat model☆72Updated 2 years ago
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"☆55Updated 11 months ago
- A pipeline for LLM knowledge distillation☆108Updated 4 months ago
- ☆43Updated 3 months ago
- Model, Code & Data for the EMNLP'23 paper "Making Large Language Models Better Data Creators"☆135Updated last year
- Fine-tunes a student LLM using teacher feedback for improved reasoning and answer quality. Implements GRPO with teacher-provided evaluati…☆46Updated 3 months ago
- Fine-Tuning Llama3-8B LLM in a multi-GPU environment using DeepSpeed☆18Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimization☆65Updated last year