uygarkurt / BERT-PyTorch
β17Updated 2 months ago
Alternatives and similar repositories for BERT-PyTorch:
Users that are interested in BERT-PyTorch are comparing it to the libraries listed below
- Lightweight demos for finetuning LLMs. Powered by π€ transformers and open-source datasets.β73Updated 5 months ago
- LLM_library is a comprehensive repository serves as a one-stop resource hands-on code, insightful summaries.β69Updated last year
- Fine-tunes a student LLM using teacher feedback for improved reasoning and answer quality. Implements GRPO with teacher-provided evaluatiβ¦β39Updated last month
- Unofficial implementation of https://arxiv.org/pdf/2407.14679β44Updated 6 months ago
- Fine-tune ModernBERT on a large Dataset with Custom Tokenizer Trainingβ62Updated last month
- Set of scripts to finetune LLMsβ37Updated last year
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss.β119Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimizationβ59Updated last year
- β87Updated last year
- Prune transformer layersβ68Updated 10 months ago
- β143Updated 8 months ago
- Complete implementation of Llama2 with/without KV cache & inference πβ47Updated 10 months ago
- Distributed training (multi-node) of a Transformer modelβ63Updated 11 months ago
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuningβ31Updated last month
- We study toy models of skill learning.β24Updated 2 months ago
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIMβ54Updated 11 months ago
- Collection of resources for RL and Reasoningβ25Updated last month
- This is the official repository for Inheritune.β111Updated last month
- β47Updated 7 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β49Updated 8 months ago
- π Text Classification with LoRA (Low-Rank Adaptation) of Language Models - Efficiently fine-tune large language models for text classifiβ¦β47Updated last year
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.β35Updated 11 months ago
- Fine-tuning large language models (LLMs) is crucial for enhancing performance across domain-specific task applications. This comprehensivβ¦β12Updated 6 months ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.β34Updated 3 months ago
- Fine-Tuning Llama3-8B LLM in a multi-GPU environment using DeepSpeedβ14Updated 10 months ago
- β41Updated 11 months ago
- Tutorial for how to build BERT from scratchβ91Updated 10 months ago
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs.β42Updated 10 months ago
- β16Updated last year
- β20Updated 3 years ago