hkproj / bert-from-scratch
BERT explained from scratch
☆14 Updated last year
Alternatives and similar repositories for bert-from-scratch
Users who are interested in bert-from-scratch are comparing it to the libraries listed below.
- Complete implementation of Llama2 with/without KV cache & inference ☆47 Updated last year
- ☆39 Updated last month
- ☆89 Updated 9 months ago
- LLM_library is a comprehensive repository that serves as a one-stop resource for hands-on code and insightful summaries. ☆69 Updated last year
- Distributed training (multi-node) of a Transformer model ☆72 Updated last year
- LORA: Low-Rank Adaptation of Large Language Models implemented using PyTorch ☆109 Updated last year
- A set of scripts and notebooks on LLM finetuning and dataset creation ☆111 Updated 9 months ago
- This repository contains the code for dataset curation and finetuning of instruct variant of the Bilingual OpenHathi model. The resultin… ☆23 Updated last year
- ☆86 Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆49 Updated 11 months ago
- Notes on Direct Preference Optimization ☆19 Updated last year
- Prune transformer layers ☆69 Updated last year
- Starter pack for NeurIPS LLM Efficiency Challenge 2023. ☆124 Updated last year
- Truth Forest: Toward Multi-Scale Truthfulness in Large Language Models through Intervention without Tuning ☆46 Updated last year
- Set of scripts to finetune LLMs ☆37 Updated last year
- Unofficial implementation of https://arxiv.org/pdf/2407.14679 ☆45 Updated 9 months ago
- Notes about LLaMA 2 model ☆61 Updated last year
- ☆40 Updated last year
- I learn about and explain quantization ☆26 Updated last year
- RAGs: Simple implementations of Retrieval Augmented Generation (RAG) Systems ☆116 Updated 5 months ago
- Collection of autoregressive model implementation ☆85 Updated 2 months ago
- ☆179 Updated last year
- Notes on quantization in neural networks ☆86 Updated last year
- Building GPT ... ☆18 Updated 6 months ago
- Triton implementation of GPT/LLAMA ☆19 Updated 9 months ago
- Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback ☆97 Updated last year
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets. ☆77 Updated 8 months ago
- Notes and commented code for RLHF (PPO) ☆96 Updated last year
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales" ☆106 Updated 8 months ago
- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day ☆257 Updated last year