jaymody / picoBERT
Like picoGPT but for BERT.
☆50 Updated last year
Alternatives and similar repositories for picoBERT:
Users who are interested in picoBERT are comparing it to the repositories listed below:
- ☆37 Updated last year
- Exploring finetuning public checkpoints on filter 8K sequences on Pile ☆115 Updated last year
- ☆48 Updated last year
- Comprehensive analysis of difference in performance of QLoRA, LoRA, and Full Finetunes. ☆82 Updated last year
- This repository contains code for cleaning your training data of benchmark data to help combat data snooping. ☆25 Updated last year
- A library for squeakily cleaning and filtering language datasets. ☆45 Updated last year
- Experiments with generating open-source language model assistants ☆97 Updated last year
- Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)* ☆81 Updated last year
- Submodule of evalverse forked from [google-research/instruction_following_eval](https://github.com/google-research/google-research/tree/m… ☆11 Updated 8 months ago
- Reimplementation of the task generation part from the Alpaca paper ☆119 Updated last year
- ☆40 Updated 8 months ago
- ☆24 Updated last year
- ☆153 Updated last year
- Supercharge huggingface transformers with model parallelism. ☆76 Updated 3 months ago
- QLoRA with Enhanced Multi GPU Support ☆36 Updated last year
- ☆64 Updated 2 years ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers. ☆34 Updated last month
- ☆91 Updated last year
- Repository containing awesome resources regarding Hugging Face tooling. ☆46 Updated last year
- inference code for mixtral-8x7b-32kseqlen ☆99 Updated last year
- Anh - LAION's multilingual assistant datasets and models ☆27 Updated last year
- Just a bunch of benchmark logs for different LLMs ☆116 Updated 5 months ago
- minimal pytorch implementation of bm25 (with sparse tensors) ☆97 Updated 10 months ago
- Lightweight tools for quick and easy LLM demos ☆26 Updated 3 months ago
- some common Huggingface transformers in maximal update parametrization (µP) ☆78 Updated 2 years ago
- Learning to Program with Natural Language ☆4 Updated last year
- Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA ☆101 Updated 5 months ago
- PageRank for LLMs ☆35 Updated this week
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts ☆24 Updated 10 months ago