philschmid / fine-tune-GPT-2
☆21Updated 4 years ago
Alternatives and similar repositories for fine-tune-GPT-2:
Users that are interested in fine-tune-GPT-2 are comparing it to the libraries listed below
- Using short models to classify long texts☆21Updated 2 years ago
- ☆32Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆34Updated 3 months ago
- Tools for merging pretrained large language models.☆19Updated 9 months ago
- ☆24Updated last year
- TorchServe+Streamlit for easily serving your HuggingFace NER models☆32Updated 2 years ago
- ☆37Updated last year
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod…☆15Updated last year
- ☆28Updated last year
- 🤝 Trade any tensors over the network☆30Updated last year
- All my experiments with the various transformers and various transformer frameworks available☆14Updated 3 years ago
- Writing Blog Posts with Generative Feedback Loops!☆47Updated last year
- A library for squeakily cleaning and filtering language datasets.☆46Updated last year
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆73Updated 5 months ago
- Training and Inference Notebooks for the RedPajama (OpenLlama) models☆18Updated last year
- ☆31Updated 2 years ago
- A gzip-based text-classification system.☆32Updated last year
- Goldfish: Monolingual language models for 350 languages.☆15Updated 7 months ago
- GPT-jax based on the official huggingface library☆13Updated 3 years ago
- 🐜🔧 A minimalistic tool to fine-tune your LLMs☆18Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 8 months ago
- ☆15Updated last year
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed☆34Updated last year
- ☆31Updated last year
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP☆58Updated 2 years ago
- ☆13Updated 2 years ago
- Consists of the largest (10K) human annotated code-switched semantic parsing dataset & 170K generated utterance using the CST5 augmentati…☆37Updated 2 years ago
- Streamlit demo app to demonstrate the features of transformers interpret with multiple models.☆25Updated 3 years ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆59Updated last year
- A series of notebooks demonstrating how to build simple NLP web apps with Gradio and Hugging Face transformers☆45Updated 3 years ago