DOUDOU0314 / GPT-J-hf
GPT-jax based on the official huggingface library
☆13Updated 3 years ago
Alternatives and similar repositories for GPT-J-hf:
Users that are interested in GPT-J-hf are comparing it to the libraries listed below
- NLG Best Practices for Data-Efficient Modeling How to Train Production-Ready Models with Little Data☆10Updated 3 years ago
- URL downloader supporting checkpointing and continuous checksumming.☆19Updated last year
- ☆32Updated 2 years ago
- ☆15Updated 2 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 2 years ago
- Companion Repo for the Vision Language Modelling YouTube series - https://bit.ly/3PsbsC2 - by Prithivi Da. Open to PRs and collaborations☆14Updated 2 years ago
- Applying "Load What You Need: Smaller Versions of Multilingual BERT" to LaBSE☆18Updated 3 years ago
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing☆48Updated 3 years ago
- TorchServe+Streamlit for easily serving your HuggingFace NER models☆33Updated 2 years ago
- ☆11Updated 4 years ago
- Implementation of stop sequencer for Huggingface Transformers☆16Updated last year
- Ranking of fine-tuned HF models as base models.☆35Updated last year
- Few Shot Learning using EleutherAI's GPT-Neo an Open-source version of GPT-3☆18Updated 3 years ago
- ☆17Updated 2 years ago
- Using short models to classify long texts☆21Updated 2 years ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆23Updated 2 weeks ago
- Kaggle fashion dataset in dalle format☆13Updated 3 years ago
- I have created a dataset of Image-Text-Pairs by using the cosine similarity of the CLIP embeddings of the image & it's caption derrived f…☆15Updated 4 years ago
- Anh - LAION's multilingual assistant datasets and models☆27Updated 2 years ago
- ☆14Updated 7 months ago
- My explorations into editing the knowledge and memories of an attention network☆34Updated 2 years ago
- ☆37Updated last year
- Repo for "Smart Word Suggestions" (SWS) task and benchmark☆20Updated last year
- Megatron LM 11B on Huggingface Transformers☆27Updated 3 years ago
- exBERT on Transformers🤗☆10Updated 3 years ago
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP☆58Updated 2 years ago
- ☆11Updated 4 months ago
- ☆43Updated 2 years ago
- Calculating Expected Time for training LLM.☆38Updated 2 years ago
- A library for squeakily cleaning and filtering language datasets.☆47Updated last year