bigscience-workshop / model_card
☆24Updated 2 years ago
Alternatives and similar repositories for model_card:
Users that are interested in model_card are comparing it to the libraries listed below
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…☆34Updated last year
- Hugging Face and Pyserini interoperability☆20Updated last year
- ☆24Updated last year
- Techniques used to run BLOOM at inference in parallel☆37Updated 2 years ago
- Embedding Recycling for Language models☆38Updated last year
- Advanced Reasoning Benchmark Dataset for LLMs☆45Updated last year
- ☆28Updated last year
- Python tools for processing the stackexchange data dumps into a text dataset for Language Models☆81Updated last year
- Code for Stage-wise Fine-tuning for Graph-to-Text Generation☆26Updated 2 years ago
- Multi-Domain Expert Learning☆67Updated last year
- ☆47Updated last year
- A library for squeakily cleaning and filtering language datasets.☆46Updated last year
- 🤗 Disaggregators: Curated data labelers for in-depth analysis.☆65Updated 2 years ago
- For experiments involving instruct gpt. Currently used for documenting open research questions.☆71Updated 2 years ago
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed☆34Updated last year
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆30Updated 6 months ago
- Developing tools to automatically analyze datasets☆74Updated 5 months ago
- Evaluation suite for large-scale language models.☆124Updated 3 years ago
- ☆33Updated 2 years ago
- Using short models to classify long texts☆21Updated 2 years ago
- Training and Inference Notebooks for the RedPajama (OpenLlama) models☆18Updated last year
- Consists of the largest (10K) human annotated code-switched semantic parsing dataset & 170K generated utterance using the CST5 augmentati…☆37Updated 2 years ago
- ZS4IE: A Toolkit for Zero-Shot Information Extraction with Simple Verbalizations☆26Updated 3 years ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Updated last year
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32Updated 10 months ago
- ☆52Updated 3 months ago
- Common crawl pretrained sentencepiece tokenizers for English and Japanese for various vocabulary sizes. Also development environment for …☆10Updated 3 years ago
- One stop shop for all things carp☆59Updated 2 years ago
- Code repo for "Model-Generated Pretraining Signals Improves Zero-Shot Generalization of Text-to-Text Transformers" (ACL 2023)☆22Updated last year
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models☆69Updated last year