bigscience-workshop / model_cardLinks
ā25Updated 3 years ago
Alternatives and similar repositories for model_card
Users that are interested in model_card are comparing it to the libraries listed below
Sorting:
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Modelsā70Updated 2 years ago
- š¤ Disaggregators: Curated data labelers for in-depth analysis.ā67Updated 2 years ago
- Supervised instruction finetuning for LLM with HF trainer and Deepspeedā36Updated 2 years ago
- ChatGPT Participates in a Computer Science Exam (2023)ā31Updated 2 years ago
- Evaluation suite for large-scale language models.ā129Updated 4 years ago
- Developing tools to automatically analyze datasetsā75Updated last year
- Deploy your HPC Cluster on AWS in 20min. with just 1-Click.ā54Updated 2 months ago
- Hugging Face and Pyserini interoperabilityā19Updated 2 years ago
- For experiments involving instruct gpt. Currently used for documenting open research questions.ā71Updated 3 years ago
- Finetuning InstructLLaMA on consumer hardware (copy from https://github.com/tloen/alpaca-lora)ā11Updated 2 years ago
- ā128Updated 2 years ago
- Python tools for processing the stackexchange data dumps into a text dataset for Language Modelsā86Updated 2 years ago
- A library for squeakily cleaning and filtering language datasets.ā49Updated 2 years ago
- Used for adaptive human in the loop evaluation of language and embedding models.ā308Updated 2 years ago
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning Pā¦ā35Updated 2 years ago
- Smol but mighty language modelā63Updated 2 years ago
- ā22Updated 2 years ago
- ā172Updated 10 months ago
- Blazing fast training of š¤ Transformers on Graphcore IPUsā86Updated last year
- Leverage your LangChain trace data for fine tuningā46Updated last year
- Repository for analysis and experiments in the BigCode project.ā128Updated last year
- API Client for paperswithcode.comā189Updated last year
- ā17Updated 2 years ago
- ā92Updated 3 years ago
- DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.ā171Updated 3 months ago
- Multi-Domain Expert Learningā67Updated last year
- ā23Updated 2 years ago
- ā101Updated last month
- Like picoGPT but for BERT.ā51Updated 2 years ago
- Stuff related to scraping the Code Review StackExchangeā12Updated 2 years ago