huggingface / rlhf-interface
★33 · Updated 2 years ago
Alternatives and similar repositories for rlhf-interface:
Users who are interested in rlhf-interface compare it to the libraries listed below.
- 🤗 Disaggregators: Curated data labelers for in-depth analysis. ★65 · Updated 2 years ago
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P… ★34 · Updated last year
- ★32 · Updated last year
- Experiments with generating open-source language model assistants ★97 · Updated last year
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any Hugging Face text dataset. ★93 · Updated 2 years ago
- QLoRA with Enhanced Multi GPU Support ★36 · Updated last year
- ★65 · Updated 2 years ago
- For experiments involving InstructGPT. Currently used for documenting open research questions. ★71 · Updated 2 years ago
- A starter kit for evaluating benchmarks on the 🤗 Hub ★14 · Updated last year
- Using short models to classify long texts ★21 · Updated last year
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi… ★32 · Updated 8 months ago
- ★24 · Updated last year
- Techniques used to run BLOOM at inference in parallel ★37 · Updated 2 years ago
- Scripts to convert datasets from various sources to Hugging Face Datasets. ★58 · Updated 2 years ago
- My explorations into editing the knowledge and memories of an attention network ★34 · Updated 2 years ago
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial) ★17 · Updated 11 months ago
- a pipeline for using api calls to agnostically convert unstructured data into structured training data ★29 · Updated 4 months ago
- A library for squeakily cleaning and filtering language datasets. ★46 · Updated last year
- Serialize JAX, Flax, Haiku, or Objax model params with 🤗 `safetensors` ★44 · Updated 8 months ago
- Embedding Recycling for Language models ★38 · Updated last year
- Exploring finetuning public checkpoints on filter 8K sequences on Pile ★115 · Updated last year
- ★34 · Updated last year
- [WIP] A 🔥 interface for running code in the cloud ★86 · Updated last year
- A diff tool for language models ★42 · Updated last year
- ★34 · Updated last year
- ★42 · Updated 2 years ago
- Streamlit demo app to demonstrate the features of transformers interpret with multiple models. ★25 · Updated 3 years ago
- ★13 · Updated last year
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts ★24 · Updated 11 months ago
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer ★40 · Updated 10 months ago