johnrobinsn / redpajama
Training and Inference Notebooks for the RedPajama (OpenLlama) models
☆19Updated 2 years ago
Alternatives and similar repositories for redpajama
Users who are interested in redpajama are comparing it to the libraries listed below
- Check for data drift between two OpenAI multi-turn chat jsonl files.☆37Updated last year
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…☆34Updated 2 years ago
- ☆94Updated last year
- NLP Examples using the 🤗 libraries☆40Updated 4 years ago
- Functional local implementations of main model parallelism approaches☆96Updated 2 years ago
- Seahorse is a dataset for multilingual, multi-faceted summarization evaluation. It consists of 96K summaries with human ratings along 6 q…☆89Updated last year
- ☆23Updated 2 years ago
- Code for NeurIPS LLM Efficiency Challenge☆59Updated last year
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines☆197Updated last year
- A library for squeakily cleaning and filtering language datasets.☆47Updated 2 years ago
- experiments with inference on llama☆104Updated last year
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed☆35Updated 2 years ago
- Reimplementation of the task generation part from the Alpaca paper☆119Updated 2 years ago
- Experiments with generating opensource language model assistants☆97Updated 2 years ago
- Codes, scripts, and notebooks on various aspects of transformer models.☆27Updated 2 years ago
- ☆13Updated 2 years ago
- A diff tool for language models☆44Updated last year
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te…☆42Updated last year
- ☆34Updated 2 years ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆33Updated this week
- Helper scripts and notes that were used while porting various nlp models☆47Updated 3 years ago
- Text to Python Objects via a LLM Function Call☆58Updated last year
- ☆88Updated last year
- A set of scripts and notebooks on LLM fine-tuning and dataset creation☆110Updated 11 months ago
- Chunk your text using gpt4o-mini more accurately☆44Updated last year
- Gzip and nearest neighbors for text classification☆57Updated 2 years ago
- ☆46Updated 2 years ago
- Evaluation suite for large-scale language models.☆128Updated 4 years ago
- Used for adaptive human in the loop evaluation of language and embedding models.☆311Updated 2 years ago
- Ranking of fine-tuned HF models as base models.☆36Updated 4 months ago