Dahoas / gpt-neox-finetuning
☆16Updated 3 years ago
Alternatives and similar repositories for gpt-neox-finetuning:
Users that are interested in gpt-neox-finetuning are comparing it to the libraries listed below
- Completion After Prompt Probability. Make your LLM make a choice☆75Updated 4 months ago
- Tool to apply Legal Matter Specification Standard (LMSS) to documents☆13Updated 7 months ago
- ☆24Updated last year
- Datasets collection and preprocessings framework for NLP extreme multitask learning☆176Updated 2 months ago
- A library for squeakily cleaning and filtering language datasets.☆46Updated last year
- Explore the use of DSPy for extracting features from PDFs 🔎☆39Updated last year
- Training & Implementation of chatbots leveraging GPT-like architecture with the aitextgen package to enable dynamic conversations.☆49Updated 2 years ago
- ☆22Updated 3 years ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆76Updated 5 months ago
- Multi-Domain Expert Learning☆67Updated last year
- ✅ Pytest-style test runner for langchain projects☆25Updated 2 years ago
- Developing tools to automatically analyze datasets☆74Updated 5 months ago
- Efficient few-shot learning with cross-encoders.☆50Updated last year
- ☆37Updated last year
- Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.☆27Updated last year
- QLoRA with Enhanced Multi GPU Support☆36Updated last year
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆73Updated 5 months ago
- Reimplementation of the task generation part from the Alpaca paper☆119Updated last year
- Python tools for processing the stackexchange data dumps into a text dataset for Language Models☆81Updated last year
- Exploring finetuning public checkpoints on filter 8K sequences on Pile☆115Updated 2 years ago
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…☆166Updated last year
- ☆30Updated last year
- ☆43Updated last month
- Tools for managing datasets for governance and training.☆83Updated last month
- ☆17Updated 10 months ago
- Domain-Specific Text Generation for Machine Translation (with LLMs) - scripts and config files for the paper☆15Updated last year
- 🤝 Trade any tensors over the network☆30Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆67Updated 4 months ago
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."☆63Updated last year
- An OpenAI Completions API compatible server for NLP transformers models☆64Updated last year