CG80499 / trlx-with-T5
[Added T5 support to TRLX] A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
☆47 Updated 2 years ago
Alternatives and similar repositories for trlx-with-T5
Users who are interested in trlx-with-T5 are comparing it to the libraries listed below.
- Reimplementation of the task generation part from the Alpaca paper ☆118 Updated 2 years ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes. ☆81 Updated last year
- utilities for loading and running text embeddings with onnx ☆44 Updated 9 months ago
- ☆48 Updated last year
- Drive a browser with Cohere ☆71 Updated 2 years ago
- GPT-based Conversation Summarizer ☆148 Updated 2 years ago
- Simple embedding -> text model trained on a small subset of Wikipedia sentences. ☆151 Updated last year
- Demo of ConversationEntityMemory in Streamlit. ☆51 Updated 2 years ago
- Chat Markup Language conversation library ☆55 Updated last year
- ☆22 Updated last year
- A library for squeakily cleaning and filtering language datasets. ☆46 Updated last year
- The GeoV model is a large language model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).… ☆121 Updated 2 years ago
- ☆92 Updated last year
- Factored Cognition Primer: How to write compositional language model programs ☆49 Updated 2 years ago
- Experiments with generating opensource language model assistants ☆97 Updated 2 years ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection ☆99 Updated last year
- ☆94 Updated 5 months ago
- Sparse autoencoders for Contra text embedding models ☆25 Updated last year
- Production-grade embedding generation, for any length of text, for transformer models. ☆23 Updated 2 weeks ago
- ☆60 Updated last year
- an implementation of Self-Extend, to expand the context window via grouped attention ☆119 Updated last year
- Modified Stanford-Alpaca Trainer for Training Replit's Code Model ☆40 Updated 2 years ago
- Used for adaptive human in the loop evaluation of language and embedding models. ☆308 Updated 2 years ago
- ☆41 Updated 2 years ago
- Smol but mighty language model ☆61 Updated 2 years ago
- Command-line script for inferencing from models such as MPT-7B-Chat ☆100 Updated last year
- ☆38 Updated 10 months ago
- QLoRA with Enhanced Multi GPU Support ☆37 Updated last year
- ☆23 Updated last year
- This repository contains code for cleaning your training data of benchmark data to help combat data snooping. ☆25 Updated 2 years ago