CG80499 / trlx-with-T5
[Added T5 support to TRLX] A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
☆47Updated 2 years ago
Alternatives and similar repositories for trlx-with-T5:
Users that are interested in trlx-with-T5 are comparing it to the libraries listed below
- ☆48Updated last year
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆82Updated last year
- ☆22Updated last year
- utilities for loading and running text embeddings with onnx☆44Updated 7 months ago
- GPT-based Conversation Summarizer☆148Updated last year
- Drive a browser with Cohere☆72Updated last year
- Chat Markup Language conversation library☆55Updated last year
- Factored Cognition Primer: How to write compositional language model programs☆48Updated 2 years ago
- A library for squeakily cleaning and filtering language datasets.☆46Updated last year
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆100Updated last year
- Experiments with generating opensource language model assistants☆97Updated last year
- Simple embedding -> text model trained on a small subset of Wikipedia sentences.☆153Updated last year
- ☆34Updated last year
- Modified Stanford-Alpaca Trainer for Training Replit's Code Model☆40Updated last year
- ☆60Updated last year
- GPU accelerated client-side embeddings for vector search, RAG etc.☆66Updated last year
- QLoRA for Masked Language Modeling☆21Updated last year
- ☆92Updated last year
- ☆24Updated last year
- Reimplementation of the task generation part from the Alpaca paper☆119Updated last year
- Completion After Prompt Probability. Make your LLM make a choice☆74Updated 4 months ago
- This repository contains code for cleaning your training data of benchmark data to help combat data snooping.☆25Updated last year
- QLoRA with Enhanced Multi GPU Support☆36Updated last year
- ☆34Updated last year
- Using modal.com to process FineWeb-edu data☆20Updated 2 weeks ago
- The GeoV model is a large langauge model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).…☆121Updated last year
- ☆32Updated last year
- ☆46Updated last year
- ☆131Updated last year
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆99Updated 11 months ago