hiyouga / transformers
π€ Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
β9Updated last month
Alternatives and similar repositories for transformers
Users that are interested in transformers are comparing it to the libraries listed below
Sorting:
- β15Updated last month
- [COLM 2024] Early Weight Averaging meets High Learning Rates for LLM Pre-trainingβ16Updated 7 months ago
- Code and data from the paper 'Human Feedback is not Gold Standard'β19Updated 10 months ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response formatβ27Updated last year
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuningβ34Updated last year
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Modelβ44Updated last year
- β45Updated 9 months ago
- β31Updated 6 months ago
- Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (ArXiv 20232401.01335)β29Updated last year
- β26Updated 10 months ago
- Companion code to https://arxiv.org/abs/2409.03797v2β10Updated 3 weeks ago
- β32Updated last year
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"β17Updated last year
- Verifiers for LLM Reinforcement Learningβ50Updated last month
- β27Updated 2 months ago
- β27Updated last month
- a tool for gerenate dataset from docβ12Updated last month
- [ICLR'25] "Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers"β18Updated last month
- Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofsβ35Updated last year
- Starbucks: Improved Training for 2D Matryoshka Embeddingsβ19Updated 3 months ago
- β17Updated last year
- β43Updated 3 months ago
- [EACL 2023] CoTEVer: Chain of Thought Prompting Annotation Toolkit for Explanation Verificationβ40Updated 2 years ago
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found heβ¦β31Updated last year
- Code and Dataset for Learning to Solve Complex Tasks by Talking to Agentsβ24Updated 2 years ago
- Measuring RAG solutions throughput and latencyβ17Updated 9 months ago
- β28Updated last year
- PyTorch code for System-1.x: Learning to Balance Fast and Slow Planning with Language Modelsβ22Updated 9 months ago
- An Implementation of "Orca: Progressive Learning from Complex Explanation Traces of GPT-4"β42Updated 7 months ago
- β37Updated this week