hiyouga / transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
☆11 · Updated this week
Alternatives and similar repositories for transformers
Users who are interested in transformers are comparing it to the libraries listed below.
- Official code repository for the paper "ToMAP: Training Opponent-Aware LLM Persuaders with Theory of Mind" ☆14 · Updated 3 months ago
- Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models" ☆40 · Updated 9 months ago
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he… ☆31 · Updated 2 years ago
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model ☆44 · Updated last year
- [NeurIPS 2023 Main Track] This is the repository for the paper titled "Don't Stop Pretraining? Make Prompt-based Fine-tuning Powerful Lea… ☆74 · Updated last year
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer ☆43 · Updated last year
- ☆16 · Updated 4 months ago
- ☆20 · Updated 4 months ago
- ☆29 · Updated last year
- ☆22 · Updated 7 months ago
- ☆54 · Updated 9 months ago
- Benchmarks for Business Document Foundation Models ☆10 · Updated last year
- Code for the EMNLP'24 paper "Learning to Extract Structured Entities Using Language Models" ☆42 · Updated 4 months ago
- My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't rel… ☆12 · Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆60 · Updated last year
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la… ☆49 · Updated last year
- ☆49 · Updated 11 months ago
- Code and data from the paper 'Human Feedback is not Gold Standard' ☆19 · Updated last year
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning ☆35 · Updated 2 years ago
- Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO… ☆56 · Updated 2 weeks ago
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant … ☆15 · Updated last year
- The open source implementation of "Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers" ☆19 · Updated last year
- Lottery Ticket Adaptation ☆39 · Updated 9 months ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format ☆27 · Updated 2 years ago
- Verifiers for LLM Reinforcement Learning ☆71 · Updated 4 months ago
- Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations. ☆12 · Updated last year
- Measuring and Controlling Persona Drift in Language Model Dialogs ☆17 · Updated last year
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an… ☆14 · Updated last week
- Code and Dataset for Learning to Solve Complex Tasks by Talking to Agents ☆24 · Updated 3 years ago
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024] ☆85 · Updated 7 months ago