YebowenHu / SportsGenLinks
☆18Updated last year
Alternatives and similar repositories for SportsGen
Users that are interested in SportsGen are comparing it to the libraries listed below
Sorting:
- ☆22Updated 2 months ago
- About Official PyTorch implementation of "Query-Efficient Black-Box Red Teaming via Bayesian Optimization" (ACL'23)☆15Updated 2 years ago
- [ICLR 2023] Code for our paper "Selective Annotation Makes Language Models Better Few-Shot Learners"☆111Updated 2 years ago
- Self-Supervised Alignment with Mutual Information☆21Updated last year
- Reproduction of "RLCD Reinforcement Learning from Contrast Distillation for Language Model Alignment☆69Updated 2 years ago
- ☆20Updated last year
- The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism☆30Updated last year
- ☆13Updated 3 months ago
- Resources for Retrieval Augmentation for Commonsense Reasoning: A Unified Approach. EMNLP 2022.☆22Updated 2 years ago
- This repo contains code for our NeurIPS 2023 spotlight paper: Evaluating and Inducing Personality in Pre-trained Language Models☆55Updated last year
- This repo explores how AMR to address tasks difficult for LLMs☆13Updated last year
- Restore safety in fine-tuned language models through task arithmetic☆29Updated last year
- This repo is reproduction resources for linear alignment paper, still working☆16Updated last year
- Syntax Error-Free and Generalizable Tool Use for LLMs via Finite-State Decoding☆27Updated last year
- MetricEval: A framework that conceptualizes and operationalizes four main components of metric evaluation, in terms of reliability and va…☆11Updated last year
- Official implementation of Bootstrapping Language Models via DPO Implicit Rewards☆44Updated 6 months ago
- ☆29Updated last year
- This repository contains the code and data for the paper "VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception o…☆27Updated 3 months ago
- Evaluate the Quality of Critique☆36Updated last year
- ☆22Updated last year
- [KDD24-ADS] R-Eval: A Unified Toolkit for Evaluating Domain Knowledge of Retrieval Augmented Large Language Models☆12Updated last year
- ☆46Updated last year
- Code for Paper (Preserving Diversity in Supervised Fine-tuning of Large Language Models)☆41Updated 5 months ago
- Code for ProTrix: Building Models for Planning and Reasoning over Tables with Sentence Context☆18Updated 11 months ago
- ☆36Updated last year
- Directional Preference Alignment☆57Updated last year
- Vision Large Language Models trained on M3IT instruction tuning dataset☆17Updated 2 years ago
- ☆27Updated 2 years ago
- code for "Natural Language to Code Translation with Execution"☆41Updated 2 years ago
- Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant se…☆62Updated 2 years ago