bowen-upenn / llm_token_bias
[EMNLP 2024] A Peek into Token Bias: Large Language Models Are Not Yet Genuine Reasoners
☆19Updated 5 months ago
Alternatives and similar repositories for llm_token_bias
Users that are interested in llm_token_bias are comparing it to the libraries listed below
Sorting:
- Evaluate the Quality of Critique☆35Updated 11 months ago
- The source code for running LLMs on the AAAR-1.0 benchmark.☆16Updated last month
- Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective"☆32Updated last year
- AbstainQA, ACL 2024☆25Updated 7 months ago
- ☆41Updated last year
- ☆22Updated 4 months ago
- the instructions and demonstrations for building a formal logical reasoning capable GLM☆53Updated 8 months ago
- Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data☆35Updated 2 months ago
- ☆14Updated last year
- Code for the 2024 arXiv publication "Fine-Tuning with Divergent Chains of Thought Boosts Reasoning Through Self-Correction in Language Mo…☆24Updated 10 months ago
- ☆23Updated 11 months ago
- SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Model https://arxiv.org/pdf/2411.02433☆25Updated 5 months ago
- Personality Alignment of Language Models☆36Updated 2 months ago
- ☆35Updated last year
- GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.☆61Updated 10 months ago
- [EMNLP-2022 Findings] Code for paper “ProGen: Progressive Zero-shot Dataset Generation via In-context Feedback”.☆26Updated 2 years ago
- ☆22Updated last week
- Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models [NeurIPS 2024]☆66Updated 6 months ago
- ☆29Updated 4 months ago
- Official Repository of Are Your LLMs Capable of Stable Reasoning?☆25Updated last month
- [EMNLP 2023] Knowledge Rumination for Pre-trained Language Models☆17Updated last year
- ☆24Updated last month
- Restore safety in fine-tuned language models through task arithmetic☆28Updated last year
- ☆11Updated last year
- ✨ Resolving Knowledge Conflicts in Large Language Models, COLM 2024☆16Updated 7 months ago
- [arXiv preprint] Official Repository for "Evaluating Language Models as Synthetic Data Generators"☆33Updated 5 months ago
- Public code repo for COLING 2025 paper "Aligning LLMs with Individual Preferences via Interaction"☆26Updated last month
- Tasks for describing differences between text distributions.☆16Updated 9 months ago
- ☆22Updated 10 months ago
- ☆14Updated last year