BRZ911 / Wrong-of-Thought
Wrong-of-Thought: An Integrated Reasoning Framework with Multi-Perspective Verification and Wrong Information (WoT)
☆13 Updated 7 months ago
Alternatives and similar repositories for Wrong-of-Thought
Users who are interested in Wrong-of-Thought are comparing it to the repositories listed below.
- ☆95 Updated last month
- The reinforcement learning codes for dataset SPA-VL ☆32 Updated 10 months ago
- ☆48 Updated 11 months ago
- Accepted by ECCV 2024 ☆129 Updated 7 months ago
- ☆117 Updated 8 months ago
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains… ☆209 Updated 2 weeks ago
- ☆32 Updated 7 months ago
- Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning" ☆113 Updated last week
- A versatile toolkit for applying Logit Lens to modern large language models (LLMs). Currently supports Llama-3.1-8B and Qwen-2.5-7B, enab… ☆80 Updated 2 months ago
- Code for ACL 2024 paper: PrivLM-Bench: A Multi-level Privacy Evaluation Benchmark for Language Models. ☆11 Updated 3 months ago
- ☆23 Updated 6 months ago
- Official repository for paper: O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning ☆74 Updated 2 months ago
- Chain of Thoughts (CoT) is so hot! so long! We need short reasoning process! ☆52 Updated last month
- ☆73 Updated 11 months ago
- This repository will continuously update the latest papers, technical reports, benchmarks about multimodal reasoning! ☆38 Updated last month
- up-to-date curated list of state-of-the-art Large vision language models hallucinations research work, papers & resources ☆125 Updated this week
- [ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation ☆74 Updated 5 months ago
- This is the first released survey paper on hallucinations of large vision-language models (LVLMs). To keep track of this field and contin… ☆68 Updated 9 months ago
- Latest Advances on Long Chain-of-Thought Reasoning ☆298 Updated last month
- This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vi… ☆104 Updated 7 months ago
- [ACL'24] A Knowledge-grounded Interactive Evaluation Framework for Large Language Models ☆36 Updated 9 months ago
- ☆26 Updated 6 months ago
- Language Imbalance Driven Rewarding for Multilingual Self-improving ☆17 Updated 6 months ago
- A curated list of personalized alignment resources (continually updated). ☆21 Updated this week
- 【NeurIPS 2024】The implementation of LIVE: Learnable In-Context Vector for Visual Question Answering https://arxiv.org/abs/2406.13185 ☆17 Updated 4 months ago
- Data and Code for Paper VLSBench: Unveiling Visual Leakage in Multimodal Safety ☆37 Updated 2 months ago
- Accepted by IJCAI-24 Survey Track ☆202 Updated 8 months ago
- ☆47 Updated 5 months ago
- 【ACL 2024】 SALAD benchmark & MD-Judge ☆145 Updated 2 months ago
- [EMNLP 2024 Main] Official implementation of the paper "Unveiling In-Context Learning: A Coordinate System to Understand Its Working Mech… ☆17 Updated 7 months ago