tongjingqi / Code2LogicLinks
Code2Logic: Game-Code-Driven Data Synthesis for Enhancing VLMs General Reasoning
☆20Updated this week
Alternatives and similar repositories for Code2Logic
Users that are interested in Code2Logic are comparing it to the libraries listed below
Sorting:
- ☆58Updated 2 months ago
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains…☆221Updated this week
- Large Language Models(LLMs) of Code☆18Updated 2 years ago
- ☆141Updated last year
- ☆16Updated last year
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"☆128Updated last year
- ☆319Updated 10 months ago
- A collection for math word problem (MWP) works, including datasets, algorithms and so on.☆42Updated 11 months ago
- The related works and background techniques about Openai o1☆221Updated 5 months ago
- The official repository of "Whoever Started the Interference Should End It: Guiding Data-Free Model Merging via Task Vectors""☆15Updated last month
- [ACL-2024]Enhancing Noise Robustness of Retrieval-Augmented Language Models with Adaptive Adversarial Training☆28Updated 7 months ago
- ☆240Updated last week
- Real-time updated, fine-grained reading list on LLM-synthetic-data.🔥☆259Updated 4 months ago
- 中文大语言模型评测第二期☆70Updated last year
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…☆370Updated 9 months ago
- Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning☆164Updated last year
- LLM hallucination paper list☆316Updated last year
- [ACL'2024 Findings] GAOKAO-MM: A Chinese Human-Level Benchmark for Multimodal Models Evaluation☆60Updated last year
- 中文 Instruction tuning datasets☆131Updated last year
- ☆97Updated last year
- ☆25Updated 2 years ago
- ☆12Updated last year
- Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"☆368Updated 4 months ago
- Flames is a highly adversarial benchmark in Chinese for LLM's harmlessness evaluation developed by Shanghai AI Lab and Fudan NLP Group.☆50Updated last year
- [ACL 2024] MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues☆91Updated 10 months ago
- ☆12Updated 2 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆19Updated last year
- Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning☆531Updated last week
- An open-source conversational language model developed by the Knowledge Works Research Laboratory at Fudan University.☆65Updated last year
- This is the repository of the Ape210K dataset and baseline models.☆194Updated 5 years ago