RenShuhuai-Andy / my-tools
my commonly-used tools
☆48 Updated last week
Alternatives and similar repositories for my-tools:
Users who are interested in my-tools are comparing it to the repositories listed below
- M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning ☆49 Updated 3 weeks ago
- Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations" ☆62 Updated 10 months ago
- ☆56 Updated 4 months ago
- ☆57 Updated 7 months ago
- A Survey on the Honesty of Large Language Models ☆51 Updated last month
- [NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$ ☆38 Updated 2 months ago
- This is the repo for our paper "Mr-Ben: A Comprehensive Meta-Reasoning Benchmark for Large Language Models" ☆44 Updated 2 months ago
- [EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models. ☆55 Updated 2 months ago
- Feeling confused about super alignment? Here is a reading list ☆42 Updated last year
- ☆60 Updated 2 years ago
- One Network, Many Masks: Towards More Parameter-Efficient Transfer Learning ☆38 Updated last year
- Released code for our ICLR23 paper. ☆63 Updated last year
- Visual and Embodied Concepts evaluation benchmark ☆21 Updated last year
- ☆16 Updated last year
- Evaluating the Ripple Effects of Knowledge Editing in Language Models ☆53 Updated 9 months ago
- [AAAI 2025 oral] Evaluating Mathematical Reasoning Beyond Accuracy ☆44 Updated last month
- The official code repository for PRMBench. ☆56 Updated this week
- 🍼 Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts ☆35 Updated 3 months ago
- [NeurIPS'24] Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models ☆55 Updated last month
- Code and data for "Timo: Towards Better Temporal Reasoning for Language Models" (COLM 2024) ☆19 Updated 2 months ago
- Findings of EMNLP 2023: InfoCL: Alleviating Catastrophic Forgetting in Continual Text Classification from An Information Theoretic Perspe… ☆14 Updated 5 months ago
- TRACE: A Comprehensive Benchmark for Continual Learning in Large Language Models ☆63 Updated 11 months ago
- [ICML'2024] Can AI Assistants Know What They Don't Know? ☆76 Updated 11 months ago
- [ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning ☆20 Updated 10 months ago
- The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism ☆26 Updated 6 months ago
- [NeurIPS 2023] Github repository for "Composing Parameter-Efficient Modules with Arithmetic Operations" ☆58 Updated last year
- This is the implementation of LeCo ☆30 Updated 6 months ago
- [ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Models ☆53 Updated 5 months ago
- Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement (EMNLP 2024 Main Conference) ☆49 Updated 3 months ago
- ☆16 Updated last month