hkgc-1 / GHPOView external linksLinks
β59Jul 21, 2025Updated 6 months ago
Alternatives and similar repositories for GHPO
Users that are interested in GHPO are comparing it to the libraries listed below
Sorting:
- Official implementation of Self-Taught Agentic Long Context Understanding (ACL 2025).β12Sep 22, 2025Updated 4 months ago
- πOfficial code of our AAAI26 paper πWebFilterβ36Nov 9, 2025Updated 3 months ago
- [ACL 2024 Findings] Learning Fine-Grained Grounded Citations for Attributed Large Language Modelsβ20Oct 24, 2024Updated last year
- An interactive thinking and deep reasoning model. It provides a cognitive reasoning paradigm for complex multi-hop problems.β78Nov 14, 2025Updated 3 months ago
- [ICLR24] AutoVP: An Automated Visual Prompting Framework and Benchmarkβ21Sep 18, 2025Updated 4 months ago
- β36Feb 2, 2026Updated 2 weeks ago
- Source code for GreaTer ICLR 2025 - Gradient Over Reasoning makes Smaller Language Models Strong Prompt Optimizersβ34Apr 18, 2025Updated 9 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.β34Mar 2, 2024Updated last year
- [ACL 2025] How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Trainingβ47Jul 18, 2025Updated 6 months ago
- β46Sep 27, 2025Updated 4 months ago
- Oak National Academy's AI Auto Eval tools provide LLM as a judge evaluation on lesson plans and resourcesβ17Nov 4, 2025Updated 3 months ago
- [NeurIPS 2025] Mind the Gap: Bridging Thought Leap for Improved CoT Tuning https://arxiv.org/abs/2505.14684β45Oct 20, 2025Updated 3 months ago
- Benchmark evaluating ocean forecasting systems against reference datasets and observations.β24Updated this week
- This project showcases engaging interactions between two AI chatbots.β10Jan 10, 2024Updated 2 years ago
- A generalized framework for subspace tuning methods in parameter efficient fine-tuning.β172Jan 29, 2026Updated 2 weeks ago
- Codes for Merging Large Language Modelsβ35Aug 7, 2024Updated last year
- The official implementation of "ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning Engineering"β56Jun 21, 2025Updated 7 months ago
- β10Sep 29, 2024Updated last year
- Building a multi-agent RAG system with advanced RAG methodsβ12Jan 12, 2025Updated last year
- β20Aug 8, 2025Updated 6 months ago
- This is the code of a agentic rag method with dynamic workflow.β13Jan 22, 2026Updated 3 weeks ago
- Source code for SWIFT, an efficient reward model.β18Jan 13, 2026Updated last month
- TOD-Flow: Modeling the Structure of Task-Oriented Dialoguesβ13Feb 7, 2024Updated 2 years ago
- Code for "Towards Robust k-Nearest-Neighbor Machine Translation" (EMNLP 2022)β12Oct 18, 2022Updated 3 years ago
- [ICML 2024 Spotlight] "Sample-specific Masks for Visual Reprogramming-based Prompting"β12Dec 20, 2024Updated last year
- [AAAI 2026] Official code for "Agent Journey Beyond RGB: Unveiling Hybrid Semantic-Spatial Environmental Representations for Vision-and-Lβ¦β14Nov 17, 2025Updated 3 months ago
- β12Mar 1, 2025Updated 11 months ago
- Aline: Agentic Git for Vibe Codersβ36Nov 26, 2025Updated 2 months ago
- VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selectionβ22May 31, 2025Updated 8 months ago
- Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoningβ14Jun 28, 2025Updated 7 months ago
- Implementation for EACL 2024 paper "Corpus-Steered Query Expansion with Large Language Models"β12Mar 19, 2024Updated last year
- MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer (EMNLP 2025)β11Apr 18, 2025Updated 9 months ago
- Surrogate Modeling of the Aerodynamic Performance for Transonic Regimeβ13Feb 12, 2024Updated 2 years ago
- The official implementation of our work SQLFixAgent: Towards Semantic-Accurate Text-to-SQL Parsing via Consistency-Enhanced Multi-Agent Cβ¦β23May 2, 2025Updated 9 months ago
- REBUS: A Robust Evaluation Benchmark of Understanding Symbolsβ13Aug 13, 2024Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMsβ11Jul 22, 2023Updated 2 years ago
- β30Sep 19, 2025Updated 4 months ago
- Modified Beam Search with periodical restartβ12Sep 12, 2024Updated last year
- Extended Inductive Reasoning for Personalized Preference Inference from Behavioral Signalsβ11Jan 8, 2026Updated last month