snap-stanford / starkLinks
(NeurIPS D&B 2024) STaRK: Benchmarking LLM Retrieval on Textual and Relational Knowledge Bases
☆327Updated last month
Alternatives and similar repositories for stark
Users that are interested in stark are comparing it to the libraries listed below
Sorting:
- (NeurIPS 2024) AvaTaR: Optimizing LLM Agents for Tool Usage via Contrastive Reasoning☆235Updated 7 months ago
- ☆55Updated last year
- GraphRAG-Bench, the official repo of comprehensive benchmark and dataset for evaluating GraphRAG models.☆336Updated this week
- [KDD 2024]this is project for training explicit graph reasoning large language models.☆101Updated last year
- [EMNLP2025] "GraphAgent: Agentic Graph Language Assistant"☆337Updated 11 months ago
- Code and Checkpoints for "Generate rather than Retrieve: Large Language Models are Strong Context Generators" in ICLR 2023.☆290Updated 3 years ago
- [NeurIPS'24] Knowledge Graph Fine-Tuning using LLMs☆130Updated 8 months ago
- [ACL 24 main] Large Language Models Can Learn Temporal Reasoning☆64Updated last year
- TxBKG - Knowledge Graph Generation for Any PDFs☆188Updated last year
- "AnyGraph: Graph Foundation Model in the Wild"☆220Updated last year
- [ICLR'24] Enhancing Healthcare Predictions with Personalized Knowledge Graphs☆196Updated 11 months ago
- [EMNLP'2024] "XRec: Large Language Models for Explainable Recommendation"☆168Updated last year
- [ACL2025] "RecLM: Recommendation Instruction Tuning"☆108Updated 8 months ago
- Benchmarking LLMs via Uncertainty Quantification☆255Updated 2 years ago
- A curated list of awesome leaderboard-oriented resources for AI domain☆307Updated this week
- Explore concepts like Self-Correct, Self-Refine, Self-Improve, Self-Contradict, Self-Play, and Self-Knowledge, alongside o1-like reasonin…☆171Updated last year
- A library for generating difficulty-scalable, multi-tool, and verifiable agentic tasks with execution trajectories.☆177Updated 6 months ago
- Grimoire is All You Need for Enhancing Large Language Models☆117Updated last year
- [NeurIPS 2025] ClinicalLab: Aligning Agents for Multi-Departmental Clinical Diagnostics in the Real World☆125Updated last year
- (ACL 2025 Main) A Comprehensive Benchmark for Code Information Retrieval.☆147Updated 7 months ago
- This includes the original implementation of CtrlA: Adaptive Retrieval-Augmented Generation via Inherent Control.☆65Updated last year
- (ICML'25 Outstanding) CollabLLM: From Passive Responders to Active Collaborators☆275Updated 4 months ago
- [EMNLP'2024] "OpenGraph: Towards Open Graph Foundation Models"☆322Updated last year
- [EMNLP2025] From Automation to Autonomy: A Survey on Large Language Models in Scientific Discovery☆296Updated 3 months ago
- This repository contains the implementation of AutoSchemaKG, a novel framework for automatic knowledge graph construction that combines s…☆681Updated 3 weeks ago
- [SIGIR'2024] "GraphGPT: Graph Instruction Tuning for Large Language Models"☆805Updated last year
- [ACL 2024] User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA, etc.☆180Updated 7 months ago
- An interpretable large language model (LLM) for medical diagnosis.☆158Updated last year
- ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level Code (https://arxiv.org/abs/2311.098…☆317Updated 6 months ago
- Code and dataset of CodeSteer☆88Updated 10 months ago