Dataset and benchmark for assessing LLMs in translating natural language descriptions of planning problems into PDDL
☆65Oct 16, 2024Updated last year
Alternatives and similar repositories for planetarium
Users that are interested in planetarium are comparing it to the libraries listed below
Sorting:
- A system for prompted weak supervision. Alfred is a powerful tool that leverages large language models to accelerate data annotation.☆58Apr 3, 2025Updated 11 months ago
- Extended Few-Shot Learning: Exploiting Existing Resources for Novel Tasks☆11Jul 6, 2021Updated 4 years ago
- QLoRA for Masked Language Modeling☆23Sep 11, 2023Updated 2 years ago
- ☆11Aug 29, 2022Updated 3 years ago
- Follow-Up Differential Descriptions: Language Models Resolve Ambiguities for Image Classification☆11Nov 15, 2023Updated 2 years ago
- The AIPlan4EU Unified Planning Library☆294Updated this week
- Code to reproduce the experiments in the paper: Does CLIP Bind Concepts? Probing Compositionality in Large Image Models.☆16Oct 14, 2023Updated 2 years ago
- ☆15Oct 4, 2024Updated last year
- Using Gradio interface to build UI for converting text to speech☆13Jan 26, 2021Updated 5 years ago
- ☆26Aug 31, 2023Updated 2 years ago
- Library for LLM-driven action model acquisition via natural language☆48Feb 18, 2026Updated 2 weeks ago
- ☆19Aug 4, 2025Updated 7 months ago
- ☆21Dec 19, 2025Updated 2 months ago
- A weak supervision framework for (partial) labeling functions☆16Jul 15, 2024Updated last year
- Fork of Flame repo for training of some new stuff in development☆19Updated this week
- ☆18May 19, 2023Updated 2 years ago
- IntructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our foc…☆32Jun 13, 2024Updated last year
- Crosslingual Reasoning through Test-Time Scaling☆19May 13, 2025Updated 9 months ago
- Code and data for paper "(How) do Language Models Track State?"☆20Mar 31, 2025Updated 11 months ago
- If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions☆17Apr 4, 2024Updated last year
- ☆19Jul 24, 2025Updated 7 months ago
- Compiler-R1: Towards Agentic Compiler Auto-tuning with Reinforcement Learning☆28Jul 14, 2025Updated 7 months ago
- Learning to compose soft prompts for compositional zero-shot learning.☆94Sep 13, 2025Updated 5 months ago
- ☆25May 7, 2025Updated 9 months ago
- BIBench:数据分析领域LLM评测基准☆22Mar 2, 2024Updated 2 years ago
- The official repository of NeurIPS'25 paper "Ada-R1: From Long-Cot to Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization"☆22Nov 9, 2025Updated 3 months ago
- My personal web page☆11Feb 17, 2026Updated 2 weeks ago
- [JAG'26] SpatialLLM: From Multi-modality Data to Urban Spatial Intelligence☆59Jan 8, 2026Updated last month
- ☆451Sep 27, 2023Updated 2 years ago
- Code repo for RobotFingerPrint paper☆27Jun 6, 2025Updated 8 months ago
- ☆47Oct 2, 2025Updated 5 months ago
- Unlock level without hassle in Candy Crush Saga☆22Sep 5, 2017Updated 8 years ago
- Code and Data for the NAACL 24 paper: MacGyver: Are Large Language Models Creative Problem Solvers?☆30Mar 26, 2024Updated last year
- Program and links to the material for the GloBIAS Training School 2025, Kobe, Japan.☆22Oct 27, 2025Updated 4 months ago
- [ICLR 2026] RPG: KL-Regularized Policy Gradient (https://arxiv.org/abs/2505.17508)☆64Feb 19, 2026Updated 2 weeks ago
- Learning from preferences is a common paradigm for fine-tuning language models. Yet, many algorithmic design decisions come into play. Ou…☆32Apr 20, 2024Updated last year
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.☆203Jul 17, 2024Updated last year
- A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks☆36Oct 31, 2024Updated last year
- ☆33Jan 7, 2025Updated last year