LLMSQL / llmsql-benchmarkLinks
A Text2SQL benchmark for evaluation of Large Language Models
☆41Updated this week
Alternatives and similar repositories for llmsql-benchmark
Users that are interested in llmsql-benchmark are comparing it to the libraries listed below
Sorting:
- [NeurIPS ENLSP Workshop'24] CSKV: Training-Efficient Channel Shrinking for KV Cache in Long-Context Scenarios☆16Updated last year
- The code for ”T-GRAG: A Dynamic GraphRAG Framework for Resolving Temporal Conflicts and Redundancy in Knowledge Retrieval“☆17Updated 4 months ago
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆13Updated 5 months ago
- ☆24Updated 3 months ago
- ☆14Updated 10 months ago
- [ICLR 2025] Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization☆12Updated 10 months ago
- This repo contains code for the paper "Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM"☆16Updated last month
- An End-to-End Model with Adaptive Filtering for Retrieval-Augmented Generation☆16Updated last year
- Official Implementation of HIMA (COLM'25)☆17Updated 2 weeks ago
- Official Implementation of our paper "THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning".☆27Updated 2 months ago
- Control LLM☆20Updated 8 months ago
- The code for "MoPE: Mixture of Prefix Experts for Zero-Shot Dialogue State Tracking"☆18Updated 10 months ago
- ☆16Updated last year
- UNCAGE: Contrastive Attention Guidance for Masked Generative Transformers in Text-to-Image Generation☆17Updated 4 months ago
- Mixture-of-Basis-Experts for Compressing MoE-based LLMs☆23Updated 3 months ago
- CS194-196 Course Project☆14Updated 9 months ago
- [NeurIPS 2024] The official implementation of "Image Copy Detection for Diffusion Models"☆17Updated last year
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆17Updated 9 months ago
- ☆14Updated 11 months ago
- [AAAI 2026] ReCode: Reinforced Code Knowledge Editing for API Updates☆22Updated 5 months ago
- ToolBridge: An Open-Source Dataset to Equip LLMs with External Tool Capabilities☆14Updated 10 months ago
- [ICML'25] "Rethinking Addressing in Language Models via Contextualized Equivariant Positional Encoding" by Jiajun Zhu, Peihao Wang, Ruisi…☆14Updated 6 months ago
- [ICLR 2025] Weighted-Reward Preference Optimization for Implicit Model Fusion☆13Updated 8 months ago
- Plancraft is a minecraft environment and agent suite to test planning capabilities in LLMs☆21Updated last month
- ☆16Updated last year
- [ICLR'25] Official repository of paper: Ranking-aware adapter for text-driven image ordering with CLIP☆15Updated 7 months ago
- Codes for our paper "AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems"☆13Updated 11 months ago
- [ACM MM25] LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models☆22Updated 8 months ago
- Learning to Skip the Middle Layers of Transformers☆15Updated 4 months ago
- [ACL 2025] Analyzing LLMs' Multilingual Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations☆14Updated last month