LLMSQL / llmsql-benchmarkLinks
A Text2SQL benchmark for evaluation of Large Language Models
☆38Updated last week
Alternatives and similar repositories for llmsql-benchmark
Users that are interested in llmsql-benchmark are comparing it to the libraries listed below
Sorting:
- [NeurIPS ENLSP Workshop'24] CSKV: Training-Efficient Channel Shrinking for KV Cache in Long-Context Scenarios☆16Updated last year
- The code for ”T-GRAG: A Dynamic GraphRAG Framework for Resolving Temporal Conflicts and Redundancy in Knowledge Retrieval“☆12Updated 2 months ago
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆13Updated 4 months ago
- Official Implementation of HIMA (COLM'25)☆16Updated last month
- Official Implementation of our paper "THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning".☆27Updated last month
- UNCAGE: Contrastive Attention Guidance for Masked Generative Transformers in Text-to-Image Generation☆17Updated 2 months ago
- ☆24Updated 2 months ago
- ☆17Updated 4 months ago
- TARS: MinMax Token-Adaptive Preference Strategy for Hallucination Reduction in MLLMs☆23Updated last month
- [ICLR 2025] Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization☆12Updated 9 months ago
- ☆14Updated 9 months ago
- Mixture-of-Basis-Experts for Compressing MoE-based LLMs☆19Updated last month
- [ICLR 2025] Weighted-Reward Preference Optimization for Implicit Model Fusion☆13Updated 7 months ago
- Official implementation of Self-Taught Agentic Long Context Understanding (ACL 2025).☆10Updated last month
- [ICML'25] "Rethinking Addressing in Language Models via Contextualized Equivariant Positional Encoding" by Jiajun Zhu, Peihao Wang, Ruisi…☆13Updated 4 months ago
- ☆17Updated last year
- ☆14Updated 10 months ago
- Official Implementation of Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution☆34Updated 2 weeks ago
- ☆19Updated 3 months ago
- [ICLR'25] Official repository of paper: Ranking-aware adapter for text-driven image ordering with CLIP☆15Updated 6 months ago
- Code and resources for the NeurIPS 2025 Paper "BMMR: A Large-Scale Bilingual Multimodal Multi-Discipline Reasoning Dataset"☆14Updated 2 weeks ago
- Control LLM☆20Updated 6 months ago
- This repo contains code for the paper "Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM"☆16Updated last week
- Detecting Hallucinations in Large Language Model Generation: A Token Probability Approach. This repository includes the implementation of…☆16Updated last year
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆16Updated 7 months ago
- Learning to Skip the Middle Layers of Transformers☆15Updated 2 months ago
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Updated last year
- ☆25Updated 8 months ago
- ☆14Updated 11 months ago
- ☆12Updated 4 months ago