bird-bench / livesqlbench
☆111Updated 2 months ago
Alternatives and similar repositories for livesqlbench
Users that are interested in livesqlbench are comparing it to the libraries listed below
- SQL-o1: A Self-Reward Heuristic Dynamic Search Method for Text-to-SQL☆198Updated 8 months ago
- ☆44Updated last year
- Code of Journey to the Center of the Knowledge Neurons: Discoveries of Language-Independent Knowledge Neurons and Degenerate Knowledge Ne…☆28Updated last year
- [EMNLP 2024 Findings] Official PyTorch Implementation of "Adaptive Contrastive Search: Uncertainty-Guided Decoding for Open-Ended Text Ge…☆41Updated 11 months ago
- Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward☆42Updated 2 months ago
- [ACL 25 main] Deliberate Reasoning in Language Models as Structure-Aware Planning with an Accurate World Model☆42Updated 2 months ago
- [EMNLP 2024] DA-Code: Agent Data Science Code Generation Benchmark for Large Language Models☆90Updated 3 weeks ago
- [NeurIPS 2025] DanmakuTPPBench: A Multi-modal Benchmark for Temporal Point Process Modeling and Understanding☆74Updated 4 months ago
- [NeurIPS 25 @ ER] Long-Context Modeling with Dynamic Hierarchical Sparse Attention for On-Device LLMs☆73Updated 2 months ago
- Official Implementation of "Pay Attention to What You Need"☆44Updated 11 months ago
- ☆53Updated last year
- The official code repo for "Safe Delta: Consistently Preserving Safety when Fine-Tuning LLMs on Diverse Datasets" in ICML 2025.☆57Updated 7 months ago
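- This project aims to build on representative work by previous researchers to evaluate SFT data along multiple dimensions and to automatically filter SFT data.☆44Updated last year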
- [NeurIPS'24] LLM4EA: Entity Alignment with Noisy Annotations from Large Language Models☆62Updated 3 months ago
- StrategyLLM: Large Language Models as Strategy Generators, Executors, Optimizers, and Evaluators for Problem Solving☆21Updated last year
- [ACL 2025 main] SCAR: Data Selection via Style Consistency-Aware Response Ranking for Efficient Instruction-Tuning of Large Language Mode…☆39Updated 5 months ago
- Official Code of Logits-Based-Finetuning☆91Updated 7 months ago
- ☆48Updated last year
- [ACL'23] Code for "SANTA: Separate Strategies for Inaccurate and Incomplete Annotation Noise in Distantly-Supervised Named Entity Recogni…☆39Updated 9 months ago
- An Extensible Framework for Retrieval-Augmented LLM Applications: Learning Relevance Beyond Simple Similarity.☆41Updated last year
- ☆165Updated 2 months ago
- ☆62Updated last year
- ☆73Updated 2 years ago
- [COLING Demos 2025] an Easy-to-use Tool for Comprehensive Response Evaluation of LLMs☆38Updated 11 months ago
- Typeless Programming Language `sicpy` and Compiler☆32Updated 2 years ago
- ☆53Updated 5 months ago
- [ACL2024 Findings] Towards Better Question Generation in QA-based Event Extraction☆48Updated 3 weeks ago
- We leverage 14 datasets as OOD test data and conduct evaluations on 8 NLU tasks over 21 popularly used models. Our findings confirm that …☆93Updated 2 years ago
- ACL 2024☆35Updated 6 months ago
- Official dataset link for "Reasoning in Dialog: Improving Response Generation by Context Reading Comprehension"☆22Updated 4 years ago