JetBrains-Research / lca-baselines
Baselines for all tasks from Long Code Arena benchmarks
⭐39 · Mar 30, 2025 · Updated 10 months ago
Alternatives and similar repositories for lca-baselines
Users who are interested in lca-baselines are comparing it to the libraries listed below.
- ⭐12 · Mar 5, 2025 · Updated 11 months ago
- ⭐32 · Jan 25, 2026 · Updated 2 weeks ago
- A Manually-Annotated Code Generation Benchmark Aligned with Real-World Code Repositories ⭐36 · Sep 4, 2024 · Updated last year
- ⭐16 · Nov 26, 2024 · Updated last year
- ⭐17 · Jun 12, 2024 · Updated last year
- A Python library for processing and filtering TabLib ⭐13 · Aug 24, 2024 · Updated last year
- ✨ RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems - ICLR 2024 ⭐186 · Aug 16, 2024 · Updated last year
- A Tool for Mining Rich Abstract Syntax Trees from Code ⭐61 · Oct 24, 2025 · Updated 3 months ago
- ⭐21 · Apr 2, 2025 · Updated 10 months ago
- ⭐44 · Jun 24, 2025 · Updated 7 months ago
- ⭐49 · Apr 4, 2025 · Updated 10 months ago
- This is the official implementation for the paper 'Domain Adaptive Code Completion via Language Models and Decoupled Domain Databases' ⭐14 · Oct 4, 2023 · Updated 2 years ago
- Contains the code and data for our #ICSE2022 paper titled as "CodeFill: Multi-token Code Completion by Jointly Learning from Structure an… ⭐15 · May 18, 2022 · Updated 3 years ago
- [ACM MM25] LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models ⭐23 · Mar 29, 2025 · Updated 10 months ago
- Code for "[COLM'25] RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing" ⭐22 · Mar 18, 2025 · Updated 10 months ago
- ⭐19 · Mar 10, 2025 · Updated 11 months ago
- Official implementation for the paper, StackEval: Benchmarking LLMs in Coding Assistance, https://arxiv.org/abs/2412.05288 ⭐20 · Oct 30, 2024 · Updated last year
- a survey on deep research ⭐47 · Sep 9, 2025 · Updated 5 months ago
- Tree-sitter custom toolkit for extracting function and class from raw source file ⭐51 · Jul 1, 2024 · Updated last year
- Suri: Multi-constraint instruction following for long-form text generation (EMNLP'24) ⭐27 · Oct 3, 2025 · Updated 4 months ago
- ⭐28 · Nov 10, 2025 · Updated 3 months ago
- This repo contains evaluation code for the paper "AV-Odyssey: Can Your Multimodal LLMs Really Understand Audio-Visual Information?" ⭐31 · Dec 23, 2024 · Updated last year
- Evaluation of source authorship attribution tool ⭐23 · Jun 5, 2021 · Updated 4 years ago
- TAT-DQA: Towards Complex Document Understanding By Discrete Reasoning ⭐23 · Sep 17, 2024 · Updated last year
- ⭐31 · Jun 12, 2024 · Updated last year
- [NeurIPS 2025] Official implementation of "Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning" ⭐30 · Oct 20, 2025 · Updated 3 months ago
- A Text2SQL benchmark for evaluation of Large Language Models ⭐41 · Updated this week
- ⭐33 · Feb 2, 2026 · Updated last week
- Unleashing the Power of Cognitive Dynamics on Large Language Models ⭐63 · Sep 24, 2024 · Updated last year
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w… ⭐13 · Jun 28, 2025 · Updated 7 months ago
- Contrast-guided Feature Adjustment Module for Visual Information Extraction ⭐30 · May 23, 2023 · Updated 2 years ago
- [ICLR'25 Oral] MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models ⭐35 · Nov 3, 2024 · Updated last year
- Official Repo for the paper: VCR: Visual Caption Restoration. Check arxiv.org/pdf/2406.06462 for details. ⭐32 · Feb 26, 2025 · Updated 11 months ago
- A Lightweight Visual Reasoning Benchmark for Evaluating Large Multimodal Models through Complex Diagrams in Coding Tasks ⭐14 · Feb 25, 2025 · Updated 11 months ago
- A repository of code examples to accompany the LSU CSC7809/7700/47000 course on AI foundation models. ⭐13 · Apr 5, 2025 · Updated 10 months ago
- [NeurIPS ENLSP Workshop'24] CSKV: Training-Efficient Channel Shrinking for KV Cache in Long-Context Scenarios ⭐16 · Oct 18, 2024 · Updated last year
- ⭐18 · Jun 10, 2025 · Updated 8 months ago
- exploring whether LLMs perform case-based or rule-based reasoning ⭐30 · Mar 2, 2024 · Updated last year
- A library for red-teaming LLM applications with LLMs. ⭐29 · Oct 11, 2024 · Updated last year