acl-org / acl-2024
Repository for the ACL 2024 conference website
☆18 Updated 6 months ago
Alternatives and similar repositories for acl-2024
Users who are interested in acl-2024 are comparing it to the repositories listed below
- ☆29 Updated last year
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions ☆45 Updated last year
- A package dedicated for running benchmark agreement testing ☆17 Updated 3 months ago
- SILO Language Models code repository ☆81 Updated last year
- Implementation of the paper: "Answering Questions by Meta-Reasoning over Multiple Chains of Thought" ☆96 Updated last year
- ☆17 Updated last month
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search ☆96 Updated 9 months ago
- Code for Zero-Shot Tokenizer Transfer ☆135 Updated 7 months ago
- Code for "From Pretraining Data to Language Models to Downstream Tasks: Tracking the Trails of Political Biases Leading to Unfair NLP Mod… ☆40 Updated last year
- [ACL 2025 Main] Official Repository for "Evaluating Language Models as Synthetic Data Generators" ☆38 Updated 8 months ago
- PASTA: Post-hoc Attention Steering for LLMs ☆122 Updated 9 months ago
- Data and code for the preprint "In-Context Learning with Long-Context Models: An In-Depth Exploration" ☆39 Updated last year
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024] ☆146 Updated 10 months ago
- ☆135 Updated 9 months ago
- [ICLR 2023] Code for our paper "Selective Annotation Makes Language Models Better Few-Shot Learners" ☆110 Updated 2 years ago
- The source code for running LLMs on the AAAR-1.0 benchmark. ☆17 Updated 4 months ago
- A package to generate summaries of long-form text and evaluate the coherence of these summaries. Official package for our ICLR 2024 paper… ☆124 Updated 11 months ago
- [ICLR 2025] BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval ☆161 Updated 3 months ago
- ☆127 Updated 11 months ago
- ☆50 Updated last year
- ☆52 Updated 4 months ago
- [NeurIPS 2024 Spotlight] Code and data for the paper "Finding Transformer Circuits with Edge Pruning". ☆59 Updated 2 weeks ago
- Benchmarking Benchmark Leakage in Large Language Models ☆55 Updated last year
- ☆149 Updated last year
- Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers". ☆78 Updated last year
- 🚢 Data Toolkit for Sailor Language Models ☆94 Updated 6 months ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers ☆132 Updated last year
- Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators (Liu et al.; COLM 2024) ☆48 Updated 7 months ago
- Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data ☆43 Updated 6 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆60 Updated last year