RenzeLou / AAAR-1.0View external linksLinks
The source code for running LLMs on the AAAR-1.0 benchmark.
☆18Apr 5, 2025Updated 10 months ago
Alternatives and similar repositories for AAAR-1.0
Users that are interested in AAAR-1.0 are comparing it to the libraries listed below
Sorting:
- [NAACL'25] "Revealing the Barriers of Language Agents in Planning"☆13Jun 22, 2025Updated 7 months ago
- [EMNLP 2024 Tutorial] Language Agents: Foundations, Prospects, and Risks☆10Nov 27, 2024Updated last year
- ☆13Sep 26, 2024Updated last year
- Code for paper "Factual Confidence of LLMs: on Reliability and Robustness of Current Estimators"☆16Dec 4, 2024Updated last year
- Interface for GenAI-Arena [NeurIPS24]☆17Feb 27, 2024Updated last year
- Official implementation of RMoE (Layerwise Recurrent Router for Mixture-of-Experts)☆29Aug 4, 2024Updated last year
- Official code and dataset for our EMNLP 2024 Findings paper: Stark: Social Long-Term Multi-Modal Conversation with Persona Commonsense Kn…☆19Dec 27, 2024Updated last year
- ☆20May 28, 2025Updated 8 months ago
- [COLM'24] "Deductive Beam Search: Decoding Deducible Rationale for Chain-of-Thought Reasoning"☆21Jun 14, 2024Updated last year
- [COLM'24] "How Easily do Irrelevant Inputs Skew the Responses of Large Language Models?"☆22Oct 13, 2024Updated last year
- Implementation of BitNet-1.58 instruct tuning☆27Apr 14, 2024Updated last year
- ☆52Nov 27, 2024Updated last year
- Source code of paper 'LED: Lexicon-Enlightened Dense Retriever for Large-Scale Retrieval' (WWW 2023)☆22Aug 28, 2023Updated 2 years ago
- Demo for advanced Java final project in 18-19 1 of Canghong Jin☆25Nov 18, 2018Updated 7 years ago
- LLM for Scientific Research Survey☆123Jan 22, 2025Updated last year
- [ICCV 2025] Dynamic-VLM☆28Dec 16, 2024Updated last year
- ☆32Dec 11, 2024Updated last year
- A curated list of papers on LLMs and agents for scientific research and development☆86Dec 11, 2024Updated last year
- A truly open version of gpt-oss which shows the entire pre-training from scratch☆85Sep 4, 2025Updated 5 months ago
- ☆67Jun 18, 2023Updated 2 years ago
- ☆27Feb 9, 2026Updated last week
- Repository of IPBench☆19Jan 4, 2026Updated last month
- This repository is the official implementation of Topology-Informed Graph Transformer (Choi et al., GRaM Workshop at ICML 2024).☆12Dec 28, 2024Updated last year
- ☆35Mar 12, 2025Updated 11 months ago
- [TMLR 2024 J2C Certification] Generalizing Denoising to Non-Equilibrium Structures Improves Equivariant Force Fields☆40Feb 11, 2025Updated last year
- ☆42Jul 1, 2024Updated last year
- MEXMA: Token-level objectives improve sentence representations☆42Jan 6, 2025Updated last year
- This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?”☆85May 10, 2022Updated 3 years ago
- True Few-Shot BioIE: Benchmarking GPT-3 In-Context and Small PLM Fine-Tuning☆12Jul 6, 2022Updated 3 years ago
- [ACL'23 Findings] "Aligning Instruction Tasks Unlocks Large Language Models as Zero-Shot Relation Extractors"☆40Dec 22, 2023Updated 2 years ago
- ☆23Updated this week
- A benchmark dataset designed to support the development and evaluation of large language models (LLMs) for conversational mental health a…☆17Feb 24, 2025Updated 11 months ago
- GBM implementation on Legate☆14Jan 28, 2026Updated 2 weeks ago
- Code repository supporting the paper "Auto-Generating Weak Labels for Real & Synthetic Data to Improve Label-Scarce Medical Image Segment…☆11Apr 29, 2024Updated last year
- Evaluation Pipeline for medical tasks.☆12Updated this week
- ☆11May 24, 2024Updated last year
- MV-RAG combines retrieval with multi-view generation to create accurate 3D-consistent visuals. By retrieving reference images and text, i…☆23Nov 29, 2025Updated 2 months ago
- [KDD Explore'24]Time Series Forecasting with LLMs: Understanding and Enhancing Model Capabilities☆17May 7, 2025Updated 9 months ago
- A semi print-in-place hand for human-like manipulation, designed to be built by anyone.☆17Jan 5, 2026Updated last month