luohongyin / LangCode
LangCode - Improving alignment and reasoning of large language models (LLMs) with natural language embedded program (NLEP).
☆42Updated last year
Alternatives and similar repositories for LangCode:
Users that are interested in LangCode are comparing it to the libraries listed below
- Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments (EMNLP'2024)☆36Updated 2 months ago
- ☆49Updated 8 months ago
- Implementation of "SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models"☆26Updated last month
- Functional Benchmarks and the Reasoning Gap☆84Updated 5 months ago
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆70Updated 3 months ago
- ☆39Updated 8 months ago
- The GitHub repo for Goal Driven Discovery of Distributional Differences via Language Descriptions☆69Updated 2 years ago
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation☆47Updated last year
- Codebase accompanying the Summary of a Haystack paper.☆75Updated 6 months ago
- SCREWS: A Modular Framework for Reasoning with Revisions☆27Updated last year
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆52Updated 3 months ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆54Updated last year
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…☆25Updated 3 months ago
- ☆75Updated this week
- A set of utilities for running few-shot prompting experiments on large-language models☆118Updated last year
- ☆28Updated 4 months ago
- ReBase: Training Task Experts through Retrieval Based Distillation☆28Updated last month
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆46Updated last year
- ☆20Updated 10 months ago
- [NeurIPS 2023] PyTorch code for Can Language Models Teach? Teacher Explanations Improve Student Performance via Theory of Mind☆67Updated last year
- CRMArena: Understanding the Capacity of LLM Agents to Perform Professional CRM Tasks in Realistic Environments☆49Updated last month
- ☆23Updated 6 months ago
- ☆40Updated last month
- Code/data for MARG (multi-agent review generation)☆41Updated 4 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆55Updated 6 months ago
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆41Updated 9 months ago
- InstructCoder: Instruction Tuning Large Language Models for Code Editing | Oral ACL-2024 srw☆58Updated 5 months ago
- The codebase for our ACL2023 paper: Did You Read the Instructions? Rethinking the Effectiveness of Task Definitions in Instruction Learni…☆29Updated last year
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models☆94Updated last year
- Dataset and evaluation suite enabling LLM instruction-following for scientific literature understanding.☆37Updated last week