Columbia-NLP-Lab / PAPILLON
Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles
☆21 Updated last month
Alternatives and similar repositories for PAPILLON:
Users who are interested in PAPILLON are comparing it to the repositories listed below:
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models ☆21 Updated 2 months ago
- Testing paligemma2 finetuning on reasoning dataset ☆18 Updated last month
- Functional Benchmarks and the Reasoning Gap ☆82 Updated 4 months ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832). ☆79 Updated 11 months ago
- Data preparation code for CrystalCoder 7B LLM ☆44 Updated 9 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025) ☆89 Updated 3 weeks ago
- Small, simple agent task environments for training and evaluation ☆18 Updated 3 months ago
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya ☆107 Updated 2 weeks ago
- ☆48 Updated 3 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆54 Updated 5 months ago
- The first dense retrieval model that can be prompted like an LM ☆64 Updated 5 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆48 Updated 7 months ago
- Mixing Language Models with Self-Verification and Meta-Verification ☆100 Updated 2 months ago
- ☆49 Updated 2 months ago
- Accompanying code and SEP dataset for the "Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?" paper. ☆46 Updated 8 months ago
- An implementation of Self-Extend, to expand the context window via grouped attention ☆118 Updated last year
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners" ☆99 Updated 5 months ago
- Evaluating LLMs with CommonGen-Lite ☆88 Updated 10 months ago
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?" ☆47 Updated 2 months ago
- GPT-4 Level Conversational QA Trained In a Few Hours ☆58 Updated 5 months ago
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for… ☆24 Updated 2 months ago
- ☆20 Updated last year
- ☆106 Updated 3 weeks ago
- ☆57 Updated 4 months ago
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments ☆70 Updated 4 months ago