kilian-group / phantom-wiki
Python package for generating datasets to evaluate reasoning and retrieval of large language models
☆15Updated this week
Alternatives and similar repositories for phantom-wiki:
Users that are interested in phantom-wiki are comparing it to the libraries listed below
- Aioli: A unified optimization framework for language model data mixing☆22Updated 2 months ago
- A testbed for agents and environments that can automatically improve models through data generation.☆21Updated 3 weeks ago
- Repository for Skill Set Optimization☆12Updated 8 months ago
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [to appear at ICLR 2025]☆18Updated 2 weeks ago
- Efficient Scaling laws and collaborative pretraining.☆15Updated 2 months ago
- Minimum Description Length probing for neural network representations☆19Updated 2 months ago
- Understanding the correlation between different LLM benchmarks☆29Updated last year
- Code for the paper: CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models☆17Updated 2 months ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Updated last year
- Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers"☆36Updated last year
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Updated last year
- ☆15Updated 6 months ago
- NeurIPS 2024 tutorial on LLM Inference☆39Updated 3 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆32Updated this week
- Latent Large Language Models☆17Updated 7 months ago
- ☆27Updated last week
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆55Updated 6 months ago
- Code, results and other artifacts from the paper introducing the WildChat-50m dataset and the Re-Wild model family.☆28Updated last month
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆15Updated last year
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta☆13Updated 4 months ago
- ☆28Updated last year
- Official Repo for InSTA: Towards Internet-Scale Training For Agents☆16Updated this week
- ☆21Updated 3 weeks ago
- Official implementation of "BERTs are Generative In-Context Learners"☆26Updated last week
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆42Updated last year
- ☆21Updated 5 months ago
- PyTorch implementation for MRL☆18Updated last year
- ☆11Updated last year
- ☆32Updated 9 months ago
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32Updated 10 months ago