Python package for generating datasets to evaluate reasoning and retrieval of large language models
☆22Jun 3, 2026Updated 2 weeks ago
Alternatives and similar repositories for phantom-wiki
Users that are interested in phantom-wiki are comparing it to the libraries listed below.
- The official code release for Q#: Provably Optimal Distributional RL for LLM Post-Training☆20Mar 4, 2025Updated last year
- ☆27Dec 8, 2025Updated 6 months ago
- Materials for a language modeling class, broadly construed☆36Mar 28, 2026Updated 2 months ago
- ☆30Oct 31, 2025Updated 7 months ago
- ☆11Sep 10, 2023Updated 2 years ago
- H3M-SSMoEs: Hypergraph-based Multimodal Learning with LLM Reasoning and Style-Structured Mixture of Experts☆29Feb 20, 2026Updated 3 months ago
- The official implementation of the paper "Large Scale Knowledge Washing"☆10Jun 12, 2024Updated 2 years ago
- ☆45Apr 22, 2025Updated last year
- 🗜️Codebase of the ACIP algorithm 🗜️☆18Feb 11, 2026Updated 4 months ago
- [COLM '25] Single-Pass Document Scanning for Question Answering☆14Aug 20, 2025Updated 9 months ago
- [KDD 2023] code for "Test accuracy vs. generalization gap: model selection in NLP without accessing training or testing data" https://arx…☆12Oct 17, 2022Updated 3 years ago
- This repository contains data, code and models for contextual noncompliance.☆26Jul 18, 2024Updated last year
- ☆18Jan 17, 2024Updated 2 years ago
- ☆23Mar 2, 2025Updated last year
- [NAACL 2024] Topics, Authors, and Institutions in Large Language Model Research: Trends from 17K arXiv Papers https://arxiv.org/abs/2307.…☆17Jan 27, 2024Updated 2 years ago
- When Reasoning Meets Its Laws☆37Jan 2, 2026Updated 5 months ago
- r4c☆14Mar 2, 2021Updated 5 years ago
- [NeurIPS'24 LanGame workshop] On The Planning Abilities of OpenAI's o1 Models: Feasibility, Optimality, and Generalizability☆42Apr 10, 2026Updated 2 months ago
- Code for verifying deep neural feature ansatz☆22May 3, 2023Updated 3 years ago
- 🎭 Official code and dataset for our CCGPK@COLING 2022 paper - "PersonaChatGen: Generating Personalized Dialogue using GPT-3"☆13Mar 26, 2024Updated 2 years ago
- Make it easy to automatically and uniformly measure the behavior of many AI Systems.☆27Oct 2, 2024Updated last year
- ☆17Aug 1, 2025Updated 10 months ago
- ☆33Jan 31, 2026Updated 4 months ago
- ☆43Aug 31, 2024Updated last year
- AdaSplash: Adaptive Sparse Flash Attention (aka Flash Entmax Attention)☆45May 20, 2026Updated 3 weeks ago
- FeedbackQA: Improving Question Answering Post-Deployment with Interactive Feedback☆12Jul 13, 2022Updated 3 years ago
- A framework bridging cognitive science and LLM reasoning research to diagnose and improve how large language models reason, based on anal…☆40Nov 26, 2025Updated 6 months ago
- ☆12Mar 25, 2024Updated 2 years ago
- [AAAI26] LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs☆56Dec 7, 2025Updated 6 months ago
- ☆19Jan 3, 2025Updated last year
- ☆12Mar 7, 2022Updated 4 years ago
- ☆13Nov 5, 2024Updated last year
- The original Shared Recurrent Memory Transformer implementation☆36Jul 11, 2025Updated 11 months ago
- RAG Hallucination Detecting By LRP.☆12Mar 31, 2025Updated last year
- Apps that run on modal.com☆13Sep 14, 2025Updated 9 months ago
- Build GDrive CLI (https://github.com/Msameim181/gdrive) with your own credentials using GitHub Actions☆16Jan 16, 2023Updated 3 years ago
- ☆12May 6, 2024Updated 2 years ago
- Efficient multi-prompt evaluation of LLMs☆33Dec 6, 2024Updated last year
- Go package that wraps around OpenAI HTTP APIs☆12Mar 2, 2023Updated 3 years ago