Python package for generating datasets to evaluate reasoning and retrieval of large language models
☆22 · Feb 23, 2026 · Updated 2 months ago
Alternatives and similar repositories for phantom-wiki
Users that are interested in phantom-wiki are comparing it to the libraries listed below.
- The official code release for Q#: Provably Optimal Distributional RL for LLM Post-Training ☆19 · Mar 4, 2025 · Updated last year
- ☆27 · Dec 8, 2025 · Updated 5 months ago
- Materials for a language modeling class, broadly construed ☆35 · Mar 28, 2026 · Updated last month
- ☆12 · Nov 2, 2021 · Updated 4 years ago
- The official implementation of the paper "Large Scale Knowledge Washing" ☆10 · Jun 12, 2024 · Updated last year
- ☆44 · Apr 22, 2025 · Updated last year
- 🗜️Codebase of the ACIP algorithm 🗜️ ☆18 · Feb 11, 2026 · Updated 2 months ago
- ☆10 · Nov 15, 2023 · Updated 2 years ago
- [KDD 2023] code for "Test accuracy vs. generalization gap: model selection in NLP without accessing training or testing data" https://arx… ☆12 · Oct 17, 2022 · Updated 3 years ago
- [COLM '25] Single-Pass Document Scanning for Question Answering ☆14 · Aug 20, 2025 · Updated 8 months ago
- This repository contains data, code and models for contextual noncompliance. ☆25 · Jul 18, 2024 · Updated last year
- ☆18 · Jan 17, 2024 · Updated 2 years ago
- Align your LM to express calibrated verbal statements of confidence in its long-form generations. ☆29 · Jun 4, 2024 · Updated last year
- When Reasoning Meets Its Laws ☆37 · Jan 2, 2026 · Updated 4 months ago
- Make it easy to automatically and uniformly measure the behavior of many AI Systems. ☆27 · Oct 2, 2024 · Updated last year
- This repository contains the publishable code for CVPR 2021 paper TransNAS-Bench-101: Improving Transferrability and Generalizability of … ☆24 · Apr 11, 2023 · Updated 3 years ago
- ☆17 · Aug 1, 2025 · Updated 9 months ago
- Collection of resources to learn about tech used in SOCR projects ☆11 · Dec 27, 2016 · Updated 9 years ago
- SWIM protocol implementation for exchanging cluster membership status and metadata. ☆11 · Oct 9, 2023 · Updated 2 years ago
- AdaSplash: Adaptive Sparse Flash Attention (aka Flash Entmax Attention) ☆41 · Sep 30, 2025 · Updated 7 months ago
- ☆12 · Dec 14, 2022 · Updated 3 years ago
- ☆32 · Jan 31, 2026 · Updated 3 months ago
- Some improvements on Adam ☆28 · Nov 5, 2020 · Updated 5 years ago
- [NeurIPS 2021] code for "Taxonomizing local versus global structure in neural network loss landscapes" https://arxiv.org/abs/2107.11228 ☆20 · Jan 7, 2022 · Updated 4 years ago
- FeedbackQA: Improving Question Answering Post-Deployment with Interactive Feedback ☆12 · Jul 13, 2022 · Updated 3 years ago
- Code for paper "Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning" ☆84 · Jan 24, 2024 · Updated 2 years ago
- ☆12 · Mar 25, 2024 · Updated 2 years ago
- [AAAI26] LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs ☆56 · Dec 7, 2025 · Updated 5 months ago
- [NeurIPS 2024] AlphaPruning: Using Heavy-Tailed Self Regularization Theory for Improved Layer-wise Pruning of Large Language Models ☆34 · Jun 9, 2025 · Updated 11 months ago
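- Pattern recognition course code assignments for the 2019 cohort, School of Artificial Intelligence and Automation, Huazhong University of Science and Technology ☆21 · Nov 21, 2021 · Updated 4 years ago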
- ☆12 · Mar 7, 2022 · Updated 4 years ago
- ☆13 · Nov 5, 2024 · Updated last year
- The original Shared Recurrent Memory Transformer implementation ☆36 · Jul 11, 2025 · Updated 9 months ago
- RAG Hallucination Detecting By LRP. ☆11 · Mar 31, 2025 · Updated last year
- Apps that run on modal.com ☆13 · Sep 14, 2025 · Updated 7 months ago
- TOON as DSPy adapter ☆26 · Feb 1, 2026 · Updated 3 months ago
- Build GDrive CLI (https://github.com/Msameim181/gdrive) with your own credentials using GitHub Actions ☆16 · Jan 16, 2023 · Updated 3 years ago
- ☆12 · May 6, 2024 · Updated 2 years ago
- Go package that wraps around OpenAI HTTP APIs ☆12 · Mar 2, 2023 · Updated 3 years ago