lili-chen/self-questioning-lm

Self-Questioning Language Models

☆56

Alternatives and similar repositories for self-questioning-lm

Users who are interested in self-questioning-lm are comparing it to the repositories listed below.

intervention-training / int
☆16 · Feb 4, 2026 · Updated 5 months ago
howard-yen / SLIM
☆27 · Jun 22, 2026 · Updated last month
cvenhoff / thinking-llms-interp
☆25 · Jul 8, 2026 · Updated 3 weeks ago
mihirp1998 / Slot-TTA
Slot-TTA shows that test-time adaptation using slot-centric models can improve image segmentation on out-of-distribution examples.
☆26 · Jun 20, 2023 · Updated 3 years ago
alexmartin1722 / wikivideo
WikiVideo: Article Generation from Multiple Videos
☆15 · Nov 14, 2025 · Updated 8 months ago
GChrysostomou / saloss
☆11 · Dec 23, 2021 · Updated 4 years ago
complex-reasoning / RPG
[ICLR 2026] RPG: KL-Regularized Policy Gradient (https://arxiv.org/abs/2505.17508)
☆76 · Jun 29, 2026 · Updated last month
swarnaHub / SummarizationPrograms
[ICLR 2023] PyTorch code of Summarization Programs: Interpretable Abstractive Summarization with Neural Modular Trees
☆23 · Jun 19, 2023 · Updated 3 years ago
Quark-Medical / QuarkMed
☆11 · Sep 16, 2025 · Updated 10 months ago
danishpruthi / evidence-extraction
Code for paper: Weakly- and Semi-supervised Evidence Extraction
☆15 · Apr 12, 2021 · Updated 5 years ago
kristinagligoric / confidence-driven-inference
☆17 · Jul 23, 2025 · Updated last year
johncava / pytorch-IPOT
Unofficial pytorch implementation of IPOT for improved Seq2Seq Learning
☆14 · Dec 4, 2021 · Updated 4 years ago
Chengsong-Huang / R-Zero
[ICLR2026] codes for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004)
☆825 · Feb 4, 2026 · Updated 5 months ago
wmn-231314 / diffusion-data-constraint
Official PyTorch implementation and models for paper "Diffusion Beats Autoregressive in Data-Constrained Settings". We find diffusion mod…
☆127 · Jan 10, 2026 · Updated 6 months ago
oceanoceanna / LLMEraser
☆15 · Feb 26, 2025 · Updated last year
lichengliu03 / unary-feedback
☆44 · Mar 31, 2026 · Updated 3 months ago
wuxiyang1996 / COS-PLAY
COS-PLAY: Co-Evolving LLM Decision and Skill Bank Agents for Long-Horizon Game Play
☆29 · Jul 11, 2026 · Updated 2 weeks ago
BaohaoLiao / SAGE
Self-Hinting Language Models Enhance Reinforcement Learning
☆27 · Mar 28, 2026 · Updated 4 months ago
lilakk / BLEUBERI
Official repository for "BLEUBERI: BLEU is a surprisingly effective reward for instruction following"
☆32 · Jun 5, 2025 · Updated last year
affect2mm / emotion-timeseries
☆16 · Nov 24, 2020 · Updated 5 years ago
mihirp1998 / Disentangling-3D-Prototypical-Nets
We present neural architectures that disentangle RGB-D images into objects' shapes and styles and a map of the background scene, and expl…
☆11 · Jul 26, 2021 · Updated 5 years ago
ZhangShiyue / extractive_is_not_faithful
☆17 · May 19, 2023 · Updated 3 years ago
bsivanantham / GAE
Reinforcement learning algorithms with Generalized Advantage Estimation
☆22 · Jun 6, 2018 · Updated 8 years ago
katiekang1998 / llm_hallucinations
☆18 · May 28, 2024 · Updated 2 years ago
Qwen-Applications / GD2PO
☆20 · Jun 16, 2026 · Updated last month
Small-Model-Gap / Small-Model-Learnability-Gap
☆23 · Oct 10, 2025 · Updated 9 months ago
yf-he / EvoTest
EvoTest: Evolutionary Test-Time Learning for Self-Improving Agentic Systems (ICLR'26)
☆25 · Nov 3, 2025 · Updated 8 months ago
danijar / diamond_env
Standardized Minecraft Diamond Environment for Reinforcement Learning
☆40 · May 19, 2023 · Updated 3 years ago
kvfrans / lmpo
☆141 · Dec 9, 2025 · Updated 7 months ago
princeton-pli / RLMT
[R]einforcement [L]earning from [M]odel-rewarded [T]hinking - code for the paper "Language Models That Think, Chat Better"
☆129 · Oct 27, 2025 · Updated 9 months ago
gepa-ai / gepa-artifact
☆35 · Feb 8, 2026 · Updated 5 months ago
wutaiqiang / awesome-GNN2MLP-distillation
Learning MLPs to replace GNN
☆10 · Jun 3, 2023 · Updated 3 years ago
zhentingqi / evolm
☆75 · Jun 23, 2025 · Updated last year
chentong0 / rl-binary-rar
Official repo for "Binary Retrieval-augmented Reward Mitigates Hallucinations"
☆15 · Nov 13, 2025 · Updated 8 months ago
WujiangXu / EPO
The code for paper "EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning"
☆40 · Jul 13, 2026 · Updated 2 weeks ago
MingyuJ666 / Disentangling-Memory-and-Reasoning
[ACL'25] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.
☆87 · Nov 2, 2025 · Updated 8 months ago
disi-unibo-nlp / medgenie
The First Generate-then-Read Framework for Multiple-Choice Question Answering in Medicine
☆15 · May 27, 2024 · Updated 2 years ago
sail-sg / VeriFree
Reinforcing General Reasoning without Verifiers
☆102 · Jun 24, 2025 · Updated last year
Code2Q / TagCF
☆17 · Nov 6, 2025 · Updated 8 months ago