Probing Hidden States for Calibrated, Alignment-Resistant Predictions in LLMs

2025-09-19 · health informatics · Title + abstract only
View on medRxiv
Scientific applications of large language models (LLMs) demand reliable, well-calibrated predictions, but standard generative approaches often fail to fully access relevant knowledge contained in the models' internal representations. As a result, models appear less capable than they are, with useful information remaining latent. We present PING (Probing INternal states of Generative models), an open-source framework that trains lightweight probes on frozen, HuggingFace-compatible transformers to deliv...
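The core idea described in the abstract, training a lightweight probe on frozen hidden states rather than relying on generated text, can be sketched as follows. This is a minimal illustration, not the PING implementation: synthetic vectors stand in for transformer hidden states, and the probe is an ordinary logistic regression.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)

# Synthetic stand-in for frozen-LLM hidden states: 64-dimensional vectors
# whose first coordinate weakly encodes the binary label. (Hypothetical
# setup; a real pipeline would read these from a HuggingFace transformer
# with output_hidden_states=True and keep the model weights frozen.)
n, d = 2000, 64
y = rng.integers(0, 2, size=n)
X = rng.normal(size=(n, d))
X[:, 0] += 1.5 * (y - 0.5)  # inject a weak linear signal

# Lightweight linear probe trained on the frozen representations
probe = LogisticRegression(max_iter=1000).fit(X[:1500], y[:1500])

# The probe emits probabilities, which can then be checked for calibration
probs = probe.predict_proba(X[1500:])[:, 1]
acc = probe.score(X[1500:], y[1500:])
```

Because only the small probe is trained, the base model's weights and behavior are untouched, which is what makes the predictions "alignment-resistant" in the sense of being read directly from internal states rather than from post-trained generative outputs.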