In our newest preprint, we discuss current explainable AI (XAI) methods. We divided the workflow of a generative decoder-only model into four information contexts for XAI: training dataset, input query, model components, and output sequence. See here:
arxiv.org/abs/2506.19532@aichemist.bsky.social