Peeking into AIML@K
Why LLMs make silly math mistakes? This may be the biggest cause.
A particular case of implicit margin maximization by cross-entropy loss in supervised learning.