A Transformer that is pretrained on a large sample of data.

Concepts: Reasoning, Emergent Ability of LLMs