Why do LLMs attend to the first token?