Why Automation Testing of LLMs is So Damn Hard