An important scientific benchmark that has stood for more than seven decades has been broken by artificial intelligence (AI). A ...
In a standard three-party Turing test, persona-prompted LLMs were often judged to be human, with GPT-4.5 selected over real ...