Microsoft and Salesforce analyzed more than 200,000 AI chats and found that models lose reliability in multi‑turn dialogue despite strong single‑prompt performance.

