Microsoft and Salesforce analyzed 200,000+ AI chats, finding models lose reliability in multiโturn dialogue despite strong singleโprompt performance.


Microsoft and Salesforce analyzed 200,000+ AI chats, finding models lose reliability in multiโturn dialogue despite strong singleโprompt performance.