Text this: Performance of large language models in the differential diagnosis of benign and malignant biliary stricture