The offering measures AI's real-world performance and safety around handling realistic medical conversations, using physician-created rubrics and GPT-4.1 scoring.
Photo: John Fedele/Getty Images
Photo: John Fedele/Getty Images