This repository contains the question-answering dataset and code used to evaluate the medical knowledge recall of large language models, as reported in an ACL 2024 paper.

Link to GitHub Repository