COVID-CT-Dataset: CT Scan Datasets about COVID-19
A paper on arXiv: "COVID-CT-Dataset: A CT Scan Dataset about COVID-19"
Jinyu Zhao (UC San Diego), Yichen Zhang (UC San Diego), Xuehai He (UC San Diego), and Pengtao Xie (UC San Diego, Petuum Inc)
LINK TO THE PAPER
CT scans are promising for accurate, fast, and cheap screening and testing of COVID-19. In this paper, we build a publicly available COVID-CT dataset, containing 275 CT scans that are positive for COVID-19, to foster the research and development of deep learning methods that predict whether a person is affected by COVID-19 by analyzing their CT scans. We train a deep convolutional neural network on this dataset and achieve an F1 score of 0.85, a promising result that leaves room for further improvement.
The data and code are available at:
The COVID-CT-Dataset has 349 CT images containing clinical findings of COVID-19. They are in ./Images-processed/CT_COVID.zip
Non-COVID CT scans are in ./Images-processed/CT_NonCOVID.zip
We provide a data split in ./Data-split
The meta information (e.g., patient ID, DOI, image caption) is in COVID-CT-MetaInfo.xlsx
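The archive layout above can be turned into a labeled file list with a few lines of Python. The following is a minimal sketch, not the authors' loading code: the helper name `load_labeled_paths`, the label convention (1 for COVID, 0 for non-COVID), and the set of image file extensions are all assumptions made for illustration. Because the internal folder structure of the archives is not described here, the sketch simply walks whatever gets extracted.

```python
import zipfile
from pathlib import Path

# Assumed image formats; adjust if the archives contain other file types.
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg"}


def load_labeled_paths(archives, out_dir):
    """Extract each zip archive and return a list of (image_path, label) pairs.

    `archives` maps a zip file path to an integer label (hypothetical
    convention: 1 = COVID, 0 = non-COVID). Each archive is extracted into
    its own subfolder of `out_dir`, named after the archive.
    """
    out_dir = Path(out_dir)
    samples = []
    for zip_path, label in archives.items():
        target = out_dir / Path(zip_path).stem
        target.mkdir(parents=True, exist_ok=True)
        with zipfile.ZipFile(zip_path) as zf:
            zf.extractall(target)
        # Walk recursively, since the archive's internal layout is not assumed.
        for path in sorted(target.rglob("*")):
            if path.is_file() and path.suffix.lower() in IMAGE_SUFFIXES:
                samples.append((path, label))
    return samples
```

For example, calling `load_labeled_paths({"./Images-processed/CT_COVID.zip": 1, "./Images-processed/CT_NonCOVID.zip": 0}, "./extracted")` from the repository root would return one entry per extracted image. The resulting list can then be filtered against the files in ./Data-split, whose exact format should be checked before use.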
The images are collected from COVID-19-related papers published in medRxiv, bioRxiv, NEJM, JAMA, Lancet, etc. CTs containing COVID-19 abnormalities are selected by reading the figure captions in the papers. All copyrights of the data belong to the authors and publishers of these papers.