por
Thomas Dworetzky, Contributing Reporter | July 23, 2018
The new publicly-accessible medical imaging database, DeepLesion, is a “critical step forward in computer-aided radiology detection, diagnosis, and deep learning,” according to the paper announcing its availability in the Journal of Medical Imaging.
It is the largest CT lesion-image database ever made available to the public, with over 32,000 annotated lesions from over 10,000 cases, according to the team from the National Institutes of Health Clinical Center that developed it. Such huge, annotated radiological datasets are essential in the creation of deep learning approaches to medical data.
"We hope the data set will benefit the medical imaging area just as ImageNet benefited the computer vision area," said Ke Yan, the lead author on the paper and a postdoctoral fellow with senior author Dr. Ronald Summers, senior investigator and staff radiologist at the center.
Ad Statistics
Times Displayed: 16169
Times Visited: 33 Final days to save an extra 10% on Imaging, Ultrasound, and Biomed parts web prices.* Unlimited use now through September 30 with code AANIV10 (*certain restrictions apply)
DeepLesion was creating by “mining” historical medical data from the Institute's own Picture Archiving and Communication System (PACS).
“This new dataset has tremendous potential to jump-start the field of computer-aided detection (CADe) and diagnosis (CADx),” according to the release.
DeepLesion differs from most other medical image datasets now available, which are only able to spot one type of lesion, according to the NIH in a statement.
When examining CT images radiologists at the Clinical Center measure and mark clinically significant findings using “electronic bookmarks”, which can be complex and include arrows, lines, diameters, and text to pinpoint the tumor's location and size, to enable experts to spot growth or new disease.
“The bookmarks, abundant with retrospective medical data, are what scientists used to develop the DeepLesion dataset,” stated the NIH, noting that unlike most other datasets, DeepLesion has great diversity, with “all kinds of critical radiology findings from across the body, such as lung nodules, liver tumors, enlarged lymph nodes, and so on.”
The lack of such a multiple category lesion data set “has been a major roadblock to development of more universal CADe frameworks capable of detecting multiple lesion types.
This new multi-category dataset could “even enable development of CADx systems that automate radiological diagnosis,” according to the statement.
The team also created a universal lesion detector from their work on DeepLesion, and noted that while detection is time-consuming for radiologists, it is crucial to diagnosis. The thought is that this detector could be used in the future for screening by either radiologists or other CADe systems.