Polyp classification

Given the shortage of endoscopists and pathologists with the required experience, automated risk classification of colon polyps will significantly reduce their workload. Automated systems will also improve objectivity and allow for the increased throughput needed to enable a successful bowel cancer screening programme.

Colorectal cancer is the second most common cancer and the second most common cause of cancer death in Norway. Organised bowel cancer screening has recently been implemented in Norway. In many countries, bowel screening has been shown to significantly reduce both incidence and mortality rates. The implementation in Norway has increased the need for endoscopists and pathologists, particularly specialised gastrointestinal (GI) pathologists with experience in histological polyp examinations.

Our Polyp classification project aims to develop an automated, deep learning-based histology classification system for bowel polyps. The system will classify each polyp according to whether its histology type is associated with a definite low risk or a high risk of developing into cancer. With such a system properly implemented, the pathologist will have to examine only the polyps identified as potentially high-risk (at most 10%), whereas the remaining definite low-risk polyps (90% or more) can be left without further investigation. The approach will drastically reduce the workload for the pathologist, improve objectivity and allow for the increased throughput that is required to implement a bowel cancer screening programme in Norway.

Screening and treatment

A faecal occult blood test (FOBT) is commonly used to identify patients who should undergo a colonoscopy to examine the bowel for polyps. Polyps identified during the colonoscopy are removed and examined by a specialised GI-pathologist, who classifies them according to histological type; the different types are associated with a low or a high risk of developing into invasive cancer. Further follow-up is decided based on this examination. Patients with high-risk polyps are scheduled for more extensive and more frequent follow-up investigations than patients with low-risk polyps. The vast majority of the identified polyps are low-risk adenomas, which very seldom develop into adenocarcinomas over time and require no further treatment when removed.

Interobserver agreement in the reporting of polyp pathology is suboptimal. This is not ideal for the patient, who may end up with suboptimal treatment, and underscores the need for more objective guidelines and methods. Furthermore, pathologists are a scarce resource with a significant workload, and are a bottleneck in most countries' treatment pathways.

International collaboration

The project will be implemented in close collaboration with researchers from the University of Oxford, University College London (UCL) and Cheltenham General Hospital. Leading GI-pathologists from UCL and Cheltenham General Hospital will be responsible for the data used for training the deep learning model and for its quality. As the pathological examinations of polyps have been carried out for more than ten years as part of the screening programme in the UK, the data availability is unlimited for practical purposes. We have agreed to utilise a discovery dataset consisting of 6800 polyps from 2800 patients diagnosed at UCL, while a dataset consisting of 6000 polyps diagnosed at Cheltenham General Hospital will be used for an independent validation of the method.

Project plan

As a first step, we will develop a deep learning-based histology classification system that uses pathologists' histology classifications as ground truth. This system will automatically identify polyps with an associated low risk of developing into cancer and thereby reduce the pathologists' workload significantly, as this group constitutes the majority of polyp diagnoses. The system will support the pathologists in their polyp examination by providing classification suggestions that may improve the consistency and accuracy of polyp diagnostics. We will commence by developing a system that automatically identifies tubular adenomas without high-grade dysplasia. Over 50% of all polyp diagnoses belong to this histological subgroup, which is associated with a low risk of cancer development. Robust automatic identification of polyps belonging to this group will eliminate a significant portion of the pathologists' polyp examination workload. Given our competence and experience in applying deep learning to the analysis of scanned tissue sections, we consider this task highly feasible.

Subsequently, we will add normal and hyperplastic polyps to the definition above. The histological subtypes associated with these polyps account for more than 75% of all polyp diagnoses. An absolute requirement in identifying low-risk polyps is that none of the high-risk polyps are erroneously classified as low-risk.
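The requirement that no high-risk polyp is classified as low-risk can be illustrated with a simple decision threshold. The following is a minimal sketch, not the project's method: it assumes a hypothetical model that outputs a score per polyp (higher meaning more likely high-risk), and the function names `pick_low_risk_threshold` and `triage` are invented for illustration. The threshold is set to the lowest score seen among the known high-risk polyps in a labelled set, so that only polyps scoring strictly below it are auto-classified as low-risk.

```python
def pick_low_risk_threshold(scores, is_high_risk):
    """Return the largest threshold t such that no known high-risk polyp
    has a score strictly below t (i.e. none would be auto-classified low-risk)."""
    high_scores = [s for s, h in zip(scores, is_high_risk) if h]
    if not high_scores:
        raise ValueError("need at least one high-risk example to set a threshold")
    return min(high_scores)


def triage(scores, threshold):
    """Polyps scoring below the threshold are auto-classified as low-risk;
    all others are referred to the pathologist."""
    return ["low-risk" if s < threshold else "refer" for s in scores]


# Made-up scores and labels, for illustration only.
scores = [0.05, 0.10, 0.40, 0.35, 0.90, 0.20]
is_high_risk = [False, False, False, True, True, False]

threshold = pick_low_risk_threshold(scores, is_high_risk)
decisions = triage(scores, threshold)
auto_fraction = decisions.count("low-risk") / len(decisions)
```

A threshold that produces no misses on one labelled set does not guarantee none on new data, so in practice it would have to be confirmed on an independent dataset.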

The next phase includes the development of a deep learning-based system for the histology classification of the remaining polyps (after the simple cases have been identified). This phase will be more challenging. Our overall aim is to automatically identify the definite low-risk polyps (90% or more), that is, those associated with a low risk of cancer development, and to leave the remaining polyps for the pathologists to examine thoroughly. Even if not all histology classes among the low-risk polyps can be automatically detected, error-free automatic classification of, for example, 50-70% of all polyps would still be a great contribution to polyp diagnostics.

In this phase, we will introduce the deep learning system to the real gold standard, namely patient outcome. We will combine histological classification and patient outcome to develop a deep learning system that estimates the risk of colorectal cancer development given the observed bowel polyps. We will identify and include patients enrolled in the UK bowel cancer screening programme who later presented with colorectal cancer and use this information together with the histological type for labelling samples during training. With this approach, polyps from patients who later developed colorectal cancer will, all else being equal, represent increased cancer risk compared to polyps from patients who did not. We will also manually re-evaluate the histological diagnosis of the polyps in this patient group to verify the diagnosis and acquire new insight into the characteristics of these polyps. Some histological classes are particularly challenging to distinguish, even for expert GI-pathologists. The approach in part II of the project may reveal knowledge that can be used to improve the consistency and accuracy of these difficult diagnoses.
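One hypothetical way to combine histological type and patient outcome into a training label is sketched below. The text does not specify the labelling scheme, so the ordinal levels, the function name `training_label`, and the grouping of all other histology types into a single higher level are assumptions made purely for illustration. The only properties taken from the text are that the three named histology groups are low-risk, and that a later colorectal cancer diagnosis raises the label relative to the same histology without one.

```python
# Histology groups that the text names as low-risk. Every other type is
# lumped into one "other" group here, which is an illustrative assumption.
LOW_RISK_HISTOLOGY = {
    "tubular adenoma without high-grade dysplasia",
    "hyperplastic polyp",
    "normal",
}


def training_label(histology, later_cancer):
    """Hypothetical ordinal label in 0..3.

    Histology sets the base level (0 for the named low-risk groups, 2 otherwise);
    a later colorectal cancer diagnosis in the same patient raises it by one.
    """
    base = 0 if histology in LOW_RISK_HISTOLOGY else 2
    return base + (1 if later_cancer else 0)


# Made-up samples: (histology type, patient later presented with colorectal cancer)
samples = [
    ("hyperplastic polyp", False),
    ("hyperplastic polyp", True),
    ("some other type", False),
    ("some other type", True),
]
labels = [training_label(h, c) for h, c in samples]
```

Within each histology group, the polyp from the patient who later developed cancer receives the higher label, which mirrors the "all else being equal" ordering described above.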

Image: digital scan of H&E-stained resected polyps. The digital scan is divided into tiles to train the deep learning model.
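The tiling mentioned in the caption above can be sketched as a computation of tile positions. This is a minimal illustration under stated assumptions: the tile size, the stride, the scan dimensions, and the function name `tile_coordinates` are all hypothetical, and the sketch only computes top-left pixel coordinates rather than reading image data. Tiles that would extend past the edge of the scan are skipped.

```python
def tile_coordinates(width, height, tile_size, stride=None):
    """Return top-left (x, y) pixel coordinates of all full tiles in a scan.

    stride defaults to tile_size (non-overlapping tiles); a smaller stride
    yields overlapping tiles. Partial tiles at the right/bottom edge are skipped.
    """
    if stride is None:
        stride = tile_size
    if tile_size <= 0 or stride <= 0:
        raise ValueError("tile_size and stride must be positive")
    coords = []
    for y in range(0, height - tile_size + 1, stride):
        for x in range(0, width - tile_size + 1, stride):
            coords.append((x, y))
    return coords


# Illustrative dimensions only; real scans are far larger.
tiles = tile_coordinates(width=1000, height=600, tile_size=256)
overlapping = tile_coordinates(width=1000, height=600, tile_size=256, stride=128)
```

Each coordinate pair would then be used to crop one tile from the scan, and background tiles containing no tissue would typically be filtered out before training.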

Polyp classification is a project enabled by findings in the DoMore! project.

 