Evaluation of serum-based cancer biomarkers: A brief review from a clinical and computational viewpoint

https://doi.org/10.1016/j.critrevonc.2014.10.002Get rights and content

Abstract

Biomarkers are vital to detect diseases in various clinical stages. A variety of cancer serum biomarkers are already known, while for more accurate cancer-type detection, there required more rigorous evaluation manners, especially computational evaluation measures, for biomarkers. In this review, we first show three typical pitfalls in finding biomarkers and their examples, after briefly presenting standard five clinical biomarker screening phases by National Cancer Institute. We then introduce current computational biomarker evaluation measures, including current, standard methods with their intrinsic features. We further show an up-to-date list of existing cancer serum biomarkers, pointing out several issues, being caused by the limitations of current biomarker evaluation approaches. Finally we discuss the current attempts to develop new, statistically robust, computational serum-based biomarker measures in terms of specificity to each of various cancer types.

Introduction

Biomarkers are believed to increase the accuracy of diagnosis to precisely characterize the disease in a diagnostic or prognostic level. Biomarkers predict the response of the patient, helping to guide a more tailored treatment for the patient. Serum biomarkers are more appealing due to their simplicity of obtaining the blood samples. There are several serum cancer biomarkers, which are routinely used in clinical oncology, e.g. prostate specific antigen (PSA) for prostate cancer and cancer antigen (CA)-125 for ovarian cancer. However, their applications have significant limitations, because of low specificity, i.e. small probability of samples with no biomarkers in all non-diseased samples. In fact, the issue of specificity has become much more acute, since more than 30% or higher circulatory PSA level patients have to go for extensive testing and treatment, indicating its lack of specificity of prostate tumor detection [1]. In summary, lack of specific serum biomarkers has impeded the change in morbidity and mortality in cancer patients.

The traditional “a priori” approach for biomarker development needs a well-established biological procedure, being subjected to two-step clinical validation: (1) simple test with a high level of quality control, and (2) planned statistical prospective evaluations within the validation pilot studies to prove an established clinical impact [2]. Contrarily, more recently “a posteriori” approaches evaluate the clinical rationale of a “biological indicator” through a systematic discovery of various screening tools (e.g. microarray, bioinformatics, High-throughput DNA sequencing). These biological instruments are “black boxes”, meaning that a clinical usage can be discovered through research pilot studies. Computational approaches give possible candidates for detecting certain diseases, by “sensitivity” and “specificity”, within a patient population, but the proper quantification of a single biomarker in serum is limited to the evaluation technicalities. Therefore this review will focus on the latter approach since the recent technologies provide a plethora of potential candidates which are in proper need of evaluation.

Over the past twenty years, biomarkers have shown significant promise in the mechanism of how it will transform a patient's treatment. Therefore, biomarker research has been aimed towards the development of personalized targeted therapy. Despite the recent technological advancements, there are still relatively few biomarkers that are in routine clinical use today [3]. With a growing number of complex genomic tests for biomarker signatures becoming commercially available, the promise of personalized medicine is fast becoming a reality. Much attention has to be placed on the reason why the promising biomarkers and the biomarkers signatures entering the clinic is a long road ahead [4].

Section snippets

Biomarker discovery validation: Three pitfalls

In this section, we first briefly show the most widely accepted guideline for evaluation and validation of biomarkers (Diagram 1): “Early Detection Research Network (EDRN)” developed by National Cancer Institute [5]. We then explain typical pitfalls and their examples of clinical biomarker evaluation failures, mainly caused by poor experimental design and inappropriate choice of the diagnostic assay.

Phase I of EDRN is the discovery of biomarkers through knowledge—based gene selection, gene

Computational measures for evaluating biomarkers

Diagram 2 is a 2 × 2 contingency table, showing cross-correlate disease status with biomarker presence. This table is obtained, under a pre-determined threshold for the biomarker test, to examine the usefulness of the candidate biomarker in the diagnostic, prognostic, predictive appraisal of the disease. This table has four cells with samples labeled by True Positive (TP), False Negative (FN), False Positive (FP) and True Negative (TN), which are mainly used to compute the measures introduced

Current biomarkers

Table 1 provides a comprehensive comparison of current potential and approved biomarkers, including 26 biomolecules, 2 metabolic biomarkers and 4 cell biomarkers. A unique feature of this table is positive and negative predictive values and DOR, which are extracted from corresponding references and attached to all biomarkers. Furthermore, one marker can be used for evaluating more than one cancer type. For example, heat shock proteins (HSPs) are used for five types: gastric, prostate,

Discussion

As explained in Section 3.8, OVA1 (approved in 2009) for ovarian carcinoma with five biomarkers including CA-125 had to show weaker performance in terms of specificity and positive predictive value than CA-125 alone (approved in 1997). This implies the need of improved statistical quantification methods to decrease the lack of specificity in the current instrumentation methods for low abundance proteins. This point brought a mentality shift focusing on a more robust statistical predictive tool,

Conflicts of interest

Disclosure of potential conflicts of interest.

The authors declare that they have no potential conflict of interest disclosed.

Funding sources

This work was partially supported by JSPS KAKENHI #2430054 and #26-381. S.Y. is supported by Grant-in-Aid for JSPS Fellows.

Funding disclosures

The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Reviewers

Professor Christoph Thomssen: Martin-Luther-University Halle-Wittenberg, Gynaecology, University Hospital Halle, Ernst-Grube-Strasse 40, D-06097- Halle an der Saale, Sachsen-Anhalt, Germany.

Dr Dan Ruderman: Department of Medicine, USC Keck School of Medicine, 2250 Alcazar St., IGM (CSC) 243, Los Angeles, CA 90033, USA.

Acknowledgments

We acknowledge stimulating discussions with Dr. Ajit Bharti (Boston University, USA) concerning the importance of evaluation techniques of biomarkers and how it plagues the potential candidates in the pipeline. We would also like to thank Dr. Edda Klipp (Humboldt Universitat zu Berlin, Germany) for her thoughtful review and comments on many points in this manuscript.

We are grateful to the Editor as well as the two reviewers for their very helpful comments.

Sohiya Yotsukura is a PhD candidate of the School of Pharmaceutical Sciences at Kyoto University. Her research interests include computational biology and machine learning techniques to analyze the functional impact of SNPs and tumorigenic somatic mutations on biological systems.

References (107)

  • Y.U.S. Sekijima et al.

    High prevalence of wild-type transthyretin deposition in patients with idiopathic carpal tunnel syndrome: a common cause of carpal tunnel syndrome in the elderly

    Hum Pathol

    (2011)
  • R.L.R. Barroso-Sousa et al.

    Decreased levels of alpha-1-acid glycoprotein are related to the mortality of septic patients in the emergency department

    Clinics (Sao Paulo)

    (2013)
  • X.C.K.M. Badoux et al.

    Cyclophosphamide, fludarabine, alemtuzumab, and rituximab as salvage therapy for heavily pretreated patients with chronic lymphocytic leukemia

    Blood

    (2011)
  • V.I.R.D. Seliger et al.

    The predictive potential of the sweat chloride test in cystic fibrosis patients with the G551D mutation

    J Cyst Fibros

    (2013)
  • W.L.C.G. McGuire

    Prognostic factors and treatment decisions in axillary-node-negative breast cancer

    N Engl J Med

    (1992)
  • A.K. Fuzery et al.

    Translation of proteomic biomarkers into FDA approved cancer diagnostics: issues and challenges

    Clin Proteomics

    (2013)
  • D. EP

    The failure of protein cancer biomarkers to reach the clinic: why, and what can be done to address the problem?

    BMC Med

    (2012)
  • M.S.E.R. Pepe et al.

    Phases of biomarker development for early detection of cancer

    J Natl Cancer Inst

    (2001)
  • M.S.S. Kumar

    Biomarkers of diseases in medicine

    Curr Trends Sci

    (2009)
  • E.K.K. Drunker

    Pitfalls and limitations in translation from biomarker discovery to clinical utility in predictive and personalised medicine

    EPMA J

    (2013)
  • S.E. Kern

    Why your new cancer biomarker may never work: recurrent patterns and remarkable diversity in biomarker failures

    Cancer Res

    (2012)
  • G. HB

    Are biomarkers for bladder cancer beneficial?

    J Urol

    (2010)
  • Y.S.Z. Xu et al.

    Lysophosphatidic acid as a potential biomarker for ovarian and other gynecologic cancers

    JAMA

    (1998)
  • D. EP

    Cancer biomarkers: can we turn recent failures into success?

    J Natl Cancer Inst

    (2010)
  • D.L. Baker et al.

    Plasma lysophosphatidic acid concentration and ovarian cancer

    JAMA

    (2002)
  • E.K.K. Drucker

    Pitfalls and limitations in translation from biomarker discovery to clinical utility in predictive and personalised medicine

    EPMA J

    (2013)
  • S.B.F. Lewington et al.

    A review on meta-analysis of biomarkers: promises and pitfalls

    Clin Chem

    (2012)
  • A.J. Buckler et al.

    A novel knowledge representation framework for the statistical validation of quantitiative imaging biomarkers

    J Digit Imaging

    (2013)
  • D. EP

    Early prostate cancer antigen-2 (EPCA-2): a controversial prostate cancer biomarker?

    Clin Chem

    (2010)
  • D.H.H. Bohning et al.

    A limitation of the diagnostic-odds ratio in determining an optimal cut-off value for a continuous diagnostic test

    Stat Methods Med Res

    (2011)
  • J. Attia

    Moving Beyond sensitivity and specificity: using likelihood ratios to help interpret diagnostic tests

    Aust Prescr

    (2003)
  • B.S.S. Grund

    Analysis of biomarker data: logs, odds ratios and ROC curves

    Curr Opin HIV AIDS

    (2010)
  • P. Sonego et al.

    ROC analysis: applications to the classification of biological sequences and 3D structures

    Brief Bioinform

    (2008)
  • D.G.P.C. Warnock

    A roadmap for biomarker qualification

    Nat Biotech

    (2010)
  • Y.C.T. Zheng et al.

    Application of the time-dependent ROC curves for prognostic accuracy with multiple biomarkers

    Biometrics

    (2005)
  • M.J. Pencina et al.

    Evaluating the added predictive ability of a new marker: from area under the ROC curve to reclassification and beyond

    Stat Med

    (2008)
  • J.A.M.B. Hanley

    The meaning and use of the area under a receiver operating characteristic (ROC) curve

    J Radiol

    (1982)
  • D.A.S.K. Grimes

    Making sense of odds and odds ratios

    Obstet Gynecol

    (2008)
  • M.S.S. Lotrakul et al.

    Reliability and validity of the Thai version of the PHQ-9

    BMC Psychiatry

    (1998)
  • W.F.B. Jiang et al.

    Biomarker adaptive threshold design: a procedure for evaluating treatment with possible biomarker-defined subset effect

    J Natl Cancer Inst

    (2007)
  • J.E.B.L. Fischer et al.

    A readers’ guide to the interpretation of diagnostic test properties: clinical example of sepsis

    J Intensive Care Med

    (2003)
  • M.S.F.Z. Pepe et al.

    Integrative the predictiveness of a marker with its performance as a classifier

    Am J Epidemiol

    (2007)
  • M.S.J.H. Pepe et al.

    Limitations of the odds ratio in gauging the performance of a diagnostic, prognostic, or screening marker

    Am J Epidemiol

    (2004)
  • J. Abraham

    OVA1 test for preoperative assessment of ovarian cancer

    Community Oncol

    (2010)
  • F.R.D.C. Ueland et al.

    Effectiveness of a multivariate index assay in the preoperative assessment of ovarian tumors

    Obstet Gynecol

    (2011)
  • Chudeka-Glaz AC-PA et al.

    Preoperative diagnostic performance of ROMA (Risk of Ovarian Malignancy Algorithm) in relation to etiopathogenesis of epithelial ovarian tumors

    J Mol Biomark Diagn

    (2013)
  • M.D.E. Montagnana et al.

    The ROMA (Risk of Ovarian Malignancy Algorithm) for estimating the risk of epithelial ovarian cancer in women presenting with pelvic mass: is it really useful

    Clin Chem Lab Med

    (2011)
  • E.S.Q. Xu et al.

    Association of mitochondrial DNA copy number in peripheral blood leukocytes with risk of esophageal adenocarcinoma

    Carcinogenesis

    (2013)
  • L.X.J. Chen et al.

    Identifying cancer biomarkers by network-constrained support vector machines

    BMC Syst Biol

    (2011)
  • T. Gilligan

    The new data on prostate cancer screening: what should we do now?

    Cleve Clin J Med

    (2009)
  • Cited by (0)

    Sohiya Yotsukura is a PhD candidate of the School of Pharmaceutical Sciences at Kyoto University. Her research interests include computational biology and machine learning techniques to analyze the functional impact of SNPs and tumorigenic somatic mutations on biological systems.

    Hiroshi Mamitsuka is a professor at the Institute for Chemical Research in Kyoto University, jointly appointed as a professor at the School of Pharmaceutical Sciences of the same university. His research interests include machine learning, data mining and their applications in bioinformatics and chemoinformatics.

    View full text