eLetters is an online forum for ongoing peer review. To submit an eLetter please go to the article you wish to respond to and click on the link that reads "eLetters: Submit a Response." Submission of eLetters are open to all health care professionals and experts in related fields.

eLetters to:

REVIEW ARTICLE:
Alex R. Kemper, David K. Wallace, and Graham E. Quinn
Systematic Review of Digital Imaging Screening Strategies for Retinopathy of Prematurity
Pediatrics 2008; 122: 825-830 [Abstract] [Full text] [PDF]
*eLetters: Submit a response to this article

eLetters published:

[Read eLetters] Telemedicine is Highly Effective When Screening for Referral-Warranted Retinopathy of Prematurity
Darius M. Moshfeghi, Kimberly Drenser, Michael T. Trese, and Antonio Capone, Jr.   (23 December 2008)
[Read eLetters] Design and analysis of ophthalmic studies
Mamtha Balasubramaniam, Antonio Capone Jr, Kimberly Drenser, and Michael T. Trese   (13 February 2009)

Telemedicine is Highly Effective When Screening for Referral-Warranted Retinopathy of Prematurity 23 December 2008
 Next eLetters Top
Darius M. Moshfeghi,
Vitreoretinal Surgeon
Stanford Universtiy,
Kimberly Drenser, Michael T. Trese, and Antonio Capone, Jr.

Send letter to journal:
Re: Telemedicine is Highly Effective When Screening for Referral-Warranted Retinopathy of Prematurity

dariusm{at}stanford.edu Darius M. Moshfeghi, et al.

Drs. Kemper, Wallace, and Quinn reviewed the application of telemedicine for retinopathy of prematurity (ROP) screening and concluded that “insufficient data to recommend that retinal imaging be adopted by NICUs” based largely on claims that “retinal imaging would almost surely miss some cases of sight-threatening ROP.”(1) In examining the data, we come to the opposite conclusion—telemedicine is a highly effective mechanism for identification of sight-threatening ROP when used in conjunction with confirmatory bedside binocular indirect ophthalmoscopy (BIO).

How can it be that two groups come to such disparate conclusions? The short answer is we and the authors of the present study are hoping to achieve different goals through telemedicine screening for ROP. Is the goal of telemedicine screening to identify all ROP present in the eye or is it to identify ROP that would benefit from bedside BIO by an ophthalmologist “who has sufficient knowledge and experience to enable accurate identification and location and sequential retinal changes of ROP.”(2) We hold to the latter definition. A properly designed screening test includes a threshold above which an action will be taken. In the case of telemedicine screening for ROP, the threshold would be referral to an ophthalmologist experienced in ROP for a BIO examination.

Presently, national guidelines in the U.S.A. have two components on the actual physical aspects of screening: 1) the technique should be binocular indirect ophthalmoscopy and 2) the examiner should be an “ophthalmologist who has sufficient knowledge and experience to enable accurate identification and location and sequential retinal changes of ROP.”(2)

In their analysis, Kemper et al preferentially included studies that included “referral-warranted ROP” as defined by Ells, et al, as the gold standard.(3) Unfortunately, numerous design flaws were included in the trials analyzed. The first two papers share following three flaws for applicability to telemedicine screening for ROP today:

1. They do not include specific identification of referral-warranted ROP.

2. They use an outmoded contact RetCam 120 hand piece that was never intended for ROP screening and was incompatible with the specula used, thereby precluding adequate visualization of the fundus, ably depicted in Figure 1 (Yen, et al) and Figure 3 (Roth, et al) and discussed in both papers as a flaw.

3. They only evaluated an infant a maximum of twi time points in order to make a determination of the presence of ROP and its ability to predict future advanced disease. (4, 5)

Furthermore, in the paper by Yen, et al, “masked readers” were utilized, but no mention is made of their specific experience in ROP or their previous experience in evaluation of ROP images for screening purposes.(4) The former is a stipulated requirement of the national screening guidelines,(2) while the second is a recognized learning curve for the technique of telemedicine.

Most importantly, the authors of the papers included in the analysis by Kemper et al took the unusual approach of looking at the images in a vacuum—two non-sequential points in time—to ascertain the presence of any stage of ROP (not referral-warranted) and for presence of pre-threshold or threshold.(4, 5) This is not how telemedicine is employed here in the U.S. and throughout the world, where the technique involves weekly, sequential, longitudinal viewing to ascertain the presence of disease and its rate of progression. Frankly, while interesting from a purely hypothetical perspective, it is akin to allowing a bedside BIO examiner to make an interpretation at only two time points, which is not in adherence with the national guidelines regarding disease activity.(2) A careful reading of the Yen paper reveals that they used two screening points not “…to test any particular screening strategy itself, but simply to determine the amount of useful information that could be gleaned from the images in predicting outcome.”(6) Hence, the data results focused on the ability to predict pre-threshold and threshold, as opposed to actual detection. The Roth paper similarly focused on the detection of ROP, as opposed to referral-warranted ROP.(5) This paper, included 59 eyes of 32 patients for a total of 100 simultaneous BIO and photographic examinations.(5) This leads to a situation where a maximum of 2 examinations per eye were performed in order to make a determination of the disease state, something that is not recommended in clinical practice.(2) This paper does not meet the authors stated goals of including “referral-warranted ROP”, as no mention is made of the need for treatment or predictive ability for referral-warranted disease.

Three papers by Chiang, et al, are included for review, all three involving data obtained from the Jackson Memorial Hospital from January 1, 1999, to December 31, 2000.(7-9) This is one year after the study reported by Roth, et al, at the same institution and a similar author (JTF)—and all three papers report using the same equipment at the same institution that was found to be deficient for evaluating the fundus by Roth, et al, and Yen, et al.(5-9) Additionally, the central weakness of these three papers is that the three examiners in each paper admittedly were inexperienced in ROP screening in general,(7-9) and telemedicine in particular, again not adhering to national screening guidelines,(2) making the data difficult to interpret. It is interesting to note that when 3 trained, experienced ROP screeners are utilized, the sensitivity rose to 100% for infants requiring treatment in a follow up study by Chiang, et al.(10)

Finally, the one paper that performed prospective longitudinal telemedicine ROP screening in accordance with inclusion criteria stated by the authors of the present paper reported a sensitivity of 100% and specificity of 96% for identification of referral-warranted ROP.(3)

The experience of one of the authors herein has been that in 3 years, no case of referral-warranted or treatment-warranted ROP has been missed in the Stanford University Network of Retinopathy of Prematurity (SUNDROP) telemedicine network.(11, 12) These reports are retrospective in nature, however there has been 100% capture of all infants discharged from the hospital with office BIO by the same experienced ROP examiner who performed the telemedicine screening (DMM) with no adverse anatomic or functional outcomes. For 3 years SUNDROP has provided telemedicine screening for ROP to 4 separate NICUs served by a central reading center and has demonstrated the ability of telemedicine to serve as an arbiter of which babies at risk for ROP need an immediate confirmatory bedside BIO examination.

Therefore, based upon the studies detailed above that meet the stated entry criteria in the paper by Kemper et al1, and upon our own extensive experience (11,12), it is our position that for the stated goal of identifying ROP patients that would benefit from BIO, telemedicine has met the requirements of an effective screening mechanism for ROP.

REFERENCES

1. Kemper AR, Wallace DK, Quinn GE. Systematic review of digital imaging screening strategies for retinopathy of prematurity. Pediatrics 2008;122:825-830.

2. Screening examination of premature infants for retinopathy of prematurity. Pediatrics 2006;117:572-576.

3. Ells AL, Holmes JM, Astle WF, et al. Telemedicine approach to screening for severe retinopathy of prematurity: a pilot study. Ophthalmology 2003;110:2113-2117.

4. Yen KG, Hess D, Burke B, Johnson RA, Feuer WJ, Flynn JT. Telephotoscreening to detect retinopathy of prematurity: preliminary study of the optimum time to employ digital fundus camera imaging to detect ROP. J AAPOS 2002;6:64-70.

5. Roth DB, Morales D, Feuer WJ, Hess D, Johnson RA, Flynn JT. Screening for retinopathy of prematurity employing the retcam 120: sensitivity and specificity. Arch Ophthalmol 2001;119:268-272.

6. Yen KG, Hess D, Burke B, Johnson RA, Feuer WJ, Flynn JT. The optimum time to employ telephotoscreening to detect retinopathy of prematurity. Trans Am Ophthalmol Soc 2000;98:145-150; discussion 150-141.

7. Chiang MF, Starren J, Du YE, et al. Remote image based retinopathy of prematurity diagnosis: a receiver operating characteristic analysis of accuracy. Br J Ophthalmol 2006;90:1292-1296.

8. Chiang MF, Keenan JD, Starren J, et al. Accuracy and reliability of remote retinopathy of prematurity diagnosis. Arch Ophthalmol 2006;124:322-327.

9. Chiang MF, Keenan JD, Du YE, et al. Assessment of image-based technology: impact of referral cutoff on accuracy and reliability of remote retinopathy of prematurity diagnosis. AMIA Annu Symp Proc 2005;126- 130.

10. Chiang MF, Wang L, Busuioc M, et al. Telemedical retinopathy of prematurity diagnosis: accuracy, reliability, and image quality. Arch Ophthalmol 2007;125:1531-1538.

11. Murakami Y, Jain A, Silva RA, Lad EM, Gandhi J, Moshfeghi DM. Stanford University Network for Diagnosis of Retinopathy of Prematurity (SUNDROP): 12-month experience with telemedicine screening. Br J Ophthalmol 2008;92:1456-1460.

12. Silva RA, Murakami Y, Jain A, Gandhi J, Lad EM, Moshfeghi DM. Stanford University Network for Diagnosis of Retinopathy of Prematurity (SUNDROP): 18-month experience with telemedicine screening. Graefes Arch Clin Exp Ophthalmol 2009;247:129-136.

Conflict of Interest:

Scientific Advisory Board, Clarity Medical Systems Inc. (Dublin, CA; makers of the RetCam family of widefield digital cameras—DMM (paid Q4 2006-Q1 2008, unpaid Q2 2008-present), MTT

Design and analysis of ophthalmic studies 13 February 2009
Previous eLetters  Top
Mamtha Balasubramaniam,
Senior Biostatistician
William Beaumont Hospital,
Antonio Capone Jr, Kimberly Drenser, and Michael T. Trese

Send letter to journal:
Re: Design and analysis of ophthalmic studies

mbalasubramaniam{at}beaumont.edu Mamtha Balasubramaniam, et al.

Published article:

 

 

Design and analysis of ophthalmic studies

 

Mamtha Balasubramaniam, MSa, Antonio Capone Jr, MDb, Kimberly Drenser, MD, PhDb, Michael T. Trese, MDb

 

a Sr Biostatistician

  Division of Biostatistics

  William Beaumont Hospital Research Institute

  Royal Oak, Michigan

 

b Associated Retinal Consultants PC

  Royal Oak, Michigan

 

 

 

Corresponding Author:  

 

Mamtha Balasubramaniam, MS

William Beaumont Hospital Research Institute

Division of Biostatistics

3911 West 13 Mile Rd

Royal Oak, MI 48073

 

Tel: (248) 551-5072

Fax:  (248) 551-9591

E-mail: mbalasubramaniam@beaumont.edu

 

 

The design and analytical issues raised by Kemper et al1 are generally true and applicable to all ophthalmic data, not just retinopathy of prematurity (ROP) data.  The consequence of ignoring the association between both eyes of a patient impact a study depending on how it’s analyzed.  (1)  Studies where data from both eyes are combined by adding them (i.e. treating the two eyes from each patient as separate cases) are based on the underlying (but erroneous) assumption that both eyes are independent.  Doing this artificially doubles sample size and decreases standard error, and therefore makes differences statistically significant when in reality they’re not.  (2)  Those that compare the outcomes of the left eye to that of the right eye in an unpaired fashion will artificially inflate the standard errors and therefore make it harder to find significant differences (i.e. reject the null hypothesis) in outcomes if they truly exist.  (3)  As a compromise, those that analyze data from each eye separately (without combining them) do not violate the assumptions of the statistical tests.  In these cases, as long as the results and inferences are presented for each eye, the errors are minimal.  Presenting the data in this manner is acceptable especially when the treatment course of action for each eye varies.  However, all three commonly used approaches fail to capture what added impact the inherent association between the two eyes has on the differences in outcome.  Keeping this in mind, it is unclear to what purpose the authors’ computation of 95% confidence intervals (CI) (when not given) serves, whether it is on combined data or not.  It also appears that they used the traditional calculations of the 95% CI.  For computations like sensitivities, specificities, etc. based on binomial data, the Score confidence intervals provide much better coverage2.  Nor have they differentiated between clinical and non-clinical/quality control outcomes as the corresponding analytical approach varies.  The PHOTO-ROP3 trial did in fact present results and make inferences based on data for each eye.  Furthermore, they concluded that RetCam® could not supplant bedside examinations.  These conclusions were consistent for each eye.

 

What makes ROP data particularly challenging is the symmetric nature of the disease.  And by that we mean, if it develops in one eye, 80%-85% of the time it will develop to the same degree of severity in the fellow eye which is consistent with the authors’ statements about the association.  The table below presents the general framework within which ophthalmic (including ROP) data are obtained for many research studies and some of the more appropriate statistical techniques that should be used to analyze them.

 

Study Design

Only one eye

Both eyes

One or both eyes

Cross-sectional

Tests of independence like Chi-square tests, Fisher’s Exact test, Student’s t-test, linear/logistic regression, etc.

Statistical tests for paired data like paired t-test, McNemar’s test, Kappa statistics, conditional logistic regression, etc.

Analytical techniques based on clustered data like analysis of covariance using Randomized Complete Block Designs (RCBD), etc.

Longitudinal

Analytical techniques for data with association present only in one-dimension (i.e. over time) like repeated measures ANOVAa or GEEb models, etc.

Analytical techniques for clustered data based on paired differences or averages with association present in two dimensions (i.e. between each pair of eyes at each time point, and over all time points) like repeated measures ANOVAa, mixed models like GEEb, MCMCc models, hierarchical/multilevel models, etc.

Analytical techniques for clustered data based on the association present in one or two dimensions (i.e. between both eyes at each time point and then over all time points) like repeated measures ANOVAa, mixed models like GEEb, MCMCc models, hierarchical/multilevel models, etc.

aAnalysis of Variance bGeneralized Estimating Equations cMarkov Chain Monte Carlo

 

While interest among pediatricians lie at the patient-level, those of ophthalmologists are at the eye-level and eventually how it impacts patient-care.  The study-design recommendations made by the authors are those that need to be dealt with while analyzing ophthalmic data.  While most ophthalmic studies are well designed from a clinical and statistical perspective, they often are not analyzed using the right statistical techniques that take into account the inherent association between the left and right eyes of a patient, or for that matter, multiple procedures performed on the same eye of a patient or varying number of visits per patient.  Association in ophthalmic data typically occurs on two levels in a hierarchical structure – at the eye-level and at the patient-level, with the eyes nested within a patient.  Furthermore, when these studies are longitudinal in design, repeated measurements on outcomes or predictors obtained over time from each patient for each eye are correlated over time as well.  And these primary and/or secondary outcome(s) repeatedly measured are typically categorical (and not continuous).  Hence analytical techniques need to capture the effect that both patient-level and eye-level data have on the outcome as it impacts patient care.  This can be effectively accomplished by taking a modeling approach and constructing appropriate mixed models like GEE and/or multilevel models and their variations.  In this regard, the authors are not entirely correct in their recommendations for future studies to look at patient-level data instead of eye-level data.

 

Much statistical theoretical research has been conducted in this area over the last decade or two, and currently, there are many commercially available statistical software packages like SAS®, mLWin®, S-Plus®, etc. and R® (free and open source) that can efficiently handle this type of sophisticated modeling that is required to answer these clinically important questions.

 

 

References:

 

1         Kemper AR, Wallace DK, Quinn GE. Systematic review of digital imaging screening strategies for retinopathy of prematurity. Pediatrics 2008;122:825-830.

2         Agresti A and Coull BA. 1998. Approximate is better than exact for interval estimation of binomial parameters. Amer. Statist. 52: 119-126.

3         Photographic Screening for Retinopathy of Prematurity Cooperative Group. The Photographic Screening for Retinopathy of Prematurity Study (Photo-ROP): primary outcomes. Retina. 2008;28 :S47 –S54.

Conflict of Interest:

None declared