|
|
||||||||
Critical Issues in Dental Education |
Key words: radiographic interpretation, periodontology, faculty development, dental faculty, dental hygiene faculty, educational research, student assessment
Submitted for publication 06/20/05; accepted 10/24/05
| Abstract |
|---|
|
|
|---|
Accurate and consistent identification of crestal bone and its relationship to the CEJ is important for initial diagnosis and long-term evaluation of the periodontium. Previous work has revealed inaccuracy and variability among dental school faculty in quantifying interproximal bone loss.8 This is particularly problematic in an academic setting where patients may be monitored by several dental school faculty during the course of their periodontal therapy and faculty members are responsible for teaching and assessing students radiographic interpretation.
The manner by which radiographs are viewed may impact the determination of bone loss. Inaccurate and inconsistent ratings of bone loss among clinical instructors have been reported previously using multiple computer monitors for displaying digitized radiographic images.8 This may have been a result of nonstandardized image projection via computer monitors or use of digitized radiographic images since viewing radiographs in this way is not what most clinicians are accustomed to. Most clinical instructors use a view box to view plain films in the dental setting. It may be that any viewing system that varies from what clinicians are accustomed to could affect the accuracy and consistency of their ratings. In this investigation existing plain film radiographs meeting specific criteria and digitized images of the plain films were viewed using a view box and single LCD projector, respectively. The use of a single method for projecting digitized images offered the advantage of standardized image projection during a group teaching session. Therefore, the purpose of this investigation was to determine if the method of radiographic viewing affected the accuracy and consistency of clinical instructors rating of percent bone loss.
| Methods |
|---|
|
|
|---|
The "actual" amount of bone loss was determined by three authors (SKL, HJT, and PSR), independently, on plain film radiographs via standard view box in an artificially lit room using a Schei ruler to the nearest 5 percent.10 The Schei ruler used was a plastic transparent ruler with a 2 mm thick marking at its margin and a series of equidistant lines radiating from a center point, each representing 5 percent bone loss. The ruler was placed on the tooth in question with the 2 mm thick marking at the tooths radiographic CEJ; then the ruler was moved until the last radius covered the tooths radiographic apex or in the case of teeth with multiple roots the tooths most apically positioned radiographic apex. That is, bone loss for multirooted teeth was determined using the tooths longest root. The "actual" amount of bone loss was determined by identifying the position of the alveolar bone crest relative to the rulers markings. The level of the bone crest was deemed to be at the point along the roots surface where an intact lamina dura was observed. Teeth with mesial and distal percent bone loss discrepancies were judged by the greater percentage of the two. One discrepancy in rating bone loss occurred among the authors and was discussed until consensus was reached. A computer-generated grid representing the bone loss categories as described below was created and superimposed on the teeth to verify the correct choice category on digitized images independently by two authors (SKL and HJT).
Radiographs included anterior and posterior teeth, maxillary and mandibular teeth, single- and multirooted teeth, teeth with no bone loss, and others with various amounts of bone loss (Table 1
). Teeth were grouped into four descriptive categories based on severity of bone loss. These categories included bone loss of none, <15 percent, 1530 percent, and >30 percent based on American Dental Association (ADA) and American Academy of Periodontology (AAP)1013 guidelines as outlined in the schools clinic manual for gingivitis, mild, moderate, and severe periodontitis, respectively. For the purpose of statistical analysis, numbers were assigned to each category as follows: none, <15 percent, 1530 percent, and >30 percent. The first twenty teeth were viewed by both the view box and projection system. All five teeth in the none category had 0 percent bone loss. The three teeth in category <15 percent were determined to have an actual bone loss at 10 percent. Of the six teeth in category 1530 percent, four had actual bone loss at 15 percent, one had actual bone loss at 20 percent, and one had actual bone loss at 30 percent. Three of the six teeth in category >30 percent bone loss had actual bone loss at 35 percent, two had actual bone loss at 40 percent, and one at 50 percent. The projection system used twenty-five teeth. Added teeth included one tooth at 0 percent bone loss, two at 5 percent (category <15 percent), one at 10 percent (category <15 percent), and one at 25 percent (category 1530 percent). Teeth viewed during both occasions were paired for purposes of statistical analysis.
|
The first item on the questionnaire asked clinical instructors to identify themselves as dental hygiene faculty, graduate student, or periodontal faculty. The second question asked them to quantify their years of clinical experience as <5, 510, or >10 years. The remaining questions asked clinical instructors to quantify percent bone loss for indicated teeth as none, <15 percent, 1530 percent, and >30 percent bone loss for the reasons described above. Written instructions were provided to ensure consistent viewing practices among clinician instructors. Clinical instructors were instructed to determine bone loss from 2 mm apical from the CEJ to the root apex. Teeth with mesial and distal percent bone loss discrepancies were to be rated by the greater percentage of the two. Over a three-week period, clinical instructors independently viewed radiographs and completed the questionnaire.
Four weeks later the clinical instructors viewed digitized radiographic images given in random order as a group and individually answered the questionnaire again. The two occasions of radiographic viewing were scheduled four weeks apart to optimize clinical instructors participation given their other teaching, clinical, and research responsibilities. Written and verbal instructions again ensured consistent viewing practices among clinician instructors, and percent bone loss was determined in the same manner as before. For each question, the instructors were given at least thirty seconds to record their response on the questionnaire and transmit their response via wireless remote. An audience response system using the remote was used to capture their responses, which were submitted anonymously either on paper or via wireless remote. Discrepancies between written and transmitted responses were omitted from the research database. Correct choices were presented to the instructors only after observing the radiographs by both viewing systems.
The Kappa coefficient described both agreements between the two viewing systems and accuracy (agreement with the correct choice). While sensitivity and specificity are typically used as indices of accuracy, they are not defined in situations with more than two categories. Accuracy was also measured by differences from the correct choice in two ways. One dependent variable was the difference between the clinical instructors rating and the correct choice; this variable is indicated as "difference" in all tables. This difference is thus the signed rater error and reflects net deviation from the correct choice in one direction. A positive difference indicates an overestimation of bone loss, and a negative difference indicates underestimation of bone loss. The second dependent variable used in the final analysis was the absolute value of this difference. A zero indicates a correct choice, and a positive value reflects overall deviation from the correct choice in either direction. This variable is indicated as "absolute" in all tables. Both the arithmetic difference and absolute difference are necessary because there may be zero average difference while the absolute difference is non-zero, and if there is non-zero absolute difference, it is necessary to describe the direction of the difference. Disagreement was analyzed using repeated-measured, mixed-models analysis with the following effects in the ANOVA model: three clinical instructor groups, four correct choice categories, twenty-five radiographs, two viewing systems, and all possible two-way interactions of these effects. This allowed for dependency of the ratings done by the same clinical instructor across both the multiple radiographs and two viewing systems.
Accurate ratings are consistent since they all center on the correct choice. Where ratings are not accurate, they may be consistent (centering around an inaccurate value with little variability) or they may be inconsistent (varying widely). Consistency is thus measured by the standard deviation (SD) of the ratings (square root of the squared difference between the ratings minus the mean of all the ratings provided). To look for differences in consistency, a mixed-model heterogeneous-variance analysis tested for standard deviation differences among the three clinical instructor groups, four correct choice categories, and two viewing systems.
| Results |
|---|
|
|
|---|
|
Eighteen clinical instructors (four dental hygiene, four graduate students, and ten periodontal) provided ratings using both viewing systems. Their ratings were directly compared, and agreement was 76.6 percent (Kappa=67.7 percent, SE=3 percent) (Table 2
, lower panel). The agreement between the viewing systems is nominally higher than accuracy of either of the two viewing systems (view box=70.0 percent and projected=64.5 percent).
The projection system was found to have a smaller mean difference than the view box (p=0.04). The view box least square (LS) mean difference (LS mean=0.32, SE=0.03, 95 percent CI=0.250.39) was significantly greater than the projection system difference (LS mean difference=0.25, SE=0.03, 95 percent CI=0.190.31). However, there was some evidence that this difference may not be consistent across the three groups (p=0.07). There was a significant improvement using the projection system only among the graduate students (uncorrected for multiple comparisons p-value=0.01). There was no view box versus projection system difference within the dental hygiene (p>0.90) or periodontal (p>0.50) faculty group.
More than 66.5 percent (886/1333) of all ratings agreed with the correct choice categories, but the differences varied between 2 and +3 (mean=0.22, SD=0.57). The interaction between clinical instructor group and correct choice category was the only interaction that reached statistical significance; however, this interaction was not consistent across the four correct choice categories (p=0.02). In the categories of <15 percent and 1530 percent bone loss, there was a significant difference among the three clinical instructor groups (Table 3). Periodontal faculty had significantly less error than the dental hygiene faculty (p=0.02), although this difference varied by the correct choice category (p=0.02). Overall, the amount of error varied with the amount of actual bone loss (p<0.0001). Teeth in category <15 percent had significantly larger amounts of average error (LS mean=0.51, 95 percent CI=0.43, 0.60), and teeth in category 1530 percent had significantly smaller error (LS mean=0.27 (95 percent CI=0.20, 0.34). The teeth with the least amount of error, the smallest differences as compared to the correct choice, were the teeth in category >30 percent bone loss (LS mean=0.03, 95 percent CI= 0.10, 0.04).
Overall, there was an overestimation of bone loss as indicated by positive mean differences (Table 3, difference column) for categories none, <15 percent, and 1530 percent bone loss. In the category <15 percent, 49.0 percent of view box ratings and 38.2 percent of projection system ratings were given as 1530 percent. In the category 1530 percent, 39.3 percent of view box ratings and 8.1 percent of projection ratings were given as >30 percent. That is, a slightly smaller proportion of clinical instructors overestimated (34.1 percent) bone loss using the projection system, but underestimation doubled to 17.5 percent.
The absolute difference did not vary depending upon the three groups (p=0.24), and there was no evidence for a view box versus projection system difference (p=0.69). However, the absolute difference varied depending on the correct choice (p<0.0001). The correct choice categories <15 percent and 1530 percent bone loss had the largest absolute differences, but they were not significantly different from one another (approximately 0.5 units).
Standard deviation differences depended upon the three clinical instructor groups, four correct choice categories, and viewing system (LR chi-square=264, df=12, p<.0001). That is, there was some indication that the projection system ratings were less consistent (higher variability) than the view box (chi-square=5.2, p=0.0222). From Table 3, the typical SD of the difference was 0.42 for the projection system and 0.40 for the view box. But the largest differences in consistency were between the four correct choice categories (chi-square=195, df=3, p<.0001). Category >30 percent had about half of the variability (SD
0.21) of the other correct choice categories. Category none had slightly more consistency (SD
0.42) than categories <15 and 1530 percent bone loss (SD
0.47). There was no difference in consistency between categories <15 and 1530 percent bone loss. Overall, there was a significant difference between the three rater groups (chi-square=21, df=1, p<.0001). Dental hygiene faculty were the most consistent but inaccurate, and this was most evident in the category >30 percent. The SD for dental hygiene was approximately 0.11, whereas the SD for the graduate students and periodontal faculty was twice that (SD
0.24). There was no evidence for differences in inconsistency between the three rater groups in the other three correct choice categories. That is, ignoring category >30 percent bone loss, the SD was 0.43, 0.46, and 0.45 for dental hygiene, graduate students, and periodontal faculty, respectively.
| Discussion |
|---|
|
|
|---|
Khocht et al.14 reported examiners rated more sites with bone loss using direct digital radiographs compared to conventional plain film viewing. However, Nair et al.15 and Furkart et al.16 found no differences between examiners ratings of percent bone loss using direct digital radiographs compared to conventional plain films. These studies compared examiners ratings to surgical measurements or expert consensus of the distance from the CEJ to alveolar bone crest. The "gold standard" used in this investigation was percent bone loss as determined by the Schei ruler.9,16 This technique was chosen since it has been found to be accurate in determining bone loss as compared to surgical measurement and it is efficient and easy to use.17 Study design, examiner experience, training and familiarity with viewing system, and use of direct radiography could have contributed to the differences between our results and the results cited above.1517 To our knowledge there are no studies that compare radiographs obtained through direct digital radiography to radiographs obtained through scanning plain films. The methods by which these radiographs are processed and projected are different and could affect the radiographs resolution, contract, brightness, and magnification. These qualities could in turn influence clinicians interpretations of the images. Graduate students had less error using projection system. Since graduate students were, on average, younger than periodontal and dental hygiene faculty, they might be more familiar with viewing projected digitized images because computer technology has been an integral part of their education and training.
To our knowledge, this is the first study to report on consistency among clinicians in determining percent bone loss using two different viewing systems. The differences found in this study could be due to clinicians lack of familiarity with the projection system for radiographic viewing. However, digital image preparation,18 magnification,19 and grey-scale manipulation19,20 could have contributed to differences in accuracy and consistency between the two viewing systems. It is important to note that differences between the viewing systems were not consistent across groups, and correct choice categories were confirmed by two authors (SKL and HJT) on both view box and projection system prior to clinical instructors viewing.
Overall, overestimation of bone loss was more common than underestimation. Albandar3 reported underestimation of proximal bone level using both conventional and direct digital radiography. Eickholz et al.19 reported that examiners underestimated bone loss by using digitally enhanced radiographs. However, Wolf et al.21 showed examiners overestimated proximal bone loss using direct digital radiography. The gold standard used in these studies was surgical measurement of the distance from the CEJ to the alveolar crest. Our results show a decrease in the overestimation of bone loss with underestimation doubling using the projection system compared to view box. Image magnification,19 grey-scale manipulation,15,20 and use of digitized radiographic images could have contributed to the differences between our results and theirs.
Overall, the amount of error varied with the amount of actual bone loss. A decrease in accuracy and consistency is not unexpected in the categories of <15 percent and 1530 percent bone loss since a clinical instructor can err on both sides of these middle categories. However, it may be that bone loss of <15 percent and 1530 percent is more difficult to judge than none or >30 percent. Or it may be that teeth and their corresponding amounts of actual bone loss selected for this study could have contributed to error seen in these two middle categories. In category 1530 percent, three of the four teeth with actual bone loss of 15 percent were judged inaccurately as <15 percent by up to thirteen clinical instructors, and bone loss for one tooth with actual bone loss of 30 percent was judged inaccurately as >30 percent by up to twenty-five clinical instructors. This pattern was not completely unexpected since actual bone loss was at the border of categories used in this study; however, other observations were less expected. For example, five teeth with 5 percent bone loss were judged inaccurately, and bone loss was overestimated as 1530 percent by up to nineteen clinical instructors.
Teeth with the same actual bone loss did not always have similar accuracy rates. For example, in category none, tooth #29 had a 97.0 percent accuracy rate yet tooth #19 had a 3.1 percent accuracy rate. Analyses were done to look for significant differences in rating percent bone loss for single versus multirooted teeth, anterior versus posterior, and maxillary versus mandibular; however, none were found. It stands to reason that since teeth are highly variable in their root length, shape, and form, the perceived percent bone loss may be very different from the actual bone loss as measured by the Schei ruler.
There were no differences in rater error between groups for teeth in categories none and >30 percent bone loss; however, in categories <15 percent and 1530 percent, periodontal faculty had significantly less error than dental hygiene faculty regardless of the viewing system used. Periodontal faculty members have more extensive training and clinical responsibilities than dental hygienists; as diagnosticians, they are more accustomed to assessing and quantifying bone loss as they diagnose periodontal diseases.
Rater error could have arisen from clinical instructors rating bone loss from a distance less than or greater than 2 mm apical from the CEJ, failure to recognize anatomical landmarks,22 or inability to judge varying percents of bone loss relative to the tooths root length. Radiographic quality, indistinguishable periodontal ligament space, alveolar crest, or root apex, bone density, and trabecular pattern could have also contributed to inaccuracy in radiographic interpretation.
Inaccuracies and inconsistencies among clinical instructors in determining percent bone loss were observed in this study. Clinical instructors accuracy was 93.8 percent in rating of percent bone loss for teeth in category >30 percent bone loss, yet accuracy dropped to between 49.0 and 55.1 percent for teeth in categories <15 and 1530 percent bone loss. Additionally, consistency among clinicians was less for these middle categories than for categories none and >30 percent bone loss. Inaccuracies and inconsistencies among clinicians are ubiquitous problems in both medicine and dentistry.2335 It is somewhat expected to have inconsistencies among clinical instructors when there are a number of subjective elements that go into making a clinical decision; this may be perfectly acceptable if decisions are based on evidence or accepted practice guidelines. However, determinations of bone loss are based on relationships between anatomical factors, which can actually be measured. Therefore, determining percent bone loss is less subjective than interpretation of other clinical findings that can not be directly measured, and inconsistencies among clinical instructors in this area are less expected and acceptable.
In the undergraduate teaching program, categories of bone loss used in this investigation (none, <15 percent, 1530 percent, and >30 percent) help establish diagnoses of gingivitis, mild, moderate, and severe periodontitis, respectively. This is not to say that radiographic bone loss is the only factor used to mark the difference between gingivitis or periodontitis or establish the severity of periodontitis. Other clinical findings such as deep periodontal pockets, advanced attachment loss, furcation involvement, and mobility often accompany increased bone loss and need to be considered in determining severity of disease. However, bone loss categories are designed to make clinical instructors and students aware of and sensitive to all diagnostic findings and potential treatment needs. Progression of bone lossfrom <15 percent to between 1530 percent, for examplecarries with it the potential need to plan for more complex treatment and/or specialty referral in order to achieve therapeutic success. Furthermore, assessment of radiographic bone loss can be used to determine the results of therapy and need for further treatment. Greenstein and Caton state that "the only subjective ways to monitor PDA (periodontal disease activity) are longitudinal assessments of probing attachment levels and radiographs."36
Inaccuracies and inconsistency of radiographic viewing among clinical instructors may be particularly problematic in a dental school setting where patients are assessed and treated by multiple clinicians. This could lead to errors in establishing diagnoses and prognoses, over- or undertreatment, and increased treatment time and cost. Inaccuracies and inconsistency among clinical instructors may also be problematic in the teaching of radiographic interpretation and determining bone loss. Students may learn to determine bone loss incorrectly or be quite confused by varying ratings among their clinical instructors, making it more difficult to relate radiographic findings to clinical findings and manage patients. In education, students are generally assessed by their ability to generate the "correct answer." Their answer is usually compared to the opinion of the said expert (i.e., the clinical instructor). If clinical instructors opinions are constantly changing, then the ability to judge student performance is lost.
Faculty development sessions focusing on review of anatomical landmarks and determining percent bone loss could enhance accuracy and consistency of radiographic interpretation among clinical instructors. This may lead to enhancement of patient management, teaching, student learning, and assessment of radiographic interpretation. Our results show slight differences in accuracy for rating percent bone loss using view box and projection system. Consistency among clinical instructors varied depending on the viewing system used. It is expected that as clinicians become familiar with the projection system for radiographic viewing, inconsistency among them will be comparable for the two systems. Viewing projected digitized radiographs by LCD projector offers several advantages in the educational setting. Many clinicians can view the same radiograph at the same time, making it easier to point out and discuss anatomical landmarks, root length and form, and rationale for rating bone loss. A computer-generated grid could be superimposed over the radiograph indicating the actual category of bone loss. Radiographic quality may be enhanced by computerized image manipulation, and storage of teaching material is less cumbersome with a digitized method. Furthermore, the methods used for processing and projecting digitized radiographic images are readily available at most institutions.
There were limitations with the model used here. Digitized radiographs were scanned using a relatively low resolution and displayed by a fixed-pixel projector. In comparing radiographs processed and displayed using these methods to plain films displayed on a view box, there may be differences in radiographs resolution, contrast, and potential for grey-scale manipulation and magnification that could affect image quality. This in turn could impact clinicians interpretations of percent bone loss. However, it is important to note that radiographic categories were confirmed independently by two author clinicians (SKL and HJT) prior to clinical instructors viewing of radiographs by LCD projector. The overall accuracy rate (agreement with correct choice obtained using the Schei ruler technique on plain films) was 85.0 percent for the periodontal faculty group viewing projected digitized images, which shows accuracy could be obtained by LCD viewing. The image quality of digitized radiographs could have been enhanced by scanning plain films at higher resolution and/or projecting radiographs at higher resolution. However, having clinical instructors view digitized radiographic images using a single high resolution laptop, for example, does not offer the advantages outlined above for group teaching. A better quality radiograph would have been produced by direct digital radiography as compared to the method used for process radiographs used here. However, direct digital radiography is not readily available at all institutions; and obtaining radiographs in this way and via the conventional method so that comparisons could be made between the two radiographic techniques would mean exposing patients to radiation twice. Additionally, Khocht et al. reported that radiographs obtained through the direct digital method and those obtained through the conventional method were not the same in that the former had a higher number of sites with bone loss.14 Other limitations of our investigation include the use of nonstandardized plain films and hence digitized radiographic images involving their exposure, angulations, and composition. One could argue, however, that those characteristics reflect real clinical situations and may not have resulted in a significant limitation. Digital image manipulation and magnification were not standardized for all radiographs. Also, the presence of 71.2 percent of teeth with actual bone loss of 15 percent or 30 percent could have contributed to greater inaccuracies and inconsistencies among instructors seen in the middle two categories. Furthermore, clinical instructors could have discussed the radiographs and rating of percent bone loss with one another throughout the course of the study, and hence "cross-talk" among raters could have influenced results. As further studies are developed, these limitations should be taken into consideration.
| Conclusion |
|---|
|
|
|---|
|
| Acknowledgments |
|---|
| Footnotes |
|---|
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
S. K. Lanning, A. M. Best, H. J. Temple, P. S. Richards, A. Carey, and L. K. McCauley Accuracy and Consistency of Radiographic Interpretation Among Clinical Instructors in Conjunction with a Training Program J Dent Educ., May 1, 2006; 70(5): 545 - 557. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |