Inter-observer Variance and Patient Heterogeneity Influencing The Treatment of Grade I Spondylolisthesis.

2020 
Abstract Background Context Despite well done randomized clinical trials, the role of fusion as an adjunct to decompression for the treatment of patients with degenerative spondylolisthesis remains controversial. There is substantial variation in the use of fusion as well as the techniques used for fusion for a population of patients all described by a single ICD10 code. Purpose We sought to investigate the source of the variation in the perceived role of fusion by looking at surgeon as well as patient specific factors. Study Design Prospective cohort study examining the variability of recommendations from an expert panel of surgeons based imaging and clinical vignettes. Patient Sample Patients with degenerative spondylolisthesis and stenosis Outcome Measures A six-category treatment schema based on level of invasiveness of proposed surgeries with one through three representing non-fusion strategies and categories four through six representing fusion strategies. Methods The authors are conducting the ongoing spinal laminectomy versus instrumented pedicle screw (SLIP) II study in which patients with grade one degenerative spondylolisthesis and stenosis are randomized to two groups: a review group in which patients are treated as per recommendations of an expert panel and a non-review group in which patients are treated as per the referring surgeon's preference. In the former (review group), clinical vignettes and radiographic studies were evaluated by an expert panel of spine surgeons. The panel then provided these recommendations to the referring surgeon. We investigated the underlying variability by looking both at the number of similar or different recommendations received by an individual patient (surgeon related variability) as well as the number of similar or different recommendations offered by individual surgeons across the population of patients (patient heterogeneity). Agreement between surgeons for fusion versus non-fusion (Categories 1-3 versus 4-6) was calculated using a Kappa value from a mixed effects logistic regression model. We looked at Kappa for agreement and weighted Kappa for association of ratings on the ordinal 1-6 scale with a mixed effects linear regression model. Additionally, we analyzed the summary of data between patients after averaging the rater scores within patients. Similarly, we summarized the data between surgeons after averaging their scores over the patients that each surgeon reviewed. Results One hundred and fourteen patients received 1463 treatment recommendations. On average, fusion was recommended 58.5% of the time. Overall agreement was low, and perfect agreement on the need for fusion was seen in only 24 (21.1%) of patients. Kappa statistic for agreement on fusion was 0.378 (95% CI 0.324 to 0.432). The average score across surgeons was 4.2 (0.6) with a range of 3 to 5.3. The most common single recommendation was for fusion with interbody fusion (40.8%) and the lowest was for decompression with non-instrumented fusion (0.5%). Conclusions We demonstrated variability in surgical approach when individual patients were evaluated by a panel of surgeons indicating that even "expert" surgeons disagree with each other regarding the need for fusion in individual patients. We were also able to demonstrate that individual patients received consistent recommendations that were very different from those received by other individuals evaluated by the same surgeons. This indicates that there is patient-related heterogeneity driving variability independent of surgeon factors.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    18
    References
    2
    Citations
    NaN
    KQI
    []