This White Paper is an informational December 2019 solicitation of feedback rather than a statement of final policy. It presents multiple analytic options and highlights methodological issues HHS is considering; comments were due January 6, 2020. The paper makes clear these options were developed from internal analyses, stakeholder input, and testing with available HHS-RADV data to inform potential future rulemaking rather than to change policy immediately.
Analytic issues identified include how to size IVA samples and treat small issuers (current practice uses a default IVA sample size of 200 for most issuers with finite population corrections or smaller fixed minimums for small populations, and exemptions for issuers with ≤ 500 billable member months or ≲ $15,000,000 in annual premiums). The paper discusses options to vary sample sizes, allow issuers to elect larger samples, or re-evaluate the standard sample using national HHS-RADV data to balance precision targets (a 10% two-sided 95% CI goal) against operational burden.
The White Paper also outlines identified problems and candidate approaches for outlier detection and error-rate calculation. For outlier logic it notes the current national, static confidence-interval approach does not adjust for issuer HCC counts and that HCC grouping counts below 30 reduce practical confidence below the 95% theoretical level. Options include issuer-specific confidence intervals (bootstrapping, binomial, McNemar, Bayesian), multiple national CIs by HCC count, or machine learning, and approaches to account for HCC hierarchy interactions or to assess pre/post RADV score differences directly.
To address the "payment cliff" and incentives around negative error rates, the paper presents alternatives including adjusting only to confidence interval edges, adjusting only for positive outliers, sliding-scale adjustments between the CI edge and group mean (various z-score ranges), and temporarily constraining negative outlier issuer failure rates to 0 when calculating the Group Adjustment Factor (GAF). These are presented as options for comment and further testing rather than adopted changes.