Please wait a minute...
Frontiers of Engineering Management

ISSN 2095-7513

ISSN 2096-0255(Online)

CN 10-1205/N

Postal Subscription Code 80-905

Front. Eng    2022, Vol. 9 Issue (4) : 577-591    https://doi.org/10.1007/s42524-022-0203-7
RESEARCH ARTICLE
Heterogeneous length-of-stay modeling of post-acute care residents in the nursing home with competing discharge dispositions
Nazmus SAKIB1, Xuxue SUN2, Nan KONG3, Chris MASTERSON4, Hongdao MENG5, Kelly SMITH5, Mingyang LI1()
1. Department of Industrial and Management Systems Engineering, University of South Florida, Tampa, FL 33620, USA
2. College of Media Engineering, Communication University of Zhejiang, Hangzhou 310019, China
3. Weldon School of Biomedical Engineering, Purdue University, West Lafayette, IN 47907, USA
4. Greystone Health Network, Tampa, FL 33610, USA
5. School of Aging Studies, University of South Florida, Tampa, FL 33620, USA
 Download: PDF(2141 KB)   HTML
 Export: BibTeX | EndNote | Reference Manager | ProCite | RefWorks
Abstract

Post-acute care (PAC) residents in nursing homes (NHs) are recently hospitalized patients with medically complex diagnoses, ranging from severe orthopedic injuries to cardiovascular diseases. A major role of NHs is to maximize restoration of PAC residents during their NH stays with desirable discharge outcomes, such as higher community discharge likelihood and lower re/hospitalization risk. Accurate prediction of the PAC residents’ length-of-stay (LOS) with multiple discharge dispositions (e.g., community discharge and re/hospitalization) will allow NH management groups to stratify NH residents based on their individualized risk in realizing personalized and resident-centered NH care delivery. Due to the highly heterogeneous health conditions of PAC residents and their multiple types of correlated discharge dispositions, developing an accurate prediction model becomes challenging. Existing predictive analytics methods, such as distribution-/regression-based methods and machine learning methods, either fail to incorporate varied individual characteristics comprehensively or ignore multiple discharge dispositions. In this work, a data-driven predictive analytics approach is considered to jointly predict the individualized re/hospitalization risk and community discharge likelihood over time in the presence of varied residents’ characteristics. A sampling algorithm is further developed to generate accurate predictive samples for a heterogeneous population of PAC residents in an NH and facilitate facility-level performance evaluation. A real case study using large-scale NH data is provided to demonstrate the superior prediction performance of the proposed work at individual and facility levels through comprehensive comparison with a large number of existing prediction methods as benchmarks. The developed analytics tools will allow NH management groups to identify the most at-risk residents by providing them with more proactive and focused care to improve resident outcomes.

Keywords nursing home      predictive analytics      individualized prediction      competing risks      health outcomes     
Corresponding Author(s): Mingyang LI   
Just Accepted Date: 16 September 2022   Online First Date: 07 November 2022    Issue Date: 08 December 2022
 Cite this article:   
Nazmus SAKIB,Xuxue SUN,Nan KONG, et al. Heterogeneous length-of-stay modeling of post-acute care residents in the nursing home with competing discharge dispositions[J]. Front. Eng, 2022, 9(4): 577-591.
 URL:  
https://academic.hep.com.cn/fem/EN/10.1007/s42524-022-0203-7
https://academic.hep.com.cn/fem/EN/Y2022/V9/I4/577
Proposed Sampling Algorithm
Step 1:Compute ?i(t|xi)=?s{C,H}d^sb(t)exp?(β^sTxi), where ?i(t|xi) is the instantaneous probability of resident i with xi being discharged at time t;
Step 2:Compute Si(l)=exp?[?p=0l?i(t(p)|xi)], l=0,...,N,N+1, where Si(l) is the probability of resident i still residing in NH at time t(l); t(0)<t(1)<...<t(l)<...<t(N)<t(N+1) and t(0)=0, t(N+1)=+; {t(l)}l=1n are ordered distinct historical LOS observations;
Step 3:Randomly generate μ=Unif(0,1);
Step 4:Compute l1=max{l:μ?Si(l)}, l2=min{l:minSi(l)?μ}, and get simulated LOS, Ti as Ti=Si(l)?μSi(l1)?Si(l2)?(t(l2)?t(l1))+t(l1);
Step 5:Determine disposition state s by drawing for the categorical distribution as ws=d^sb(t)exp?(β^sTxi)?i(t|xi), s{C,H}, i.e., sCategorical(w), where w=[wC,wH]T
  
CharacteristicsMean (SD) or %
Number of stays710
Number of residents611
Demographics
Age at admission (years)76.68 (10.66)
Gender: Female64.40%
Race
Black or African American6.90%
White90.30%
Others2.80%
Marital status
Never married14.20%
Married35.60%
Widowed33.20%
Divorced15.60%
Others1.40%
Care utilization
LOS (days)20.33 (15.72)
Admission origin
Community2.00%
Hospital97.30%
Others0.70%
Discharge disposition
Community79.90%
Hospital20.10%
Primary payer: Medicare Part A59.00%
Health characteristics
Height (inches)65.36 (4.13)
Weight (pounds)170.95 (54.45)
ADL score6.48 (3.85)
Mood/Depression score0.91 (1.24)
Cognitive score13.17 (2.96)
Visual impairment15.90%
Hearing impairment22.70%
Incontinence – urinary54.50%
Incontinence – fecal46.20%
Fall within past 180 days32.70%
Fracture within past 180 days18.00%
Diseases
Cancer8.00%
Heart/Circulation79.40%
Gastrointestinal38.60%
Genitourinary25.20%
Metabolic73.80%
Musculoskeletal42.00%
Neurological26.20%
Psychiatric/Mood disorder45.90%
Pulmonary34.60%
Tab.1  Descriptive summary statistics of the selected resident cohort
Covariate short nameMDS 3.0 covariate description
AgeAge at admission (years)
ADL scoreActivities of daily living score at admission
Cognitive scoreBrief interview for mental status (BIMS) summary score at admission
Mood scoreResident mood interview patient health questionnaire (PHQ)-9 total severity score at admission
Active diagnose indicator of resident at admission
CancerCancer (with or without metastasis)
AnemiaAnemia (e.g., aplastic, iron deficiency, pernicious, and sickle cell)
Atrial fibrillationAtrial fibrillation or other dysrhythmias (e.g., bradycardias and tachycardias)
Coronary artery diseaseCoronary artery disease (CAD) (e.g., angina, myocardial infarction, and atherosclerotic heart disease (ASHD))
DVT, PE, PTEDeep venous thrombosis (DVT), Pulmonary embolus (PE), or Pulmonary thrombo-embolism (PTE)
Heart failureHeart failure (e.g., congestive heart failure (CHF) and pulmonary edema)
HypertensionHypertension
GERD or UlcerGastroesophageal reflux disease (GERD) or Ulcer (e.g., esophageal, gastric, and peptic ulcers)
UC, CD, IBDUlcerative colitis (UC), Crohn’s disease (CD), or Inflammatory bowel disease (IBD)
BPHBenign prostatic hyperplasia (BPH)
Renal diseaseRenal insufficiency, Renal failure, or End-stage renal disease (ESRD)
Neurogenic bladderNeurogenic bladder
Obstructive uropathyObstructive uropathy
MDROMultidrug-resistant organism (MDRO)
PneumoniaPneumonia
SepticemiaSepticemia
Diabetes mellitusDiabetes mellitus (DM) (e.g., diabetic retinopathy, nephropathy, and neuropathy)
HyponatremiaHyponatremia
HyperlipidemiaHyperlipidemia (e.g., hypercholesterolemia)
ArthritisArthritis (e.g., degenerative joint disease (DJD), osteoarthritis, and rheumatoid arthritis (RA))
Hip fractureHip fracture (e.g., sub-capital fractures, and fractures of the trochanter and femoral neck)
Other fractureOther fracture
Alzheimer’s diseaseAlzheimer’s disease
Non-Alzheimer’s dementiaNon-Alzheimer’s dementia (e.g., Lewy body dementia, vascular or multi-infarct dementia; mixed dementia; frontotemporal dementia such as Pick’s disease; and dementia related to stroke, Parkinson’s or Creutzfeldt-Jakob diseases)
Hemiplegia or HemiparesisHemiplegia or Hemiparesis
MalnutritionMalnutrition (protein or calorie) or at risk for malnutrition
Anxiety disorderAnxiety disorder
DepressionDepression (other than bipolar)
SchizophreniaSchizophrenia (e.g., schizoaffective and schizophreniform disorders)
PTSDPost-traumatic stress disorder (PTSD)
Respiratory failureRespiratory failure
Tab.2  List of 35 selected covariates with the associated descriptions
CovariateVoteFeature selection methods
LinearNonlinear
Stepwise AICRFE: LinearSA: LinearLASSOFiltering: Random ForestRFE: Bagged TreesRFE: Random ForestGenetic: Random ForestSA: Random ForestBoruta
Community
ADL score80%
Cognitive score70%
Mood score80%
Cancer70%
Anemia70%
Atrial fibrillation60%
Coronary artery disease30%
DVT, PE, PTE80%
Heart failure70%
Hypertension100%
GERD or Ulcer40%
UC, CD, IBD30%
BPH50%
Renal disease80%
Neurogenic bladder60%
Obstructive uropathy50%
MDRO40%
Pneumonia50%
Diabetes mellitus40%
Hyponatremia90%
Hyperlipidemia60%
Arthritis30%
Hip fracture90%
Other fracture60%
Alzheimer’s disease60%
Non-Alzheimer’s dementia90%
Hemiplegia or Hemiparesis80%
Malnutrition30%
Anxiety disorder30%
Depression40%
Schizophrenia60%
PTSD40%
Respiratory failure30%
Hospital
Age30%
ADL score50%
Cognitive score20%
Hypertension50%
GERD or Ulcer30%
Obstructive uropathy20%
Septicemia20%
Hyponatremia20%
Hyperlipidemia30%
Arthritis30%
Non-Alzheimer’s dementia40%
Hemiplegia or Hemiparesis20%
PTSD30%
Respiratory failure40%
Tab.3  Summary of covariates identified by various feature selection methods
Model familyModel short nameModel descriptionDischarge dispositions
CommunityHospital
TrainTestTrainTest
With competing risks assumption
Semi-parametric survivalSP.CoxProposed: Cox regression with non-parametric baseline hazard0.7580.7680.7070.699
SP.Cox. LassoCox regression with L1-regularization0.7280.7470.5000.500
SP.Cox.Elastic NetCox regression with L1/L2-mixed regularization0.6690.6770.6690.677
Parametric survivalP.ExponentialSurvival regression with exponential baseline hazard0.2410.2310.2920.303
P.WeibullSurvival regression with Weibull baseline hazard0.2480.2480.2930.319
P.LogisticSurvival regression with Logistic baseline hazard0.2370.2060.2910.327
P.Log.logisticSurvival regression with Log-logistic baseline hazard0.2370.2040.2840.308
P.Log.normalSurvival regression with Log-normal baseline hazard0.2390.2150.2810.313
Independent of competing risks
Data mining: LinearML.Tobit RegTobit regression0.2430.2240.2880.324
ML.Linear RegLinear regression0.2650.2290.3340.318
ML.LassoLinear regression with L1-regularization0.2840.2420.5000.500
ML.RidgeLinear regression with L2-regularization0.2660.2250.3470.352
Data mining: Tree-basedML.R.ForestRandom Forest regression0.2900.2200.4910.261
ML.Boosting TreeBoosting Tree regression0.2290.2450.2380.170
ML.TreeDecision Tree regression0.3050.3110.3080.352
Tab.4  Prediction performance (C-index) comparison between the proposed and alternative models
Fig.1  Prediction performance (C-index) comparison under different discharge dispositions.
Covariateκ^jSE(κ^j)p-value
Community discharge
ADL score?0.1130.0141.11E-15***
Mood score?0.1430.0410.0005***
Cancer?0.4360.1620.0072**
Anemia?0.2030.0970.0367*
Hypertension?0.5590.1000***
BPH?0.5150.1410.0003***
Renal disease?0.3300.1340.0136*
MDRO?0.6280.2950.0331*
Hip fracture?0.6040.2330.0096**
Other fracture?0.4210.1270.0009***
Non-Alzheimer’s dementia?0.4480.1400.0014**
Hemiplegia or Hemiparesis?0.8460.2390.0004***
Malnutrition?0.5560.1960.0045**
Re/Hospitalization
ADL score0.0870.0240.00026***
Anemia0.4820.1800.00727**
Obstructive uropathy1.0280.3070.00080***
Diabetes mellitus0.5030.1700.00311**
Tab.5  Significant covariates identified by the proposed model for each disposition
Fig.2  Marginal effect of ADL score on survival curves for community discharge.
Fig.3  Marginal effect of ADL score on survival curves for re/hospitalization.
Fig.4  Survival curves of a healthy resident under different discharge dispositions.
Fig.5  Survival curves of an unhealthy resident under different discharge dispositions.
Fig.6  Sampling algorithm prediction performance.
Fig.7  Discharge disposition prediction.
Fig.8  Comparison of prediction performance between the proposed sampling algorithm and alternative models for all discharge dispositions.
Fig.9  Comparison of prediction performance between the proposed sampling algorithm and alternative models for hospital transfer disposition.
Fig.10  Comparison of prediction performance between the proposed sampling algorithm and alternative models for community discharge disposition.
CovariateDistributionParameterAcuity scenarioLimitsSD
Less acuteMore acute
12345678
ADL scoreTruncated normalMean13579111315[0, 16]4
Mood scoreTruncated normalMean12345678[1, 8]2
Disease incidence priorBetaMean0.10.20.30.40.50.60.70.8[0, 1]
SD0.0900.1210.1380.1480.1510.1480.1380.121
Tab.6  Experimental settings of acuity scenarios in the simulation study
MetricMeasureAcuity scenario
12345678
LOSMean11.7713.2115.2518.0221.3725.4028.7431.36
SE0.200.180.230.320.330.540.720.66
Disposition: Community30-day discharge rateMean0.980.960.920.840.740.640.530.45
SE0.00090.00230.00240.00420.00510.00660.00550.0043
45-day discharge rateMean1.000.990.980.940.880.800.720.64
SE0.00020.00100.00120.00260.00380.00560.00500.0045
60-day discharge rateMean1.001.000.990.970.930.880.810.74
SE0.000090.00050.00070.00170.00270.00450.00420.0041
Disposition: Hospital30-day discharge rateMean0.110.130.140.160.190.210.240.26
SE0.00070.00130.00150.00250.00300.00290.00370.0030
45-day discharge rateMean0.170.180.210.240.270.300.340.36
SE0.00100.00180.00210.00340.00380.00370.00470.0038
60-day discharge rateMean0.200.220.250.280.320.360.390.43
SE0.00120.00210.00240.00390.00420.00400.00520.0041
Tab.7  Facility-level performance results under various census acuity scenarios of an NH
Fig.11  Comparison of facility-level performance results under various census acuity scenarios.
1 P C Austin, D M Rothwell, J V Tu, (2002). A comparison of statistical modeling strategies for analyzing length of stay after CABG surgery. Health Services and Outcomes Research Methodology, 3( 2): 107–133
https://doi.org/10.1023/A:1024260023851
2 P Cappanera, F Visintin, C Banditori, (2014). Comparing resource balancing criteria in master surgical scheduling: A combined optimisation-simulation approach. International Journal of Production Economics, 158: 179–196
https://doi.org/10.1016/j.ijpe.2014.08.002
3 K Carey, (2002). Hospital length of stay and cost: A multilevel modeling analysis. Health Services and Outcomes Research Methodology, 3( 1): 41–56
https://doi.org/10.1023/A:1021530924455
4 for Medicare CentersServices (CMS) Medicaid (2013). MDS 3.0 Quality Measures: User’s Manual. Research Triangle Park, NC: RTI International
5 Centers for Medicare and Medicaid Services (CMS) (2017). Long-Term Care Facility Resident Assessment Instrument 3.0 User’s Manual
6 S R Cole, H Chu, S Greenland, (2014). Maximum likelihood, profile likelihood, and penalized likelihood: A primer. American Journal of Epidemiology, 179( 2): 252–260
https://doi.org/10.1093/aje/kwt245
7 D R Cox, (1972). Regression models and life-tables. Journal of the Royal Statistical Society: Series B, Methodological, 34( 2): 187–202
https://doi.org/10.1111/j.2517-6161.1972.tb00899.x
8 R B D’Agostino, B H Nam, (2003). Evaluation of the performance of survival analysis models: Discrimination and calibration measures. Handbook of Statistics, 23: 1–25
https://doi.org/10.1016/S0169-7161(03)23001-7
9 S EikenK SredlL GoldJ KastenB BurwellP Saucier (2014). Medicaid Expenditures for Long-Term Services and Supports in FFY 2012. Bethesda, MD: Truven Health Analytics
10 E El-Darzi, C Vasilakis, T Chaussalet, P H Millard, (1998). A simulation modelling approach to evaluating length of stay, occupancy, emptiness and bed blocking in a hospital geriatric department. Health Care Management Science, 1( 2): 143–149
https://doi.org/10.1023/A:1019054921219
11 M Faddy, N Graves, A Pettitt, (2009). Modeling length of stay in hospital and other right skewed data: Comparison of phase-type, gamma and log-normal distributions. Value in Health, 12( 2): 309–314
https://doi.org/10.1111/j.1524-4733.2008.00421.x
12 S A Fashaw, K S Thomas, E McCreedy, V Mor, (2020). Thirty-year trends in nursing home composition and quality since the passage of the Omnibus Reconciliation Act. Journal of the American Medical Directors Association, 21( 2): 233–239
https://doi.org/10.1016/j.jamda.2019.07.004
13 R B GinsburgUS Supreme Court of the (1998). US Reports: Olmstead v. L.C., 527 US 581
14 D GonzàlezM PiñaL (2008) Torres. Estimation of parameters in Cox’s proportional hazard model: Comparisons between Evolutionary Algorithms and the Newton-Raphson Approach. In: Mexican International Conference on Artificial Intelligence. Berlin, Heidelberg: Springer, 513–523
15 P R Hachesu, M Ahmadi, S Alizadeh, F Sadoughi, (2013). Use of data mining techniques to determine and predict length of stay of cardiac patients. Healthcare Informatics Research, 19( 2): 121–129
https://doi.org/10.4258/hir.2013.19.2.121
16 F E Harrell Jr, K L Lee, D B Mark, (1996). Multivariable prognostic models: Issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors. Statistics in Medicine, 15( 4): 361–387
https://doi.org/10.1002/(SICI)1097-0258(19960229)15:4<361::AID-SIM168>3.0.CO;2-4
17 A A Holup, K Hyer, H Meng, L Volicer, (2017). Profile of nursing home residents admitted directly from home. Journal of the American Medical Directors Association, 18( 2): 131–137
https://doi.org/10.1016/j.jamda.2016.08.017
18 N R Hoot, L J LeBlanc, I Jones, S R Levin, C Zhou, C S Gadd, D Aronsky, (2008). Forecasting emergency department crowding: A discrete event simulation. Annals of Emergency Medicine, 52( 2): 116–125
https://doi.org/10.1016/j.annemergmed.2007.12.011
19 R A Incalzi, A Gemma, O Capparella, L Terranova, P Porcedda, E Tresalti, P Carbonin, (1992). Predicting mortality and length of stay of geriatric patients in an acute care general hospital. Journal of Gerontology, 47( 2): M35–M39
https://doi.org/10.1093/geronj/47.2.M35
20 A Kelly, J Conell-Price, K Covinsky, I S Cenzer, A Chang, W J Boscardin, A K Smith, (2010). Length of stay for older adults residing in nursing homes at the end of life. Journal of the American Geriatrics Society, 58( 9): 1701–1706
https://doi.org/10.1111/j.1532-5415.2010.03005.x
21 A A Kramer, J E Zimmerman, (2010). A predictive model for the early identification of patients at risk for a prolonged intensive care unit length of stay. BMC Medical Informatics and Decision Making, 10( 1): 27
https://doi.org/10.1186/1472-6947-10-27
22 L C McGuire, E S Ford, C A Okoro, (2007). Natural disasters and older US adults with disabilities: Implications for evacuation. Disasters, 31( 1): 49–56
https://doi.org/10.1111/j.1467-7717.2007.00339.x
23 M MoonB GageA Evans (1997). An examination of key Medicare provisions in the Balanced Budget Act of 1997. New York: The Commonwealth Fund
24 Y Murad, (2011). Skilled nursing facilities and post-acute care. Journal of Gerontology & Geriatric Research, 1( 101): 1–4
25 Investment Center for Seniors Housing & Care (NIC) National (2018). Skilled Nursing Data Report: Key Occupancy & Revenue Trends. 4Q2017
26 P W New, K Stockman, P A Cameron, J H Olver, J U Stoelwinder, (2015). Computer simulation of improvements in hospital length of stay for rehabilitation patients. Journal of Rehabilitation Medicine, 47( 5): 403–411
https://doi.org/10.2340/16501977-1957
27 P C Pendharkar, H Khurana, (2014). Machine learning techniques for predicting hospital length of stay in Pennsylvania federal and specialty hospitals. International Journal of Computer Science & Applications, 11( 3): 45–56
28 M Taboada, E Cabrera, M L Iglesias, F Epelde, E Luque, (2011). An agent-based decision support system for hospitals emergency departments. Procedia Computer Science, 4: 1870–1879
https://doi.org/10.1016/j.procs.2011.04.203
29 L Turgeman, J H May, R Sciulli, (2017). Insights from a machine learning model for predicting the hospital Length of Stay (LOS) at the time of admission. Expert Systems with Applications, 78: 376–385
https://doi.org/10.1016/j.eswa.2017.02.023
30 J Wang, J Li, K Tussey, K Ross, (2012). Reducing length of stay in emergency department: A simulation study at a community hospital. IEEE Transactions on Systems, Man, and Cybernetics: Part A, Systems and Humans, 42( 6): 1314–1322
https://doi.org/10.1109/TSMCA.2012.2210204
31 H Xie, T J Chaussalet, P H Millard, (2005). A continuous time Markov model for the length of stay of elderly people in institutional long-term care. Journal of the Royal Statistical Society, Series A (Statistics in Society), 168( 1): 51–61
https://doi.org/10.1111/j.1467-985X.2004.00335.x
32 X Zhang, S Barnes, B Golden, M Myers, P Smith, (2019). Lognormal-based mixture models for robust fitting of hospital length of stay distributions. Operations Research for Health Care, 22: 100184
https://doi.org/10.1016/j.orhc.2019.04.002
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed