Mining Health Examination Records - A Disease Detection Approach

Shelke Swapnali K, Korde Sachin K.


General health examination is an integral part
of healthcare in many countries like India. To identify the
participants at risk is important for early warning and preventive
measures. The majority of the collected dataset is the unlabeled
data, which is the fundamental challenge of learning a classication
model for risk prediction. Usually, the unlabeled data describes
the participants health conditions varying greatly from healthy
to very-ill. In this paper, we use a graph-based, semi-supervised
learning algorithm called SHG-Health (Semisupervised Heterogeneous Graph on Health) for risk predictions to classify a progressively developing situation with the majority ofthedataunlabeled.Wearegoingtoperformexperimentsbased on both real health
examination datasets and synthetic datasets are performed to
show the effectiveness and efciency of our method.

Full Text:



M.F.Ghalwash, V.Radosavljevic, and Z.Obradovic, Extraction of interpretable multivariate patterns for early diagnostics, IEEE International

Conference on Data Mining, pp. 201210, 2013.

T. Tran, D. Phung, W. Luo, and S. Venkatesh, Stabilized sparse ordinal

regression for medical risk stratication, Knowledge and Information

Systems, pp. 128, Mar. 2014.

M. S. Mohktar, S. J. Redmond, N. C. Antoniades, P. D. Rochford, J.

J. Pretto, J. Basilakis, N. H. Lovell, and C. F. McDonald, Predicting

the risk of exacerbation in patients with chronic obstructive pulmonary

disease using home telehealth measurement data, Articial Intelligence in

Medicine, vol. 63, no. 1, pp. 5159, 2015.

J. M. Wei, S. Q. Wang, and X. J. Yuan, Ensemble rough hypercuboid

approach for classifying cancers, IEEE Transactions on Knowledge and

Data Engineering, vol. 22, no. 3, pp. 381391, 2010.

E.Kontio,A.Airola,T.Pahikkala,H.LundgrenLaine,K.Junttila,H.Korvenranta,T. Salakoski, and S. Salantera, Predicting

patient acuity from electronic patient records. Journal of Biomedical

Informatics, vol. 51, pp. 813, 2014.

Q. Nguyen, H. Valizadegan, and M. Hauskrecht, Learning classication

models with soft-label information. Journal of the American Medical

Informatics Association : JAMIA, vol. 21, no. 3, pp. 5018, 2014.

G. J. Simon, P. J. Caraballo, T. M. Therneau, S. S. Cha, M. R. Castro,

and P. W. Li, Extending Association Rule Summarization Techniques

to Assess Risk of Diabetes Mellitus, IEEE Transactions Knowledge and

Data Engineering, vol. 27, no. 1, pp. 130141,2015.


Copyright © IJETT, International Journal on Emerging Trends in Technology