Top data mining tools for the healthcare industry

https://doi.org/10.1016/j.jksuci.2021.06.002 Get rights and content
Under a Creative Commons license
open access

Abstract

The healthcare industry has become increasingly challenging, requiring retrieval of knowledge from large amounts of complex data to find the best treatments. Several works have suggested the use of Data Mining tools to overcome the challenges; however, none of them has suggested the best tool to do so. To fill this gap, this paper presents a survey of popular open-source data mining tools in which data mining tool selection criteria based on healthcare application requirements is proposed and the best ones using the proposed selection criteria are identified. The following popular open-source data mining tools are assessed: KNIME, R, RapidMiner, Scikit-learn, and Spark. The study shows that KNIME and RapidMiner provide the largest coverage of healthcare data mining requirements.

Keywords

Data mining
Healthcare
Open-source data mining tools

Cited by (0)

Peer review under responsibility of King Saud University.