Machine learning identifies abnormal Ca2+ transients in human induced pluripotent stem cell-derived cardiomyocytes

Hwang, Hyun; Liu, Rui; Maxwell, Joshua T.; Yang, Jingjing; Xu, Chunhui

doi:10.1038/s41598-020-73801-x

Download PDF

Article
Open access
Published: 12 October 2020

Machine learning identifies abnormal Ca²⁺ transients in human induced pluripotent stem cell-derived cardiomyocytes

Scientific Reports volume 10, Article number: 16977 (2020) Cite this article

2152 Accesses
19 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Human-induced pluripotent stem cell-derived cardiomyocytes (hiPSC-CMs) provide an excellent platform for potential clinical and research applications. Identifying abnormal Ca²⁺ transients is crucial for evaluating cardiomyocyte function that requires labor-intensive manual effort. Therefore, we develop an analytical pipeline for automatic assessment of Ca²⁺ transient abnormality, by employing advanced machine learning methods together with an Analytical Algorithm. First, we adapt an existing Analytical Algorithm to identify Ca²⁺ transient peaks and determine peak abnormality based on quantified peak characteristics. Second, we train a peak-level Support Vector Machine (SVM) classifier by using human-expert assessment of peak abnormality as outcome and profiled peak variables as predictive features. Third, we train another cell-level SVM classifier by using human-expert assessment of cell abnormality as outcome and quantified cell-level variables as predictive features. This cell-level SVM classifier can be used to assess additional Ca²⁺ transient signals. By applying this pipeline to our Ca²⁺ transient data, we trained a cell-level SVM classifier using 200 cells as training data, then tested its accuracy in an independent dataset of 54 cells. As a result, we obtained 88% training accuracy and 87% test accuracy. Further, we provide a free R package to implement our pipeline for high-throughput CM Ca²⁺ analysis.

A dynamic clamping approach using in silico IK1 current for discrimination of chamber-specific hiPSC-derived cardiomyocytes

Article Open access 18 March 2023

Simultaneous measurement of excitation-contraction coupling parameters identifies mechanisms underlying contractile responses of hiPSC-derived cardiomyocytes

Article Open access 20 September 2019

A machine learning algorithm for electrocardiographic fQRS quantification validated on multi-center data

Article Open access 26 April 2022

Introduction

Cardiomyocytes derived from human-induced pluripotent stem cells (hiPSC-CMs) are highly desired for drug discovery and modeling human development and disease, as alternative models such as human primary CMs are hard to obtain¹. Although current hiPSC-CMs display fetal-like phenotypes in terms of their structural and electrophysiological properties², they have increasingly been used to study normal cardiac functionality^3,4,5 and human cardiovascular diseases such as long QT syndrome, catecholaminergic polymorphic ventricular tachycardia and viral myocarditis as well as for high-throughput cardiotoxicity screening^5,6,7,8,9. Furthermore, hiPSC-CMs are under active investigation for use as a cell source for possible clinical usage^10,11. For these applications, extensive functional characterization of hiPSC-CMs is required.

Ca²⁺ transients are a fundamental characteristic of cardiomyocyte functionality, although cardiac action potentials and contractility are also commonly used to study cardiomyocyte functionality by methods such as patch clamp, multielectrode array, microscopic video analysis, and fluorescence imaging¹². Coordinated movement of Ca²⁺ at single cell level plays a key role to control contraction of the heart by the conversion of electric excitation into mechanical contraction. Specifically, each action potential induces Ca²⁺ influx, which triggers a much greater Ca²⁺ release from the sarcoplasmic reticulum (SR). The increased cytosolic Ca²⁺ binds to and activates the Ca²⁺-sensing protein of the contractile apparatus and initiates CM contraction. Then Ca²⁺ is removed from the cytosol through reuptake into the SR or extrusion into the extracellular space, which leads to CM relaxation. Thus, the rapid release and reuptake of Ca²⁺ between the SR and the cytosol create a Ca²⁺ transient inside the CM¹³. Abnormal Ca²⁺ signals are indicative of various cardiac pathologies, such as arrhythmia^5,6,8.

An accurate Ca²⁺ transient analysis is an important component of hiPSC-CM phenotype analysis. Ca²⁺ transient characteristics are commonly captured with Ca²⁺-specific fluorescent dye, and fluorescence imaging is the most optimal for high-throughput application. Human experts’ assessment of Ca²⁺ transient signals is often based on Ca²⁺ transient morphology characterized by rapid upstroke and decay kinetics. Although manual identification of abnormal Ca²⁺ transients by human experts is often taken as gold standards, such visual assessment is labor-intensive, time-consuming, and subjective to the assessor’s expertise. Moreover, manual identification of abnormal Ca²⁺ transients by human experts becomes a bottleneck hindering its application to high-throughput analysis. Thus, a user-friendly computational tool is in pressing need to mitigate the bottleneck of manual analysis and to enable automatic assessment of Ca²⁺ transient abnormality.

Previously, Juhola et al. proposed an Analytical Algorithm to detect cycling Ca²⁺ transient peaks, quantify peak variables, and assess the abnormality of transient peaks and signals¹⁴. This analytical algorithm identifies signal abnormality based on whether the assessed cell signal contains at least one abnormal transient peak based solely on characteristics of a single peak. The assessment did not leverage shared characteristics of normal and abnormal Ca²⁺ transient peaks and signals across all samples, which are expected to provide valuable input to improve the accuracy of signal abnormality assessment. Further, the analytical algorithm fails to account for the valuable manual assessment results about existing data.

To overcome these limitations, we develop an improved automatic pipeline that is composed of peak detection, peak variable quantification, peak abnormality assessment, signal variable extraction, and signal abnormality assessment. We adapt the existing Analytical Algorithm for peak detection, peak variable quantification, and peak abnormality assessment. Additionally, the advanced machine learning method of Support Vector Machine (SVM)¹⁵ is used for abnormality assessments of peaks and signals, which leverages shared data characteristics and experts’ manual analysis results of training data.

Further, we provide an R library to implement this pipeline, which includes SVM classifiers trained using our Ca²⁺ transient data as well as functions for peak detection, peak variable quantification, training peak-level SVM classifier, cell variable quantification, training signal-level SVM classifier, and predicting signal abnormality. Our R library is freely available through GitHub and is expected to serve as a convenient tool for people in need of a Ca²⁺ transient analysis software with high speed and accuracy.

Results

Study overview

A flowchart is provided in Fig. 1a,b for this pipeline. In this pipeline, we improve the existing Analytical Algorithm¹⁴ to better characterize signal abnormality by including additional peak variables such as nearby peak distance, varying peak amplitude, and peak asymmetry, as well as considering irregular peak phases for peak abnormality assessment.

In particular, with a set of Ca²⁺ signals as the training data, our pipeline first trains a peak-level SVM classifier by taking peak assessments by human experts as responses (normal or abnormal) and 14 peak variables (the names of the peak variables are listed in Table 1) as predicting features. Second, assessments of peak abnormality by both our improved analytical algorithm and trained peak-level SVM classifiers are obtained and used as additional peak variables. Third, cell abnormality assessment based on those two types of peak assessments along with other cell variables (the names of the cell variables are listed in Table 2) are taken as predictors to train a signal-level SVM classifier for predicting signal abnormality. The trained peak-level and cell-level SVM classifiers can be applied to detect the abnormality of additional Ca²⁺ transient peaks and signals. Additionally, we validated our pipeline using Ca²⁺ transient data generated by our lab.

Table 1 Peak variable averages and their standard deviations of the test data.

Full size table

Table 2 Cell variable averages and their standard deviations of the test data.

Full size table

Data preprocessing

Ca²⁺ transient signal data were generated using MetaXPress software. While the sampling frequency for the transient was 5 Hz across the board, lengths varied between 12 and 32 s. Signals with single peaks were eliminated, as they were insufficient to count as signal data. In particular, we first generated 213 signals: 78 signals from the 12-s dataset and 135 signals from the 32-s dataset. After single-peak signal elimination, 66 from the 12-s dataset and 134 signals from the 32-s dataset were taken as our training dataset. The data were tidied and plotted for assessment by human experts. Human experts labeled peaks and signals as either normal or abnormal. These 200 signals were taken as our training data to validate our proposed pipeline. Following the same procedure, an independent test dataset of 54 cells were generated.

Abnormality assessment by human experts

Upon inspection of Ca²⁺ transient signals such as the one shown in Fig. 2a,b, a human expert in assessing Ca²⁺ transient signals made abnormality assessment about the Ca²⁺ transient peaks and signals. For our training dataset, the expert made abnormality assessments for a total of 200 signals and 1893 peaks within those signals. A peak was labeled as normal if the transient had typical cardiac Ca²⁺ transient morphology (i.e. rapid upstroke and decay kinetics), no oscillations of the diastolic Ca²⁺ signal, and no obvious spontaneous Ca²⁺ release between transients (Fig. 2c-i, ii). A peak was labeled as abnormal if any of above criteria was not met (Fig. 2c-iii-vi). A cell was labeled as normal if all of the peaks within the cell were normal and of consistent amplitudes and rhythmicity (Fig. 2c-i). A cell was labeled as abnormal if any of above criteria was not met (Fig. 2c-ii-vi).

Peak detection

To detect the peaks of Ca²⁺ transient signals, we improved the analytical method proposed by Juhola et al.¹⁴. Specifically, for each Ca²⁺ transient signal, the first derivative values of signal intensities at the observed timeframe points are first calculated by using the Trapezium rule¹⁶. Second, a sequential screening strategy is taken to identify the starting, maximum, and ending timeframe points for all peaks presented in the signal. That is, starting from the initial timeframe point or the timeframe point right after the ending of the previous peak, the next timeframe point with first derivative value greater than a pre-defined threshold t_up (default 30) is considered as the beginning (i.e., peak left) timeframe point of the current peak. Starting from the peak left, first derivative values should be positive before peak maximum point while negative after peak maximum. Thus, the first timeframe point after the peak left point with a negative derivative value is taken as the maximum timeframe point of the current peak (i.e., peak maximum). The first timeframe point after peak maximum with a positive derivative value whose absolute value is greater than a pre-defined threshold rt_up (default 2; to get around possible noisy signal fluctuations) is taken as the end of the current peak (i.e., peak right). The default value for t_up is set as used by Juhola et al.¹⁴, and the default value for rt_up is determined based on our experiments. In particular, taking rt_up = 0 is equivalent as taking the first timeframe point after peak maximum with a positive first derivative value as peak end.

To avoid the identification of a partial or noisy first peak within a signal, we exclude the first peak that is asymmetric with left amplitude less than 50% of the right amplitude and intensity value < 5. We also exclude noisy peaks with peak amplitudes less than 15% of the maximum amplitude within the signal. To ensure that our detected peaks are valid with minimal noise or partial peaks, signals with no peak or a single peak are excluded from our analyses.

Peak variable quantification

Fourteen peak variables are quantified after peak detection (Fig. 3) and then used for peak abnormality assessment by both analytical and SVM methods.

The 14 peak variables are as follows: peak left amplitude (A_l), peak right amplitude (A_r), amplitude difference between A_l and A_r (A_d), duration from peak left to peak max (D_l), duration from peak max to peak right (D_r), maximum first derivative value from peak left to peak max (Dy_max), absolute minimum first derivative value from peak max to peak right (Dy_min) maximum second derivative value from peak left to peak max (D2y_max), absolute minimum second derivative value from peak max to peak right (D2y_min), peak area under the intensity curve from peak left to peak right (R), duration from the previous peak max to current peak max (delta, i.e., peak distance), duration from peak left to Dy_max (delta_l2Dymax), duration from peak max to Dy_min (delta_m2Dymin), and median of delta values within a signal (Peak_distance_median). These quantified peak variables are used for training peak-level SVM classifier and subsequent cell-level SVM classifier.

Peak abnormality assessment by improved analytical algorithm

Here, peak max amplitude and min amplitude respectively refer to the maximum and minimum of A_l and A_r. In addition to peak amplitudes and asymmetry as considered by previous method¹⁴, our improved Analytical Algorithm also considers irregular phase to assess peak normality based on peak distances (delta) within one signal.

We first assess peak normality with respect to peak amplitudes. That is, the first peak will be labeled as abnormal if the peak max amplitude is less than 50% of the average peak max amplitude within the same signal. Peaks other than the first one will be labeled as abnormal if the preceding peak is abnormal and the peak max amplitude is less than 50% of the average peak max amplitude within the same signal, or if the peak amplitude is less than 50% of the preceding normal peak. Second, a peak with normal amplitude characteristics will be labeled as abnormal when the peak min amplitude is less than 85% of the peak max amplitude (i.e., asymmetric). Last, irregular phase assessment will be considered. A symmetric peak with normal amplitude but distance from previous peak to current peak (delta; except for the first peak) greater than 90% of the median delta within the same signal (i.e., irregular phase) will be labeled as abnormal. All thresholds are chosen based on our experimental training data and can be adjusted according to new data characteristics.

Train peak-level SVM classifier

To employ expert peak assessments and peak characteristics of training data, we train a peak-level SVM classifier to predict peak normality status, taking expert peak assessments as outcome and these 14 peak variables as described in previous subsection as predictive features. To avoid the issue of overfitting for accuracy assessment with training data, we take the LOOCV approach¹⁷ to fit peak-level SVM classifiers and make predictions for all samples in the training dataset. In particular, peaks within a signal are taken as test data and a corresponding peak-level SVM classifier is trained using peaks from all other samples, which is iterated for all signals to obtain predictions of all peaks. The peak normality predictions by the LOOCV approach will then be used to train the follow-up cell-level SVM classifier.

Train cell-level SVM classifier

The cell normality labels based on peak normality assessments obtained by our improved Analytical Algorithm and SVM-LOOCV approach are considered as cell variables. We consider additional cell variables as follows: proportion of abnormal peaks per signal (prop_abnormal), variance of peak amplitude per signal (var_A), variance of peak distances per signal (var_delta), and variance of peak areas per signal (var_R). These cell variables are centered and standardized and then used as predictive features to train a cell-level SVM classifier to predict cell abnormality, where outcomes are taken as human-expert assessments about cell normality. This trained cell-level SVM classifier can then be used to predict cell normality for additional independent signals.

Application studies

To validate the above described pipeline (Fig. 1) for analyzing Ca²⁺ transient data, we applied the pipeline to study the Ca²⁺ transient data of 254 cells generated in our lab. In particular, we took 200 cell signals (containing 1893 peaks) as our training data and 54 cell signals (454 peaks) as our test data. We first manually assessed the normality of all of these signals and peaks that were considered as gold standards and taken as outcome variables for training SVM classifiers. Second, by applying our improved Analytical Methods to assess peak normality, we obtained 93.3% accuracy, 91.1% sensitivity, and 95.8% specificity (Table 3). Third, by the SVM-LOOCV approach to assess peak normality, we obtained 92.2% accuracy, 91.8% sensitivity, and 95.3% specificity (Table 3). Cell abnormality assessments based on these two peak assessments were then taken together with other cell variables to train a cell-level classifier. By using the LOOCV approach with our training data, our cell-level SVM classifier obtained 89.9% accuracy, 94.7% sensitivity, and 83.3% specificity for cell assessments (Table 4).

Table 3 Peak abnormality assessment accuracy.

Full size table

Table 4 Cell abnormality assessment accuracy.

Full size table

With the cell-level SVM classifier trained by using our training data, we then validated the accuracy of cell abnormality assessment with 54 additional test cells. To begin with, by using our pipeline, the Ca²⁺ transient peaks in the test dataset were identified, and peak-level variables were quantified, followed by analytical algorithm peak status assessment. Then, the peak-level SVM classifier trained using our training data produced peak status prediction for each identified peak in the test data. Cell status assessments based on these two peak assessments were used together with other cell variables to predict the final cell normality status by using the trained cell-level SVM model from our training data. As a result, we obtained 87.0% accuracy, 88.9% sensitivity, and 83.3% specificity (Table 4). Compared to the cell abnormality assessments by existing Analytical Algorithm (83.3% accuracy, 83.3% sensitivity, and 83.3% specificity), our SVM approach obtained higher sensitivity and accuracy for borrowing strength across all peaks and signals by SVM method.

In addition, we constructed a receiver-operating curve (ROC)¹⁸ for both training data and test data based on the classification outcomes of each cell signal by using our trained SVM classifier. As shown in Fig. 4, our trained SVM classifier showed excellent results, with area under the curve (AUC)¹⁸ of 0.97 and 0.95 for the training and test dataset, respectively. The AUC is the probability that a classifier will rank a randomly chosen abnormal cell higher than a randomly chosen normal cell (assuming 'abnormal' ranks higher than 'normal')¹⁸.

Discussion

In this study, we develop an automatic pipeline for assessing the normality of hiPSC-CM Ca²⁺ transient signals, an otherwise labor-intensive and time-consuming phenotypic analysis for CMs. Specifically, we improve the existing Analytical Algorithm¹⁴ by accounting for irregular phases within signals and employ the advanced machine learning SVM method for peak and cell abnormality prediction. We also validate our approach of using advanced machine learning SVM method in this pipeline by using training and test hiPSC-CM Ca²⁺ transient signals generated by our lab. With independent test data, we demonstrate that our SVM approach obtained 87.0% accuracy (versus 83.3% accuracy obtained by Analytical Algorithm).

Our results show the advantages of learning normal and abnormal characteristics across multiple peaks and cells as well as employing the valuable human-expert assessments of training data. Although our improved Analytical Algorithm yielded excellent peak assessment accuracy of 93.3% with our training data, its cell-level assessment accuracy was 87.5% with our training data and 83.3% with our test data. The decent peak abnormality assessment accuracy by Analytical Algorithm is probably because the Analytical Algorithm is developed to mimic human-expert assessment. In contrast, our cell-level SVM classifier obtained accuracy 89.9% with our training data by the LOOCV approach and accuracy 87.0% with our test data. The relatively lower accuracy for signal abnormality assessment is likely because the Analytical Algorithm fails to account for abnormality due to abnormal characteristics of multiple peaks and signals such as signals with irregular phases.

Automatic identification of Ca²⁺ transients can overcome limitation of traditional manual assessment. Manual signal abnormality assessment is difficult since recordings are often short, contain small number of peaks, with varying morphologies of signals and peaks within them. Abnormal cell signals are often difficult to be identified consistently by multiple human-experts. This may be due to the fact that (1) there are peaks of small amplitude that are borderline noise, (2) the nature of the cell signal morphology renders it difficult to exactly characterize, for instance due to continuously decreasing fluorescence intensity, among other reasons.

Our signal classification results are already applicable for current use. Its potential is even larger as more data can be fed into the training set with ease. By incorporating more data collected and analyzed by different Ca²⁺ transient experts, we expect our model to be better modified for more nuanced prediction of novel Ca²⁺ transient data. Various other types of Ca²⁺ transient signals can be assessed by a human expert to further modify the prediction model based on any potential need of any user.

Currently, our machine learning SVM classifiers have been trained using hiPSC-CMs derived from two different strains of stem cells—SCVI-273 and IMR-90—which produce very similar Ca²⁺ transient signals. This could render the machine learning model biased toward certain signal patterns. As we accumulate more data, we expect to see further improvements in the overall accuracy and efficiency of our proposed machine learning method. In addition, training the model with various other hiPSC-CMs derived from different cell lines including disease cell lines (see, for example, Juhola et al.¹⁹) at multiple differentiation stages will significantly improve the generalizability of our machine learning method in such a way that will allow us to capture the underlying essence of seemingly different patterns of Ca²⁺ transient signals among CMs of different sources. This will, in turn, enhance its ability to be used as a scalable tool for analyzing high-throughput CM data for various purposes, such as drug screening. As our machine learning model incorporates more diverse sets of data, we anticipate its usage to evolve as well: from an aide for a busy human-expert to eventually fully replicating the decision-making of a human-expert on all patterns of Ca²⁺ transient signal, regardless of its origin and experimental procedures employed.

Our Ca²⁺ transient analysis software that implements our automatic pipeline with machine learning SVM method is available in the form of R package for everyone in need of Ca²⁺ transient analysis tool.

Methods

Culture of hiPSCs and cardiomyocyte differentiation

Undifferentiated SCVI-273 hiPSCs (Stanford Cardiovascular Institute)⁹ and IMR90 hiPSCs (WiCell Research Institute)²⁰ were fed daily on Matrigel-coated plates with mTeSR1 defined medium (Stem Cell Technologies, 85850) and passaged using Versene (Thermo Fisher Scientific, 15040066) when compact colonies reached 90–100% confluence. For CM differentiation, hiPSCs were induced using a growth factor-guided differentiation protocol^21,22. At the day of induction (day 0), medium was replaced with RPMI 1640 medium supplemented with 2% B27 minus insulin (Thermo Fisher Scientific, A1895601) and 100 ng/ml activin A (R&D Systems, 338-AC-050/CF). After 24 h (day 1), RPMI supplemented with 2% B27 minus insulin was used for 24 h. After 24 h (day 1), activin A was replaced with 10 ng/ml BMP4 (R&D Systems, 314-BP-050/CF), and cells were cultured without any medium change for the next 3 days. From day 4, the growth factor-containing medium was replaced with RPMI supplemented with 2% regular B27 (Thermo Fisher Scientific, 17504044) and the medium was changed every other day. hiPSC-CMs were further enriched by the metabolic selection method using RPMI without glucose (Thermo Fisher Scientific, 11879020) supplemented with 2% B27 and 5 mM lactate from day 11 to 14²³. Alternatively, enriched hiPSC-CMs were generated by microscale generation of cardiospheres at day 6²⁴. Cells were observed under a microscope daily for beating cells, which typically appeared by day 8–10. At day 14, a parallel culture of cells were harvested to determine CM purity before subsequent assessments.

Ca²⁺ transient assay

Live cell imaging of intracellular Ca²⁺ transient was performed using Fluo-4 AM (Thermo Fisher Scientific, F14202). At differentiation day 18, cells were seeded in a 96-well plate at a low density to acquire single-cell Ca²⁺ transients. At differentiation days 20 to 22, cells were treated with or without arrhythmogenic drugs including TNF-α, ethanol, and melphalan for 3 to 5 days. At differentiation days 23 to 25, cells were acquired for Ca²⁺ transient signals. Beating hiPSC-CMs were incubated with 10 µM Fluo-4 AM for 25 min at 37℃ followed by a 5 min wash with warm 1 × Normal Tyrode solution (148 mM NaCl, 4 mM KCl, 0.5 mM MgCl₂·6H₂O, 0.3 mM NaPH₂O₄·H₂O, 5 mM HEPES, 10 mM d-Glucose, 1.8 mM CaCl₂·H₂O, pH adjusted to 7.4 with NaOH). Fluorescence images were acquired in 1 × Normal Tyrode’s solution immediately after the wash using ImageXpress Micro XLS System (Molecular Devices) with excitation at 488 nm and emission at 515–600 nm at a frequency of 5 frames/sec and 20× magnification for 12 or 32 s. Fluorescence intensity plots from spontaneously beating cells were obtained using MetaXpress software (Molecular Devices) by region of interest measurements.

Ca²⁺ transients from SCVI-273-derived CMs with and without TNF-a treatment and IMR90-derived CMs with and without ethanol treatment were used as training datasets for machine learning algorithm. Ca²⁺ transients from SCVI-273-derived CMs with or without melphalan treatment were used as test datasets for machine learning algorithm.

We note that treatment conditions we tested (TNF-α, ethanol, and melphalan) caused abnormal Ca²⁺ transients (Fig. 2c and Rampoldi et al.²⁵) with patterns similar to those observed in patient-derived hiPSC-CMs (e.g., catecholaminergic polymorphic ventricular tachycardia⁸). We also note that the patterns of Ca²⁺ transients at days 23–25 observed in this study were similar to those from cells of 30 ± 2 days old⁸.

R-package

An R-package for the Ca²⁺ transient analysis described in this study, called SVMCaT, is available on the Github website (https://github.com/hyunmhwang/SVMCaT).

References

Burridge, P. W., Keller, G., Gold, J. D. & Wu, J. C. Production of de novo cardiomyocytes: Human pluripotent stem cell differentiation and direct reprogramming. Cell Stem Cell 10, 16–28 (2012).
Article CAS Google Scholar
Lundy, S. D., Zhu, W. Z., Regnier, M. & Laflamme, M. A. Structural and functional maturation of cardiomyocytes derived from human pluripotent stem cells. Stem Cells Dev. 22, 1991–2002 (2013).
Article CAS Google Scholar
Yamashita, J. K. ES and iPS cell research for cardiovascular regeneration. Exp. Cell Res. 316, 2555–2559 (2010).
Article CAS Google Scholar
Harris, K. et al. Comparison of electrophysiological data from human-induced pluripotent stem cell-derived cardiomyocytes to functional preclinical safety assays. Toxicol. Sci. 134, 412–426 (2013).
Article CAS Google Scholar
Itzhaki, I. et al. Modelling the long QT syndrome with induced pluripotent stem cells. Nature 471, 225–229 (2011).
Article ADS CAS Google Scholar
Sharma, A. et al. Human induced pluripotent stem cell-derived cardiomyocytes as an in vitro model for coxsackievirus B3-induced myocarditis and antiviral drug screening platform. Circ. Res. 115, 556–566 (2014).
Article CAS Google Scholar
Mordwinkin, N. M., Burridge, P. W. & Wu, J. C. A review of human pluripotent stem cell-derived cardiomyocytes for high-throughput drug discovery, cardiotoxicity screening, and publication standards. J. Cardiovasc. Transl. Res. 6, 22–30 (2013).
Article Google Scholar
Preininger, M. K. et al. A human pluripotent stem cell model of catecholaminergic polymorphic ventricular tachycardia recapitulates patient-specific drug responses. Dis. Model. Mech. 9, 927–939 (2016).
Article CAS Google Scholar
Kitani, T. et al. Human induced pluripotent stem cell model of trastuzumab-induced cardiac dysfunction in breast cancer patients. Circulation 139, 2451–2465 (2019).
Article CAS Google Scholar
Zhang, D. et al. Tissue-engineered cardiac patch for advanced functional maturation of human ESC-derived cardiomyocytes. Biomaterials 34, 5813–5820 (2013).
Article CAS Google Scholar
Chong, J. J. et al. Human embryonic-stem-cell-derived cardiomyocytes regenerate non-human primate hearts. Nature 510, 273–277 (2014).
Article ADS CAS Google Scholar
Laurila, E., Ahola, A., Hyttinen, J. & Aalto-Setala, K. Methods for in vitro functional analysis of iPSC derived cardiomyocytes—Special focus on analyzing the mechanical beating behavior. Biochim. Biophys. Acta 1863, 1864–1872 (2016).
Article CAS Google Scholar
Landstrom, A. P., Dobrev, D. & Wehrens, X. H. T. Calcium signaling and cardiac arrhythmias. Circ. Res. 120, 1969–1993 (2017).
Article CAS Google Scholar
Juhola, M. et al. Signal analysis and classification methods for the calcium transient data of stem cell-derived cardiomyocytes. Comput. Biol. Med. 61, 1–7 (2015).
Article CAS Google Scholar
Cortes, C. & Vapnik, V. Support-vector networks. Machine Learning 20, 273–297 (1995).
MATH Google Scholar
Atkinson, K. E. An Introduction to Numerical Analysis 2nd edn. (Wiley, New York, 1989).
MATH Google Scholar
Molinaro, A. M., Simon, R. & Pfeiffer, R. M. Prediction error estimation: A comparison of resampling methods. Bioinformatics 21, 3301–3307 (2005).
Article CAS Google Scholar
Fawcett, T. An introduction to ROC analysis. Pattern Recogn. Lett. 27, 861–874 (2006).
Article Google Scholar
Juhola, M., Joutsijoki, H., Penttinen, K. & Aalto-Setala, K. Detection of genetic cardiac diseases by Ca(2+) transient profiles using machine learning methods. Sci. Rep. 8, 9355 (2018).
Article ADS Google Scholar
Yu, J. et al. Induced pluripotent stem cell lines derived from human somatic cells. Science 318, 1917–1920 (2007).
Article ADS CAS Google Scholar
Jha, R., Xu, R. H. & Xu, C. Efficient differentiation of cardiomyocytes from human pluripotent stem cells with growth factors. Methods Mol. Biol. 1299, 115–131 (2015).
Article CAS Google Scholar
Laflamme, M. A. et al. Cardiomyocytes derived from human embryonic stem cells in pro-survival factors enhance function of infarcted rat hearts. Nat. Biotechnol. 25, 1015–1024 (2007).
Article CAS Google Scholar
Tohyama, S. et al. Distinct metabolic flow enables large-scale purification of mouse and human pluripotent stem cell-derived cardiomyocytes. Cell Stem Cell 12, 127–137 (2013).
Article CAS Google Scholar
Jha, R. et al. Simulated microgravity and 3D culture enhance induction, viability, proliferation and differentiation of cardiac progenitors from human pluripotent stem cells. Sci. Rep. 6, 30956 (2016).
Article ADS CAS Google Scholar
Rampoldi, A. et al. Cardiac toxicity from ethanol exposure in human-induced pluripotent stem cell-derived cardiomyocytes. Toxicol. Sci. 169, 280–292 (2019).
Article CAS Google Scholar

Download references

Acknowledgements

We thank Anita Saraf and Antonio Rampoldi at the Division of Pediatric Cardiology, Department of Pediatrics, Emory University School of Medicine and Children’s Healthcare of Atlanta for their support on cell culture. We also thank Myra S. Chao at Emory University College of Arts and Sciences for her help with data collection.

Funding

This study was supported by the Center for Pediatric Technology at Emory University and Georgia Institute of Technology; Imagine, Innovate and Impact (I3) Funds from the Emory School of Medicine and through the Georgia CTSA NIH award [UL1-TR002378]; Biolocity at Emory University & Georgia Institute of Technology; National Science Foundation-Center for the Advancement of Science in Space [CBET 1926387]; and the National Institutes of Health [R21AA025723 and R01HL136345].

Author information

These authors contributed equally: Hyun Hwang and Rui Liu.

Authors and Affiliations

Division of Pediatric Cardiology, Department of Pediatrics, Emory University School of Medicine and Children’s Healthcare of Atlanta, Atlanta, GA, 30322, USA
Hyun Hwang, Rui Liu, Joshua T. Maxwell & Chunhui Xu
Department of Pediatrics, The Third Xiangya Hospital of Central South University, Changsha, 410013, Hunan, China
Rui Liu
Center for Computational and Quantitative Genetics, Department of Human Genetics, Emory University School of Medicine, Atlanta, GA, 30322, USA
Jingjing Yang
Wallace H. Coulter Department of Biomedical Engineering, Georgia Institute of Technology and Emory University, Atlanta, GA, 30033, USA
Chunhui Xu

Authors

Hyun Hwang

View author publications

You can also search for this author in PubMed Google Scholar
Rui Liu

View author publications

You can also search for this author in PubMed Google Scholar
Joshua T. Maxwell

View author publications

You can also search for this author in PubMed Google Scholar
Jingjing Yang

View author publications

You can also search for this author in PubMed Google Scholar
Chunhui Xu

View author publications

You can also search for this author in PubMed Google Scholar

Contributions

R.L. and J.T.M. performed the Ca²⁺ transient assay and expert signal assessments. H.H. and J.Y. performed the computational analysis. H.H., R.L., J.T.M., J.Y., and C.X. wrote the manuscript.

Corresponding authors

Correspondence to Jingjing Yang or Chunhui Xu.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Hwang, H., Liu, R., Maxwell, J.T. et al. Machine learning identifies abnormal Ca²⁺ transients in human induced pluripotent stem cell-derived cardiomyocytes. Sci Rep 10, 16977 (2020). https://doi.org/10.1038/s41598-020-73801-x

Download citation

Received: 04 June 2020
Accepted: 22 September 2020
Published: 12 October 2020
DOI: https://doi.org/10.1038/s41598-020-73801-x

This article is cited by

Engineered platforms for mimicking cardiac development and drug screening
- Madison Stiefbold
- Haokang Zhang
- Leo Q. Wan
Cellular and Molecular Life Sciences (2024)
Machine Learning Approaches for Stem Cells
- Mazlee Mazalan
- Tien-Dung Do
- Effirul I. Ramlan
Current Stem Cell Reports (2023)
Moving Towards Induced Pluripotent Stem Cell-based Therapies with Artificial Intelligence and Machine Learning
- Claudia Coronnello
- Maria Giovanna Francipane
Stem Cell Reviews and Reports (2022)
Cardiovascular Imaging Databases: Building Machine Learning Algorithms for Regenerative Medicine
- Chirag Mehta
- Rohan Shah
- Partho P. Sengupta
Current Stem Cell Reports (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Study overview

Data preprocessing

Abnormality assessment by human experts

Peak detection

Peak variable quantification

Peak abnormality assessment by improved analytical algorithm

Train peak-level SVM classifier

Train cell-level SVM classifier

Application studies

Discussion

Methods

Culture of hiPSCs and cardiomyocyte differentiation

Ca2+ transient assay

R-package

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links

Ca²⁺ transient assay