Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

SVAD: A genetic database curates non-ischemic sudden cardiac death-associated variants

  • Wei-Chih Huang ,

    Contributed equally to this work with: Wei-Chih Huang, Hsin-Tzu Huang

    Roles Conceptualization, Data curation, Formal analysis, Investigation, Project administration, Writing – original draft, Writing – review & editing

    Affiliations Department of Biological Science and Technology, National Chiao Tung University, Hsinchu, Taiwan, R.O.C, Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu, Taiwan, R.O.C

  • Hsin-Tzu Huang ,

    Contributed equally to this work with: Wei-Chih Huang, Hsin-Tzu Huang

    Roles Conceptualization, Investigation, Project administration, Writing – original draft, Writing – review & editing

    Affiliations Department of Biological Science and Technology, National Chiao Tung University, Hsinchu, Taiwan, R.O.C, Industrial Development Graduate Program of College of Biological Science and Technology, National Chiao Tung University, Hsinchu, Taiwan, R.O.C

  • Po-Yuan Chen,

    Roles Formal analysis, Validation

    Affiliations Department of Biological Science and Technology, National Chiao Tung University, Hsinchu, Taiwan, R.O.C, Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu, Taiwan, R.O.C

  • Wei-Chi Wang,

    Roles Conceptualization, Formal analysis

    Affiliations Department of Biological Science and Technology, National Chiao Tung University, Hsinchu, Taiwan, R.O.C, Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu, Taiwan, R.O.C

  • Tai-Ming Ko,

    Roles Conceptualization, Supervision, Writing – review & editing

    Affiliations Department of Biological Science and Technology, National Chiao Tung University, Hsinchu, Taiwan, R.O.C, Institute of Biomedical Sciences, Academia Sinica, Taipei, Taiwan, R.O.C, Graduate Institute of Integrated Medicine, College of Chinese Medicine, China Medical University, Taichung, Taiwan, R.O.C

  • Sirjana Shrestha,

    Roles Investigation

    Affiliations Department of Biological Science and Technology, National Chiao Tung University, Hsinchu, Taiwan, R.O.C, Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu, Taiwan, R.O.C

  • Chi-Dung Yang,

    Roles Investigation

    Affiliations Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen, China, School of Life and Health Sciences, The Chinese University of Hong Kong, Shenzhen, China

  • Chun-San Tai,

    Roles Investigation

    Affiliations Department of Biological Science and Technology, National Chiao Tung University, Hsinchu, Taiwan, R.O.C, Institute of Molecular Medicine and Bioengineering, National Chiao Tung University, Hsinchu, Taiwan, R.O.C

  • Men-Yee Chiew,

    Roles Investigation

    Affiliation Department of Biological Science and Technology, National Chiao Tung University, Hsinchu, Taiwan, R.O.C

  • Yu-Pao Chou,

    Roles Investigation

    Affiliation Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu, Taiwan, R.O.C

  • Yu-Feng Hu ,

    Roles Conceptualization, Funding acquisition, Resources, Supervision

    huanghsienda@cuhk.edu.cn (HDH); huhuhu0609@gmail.com (YFH)

    Affiliations Division of Cardiology, Department of Medicine, Taipei Veterans General Hospital, Taipei, Taiwan, R.O.C, Institute of Clinical Medicine, and Cardiovascular Research Center, National Yang-Ming University, Taipei, Taiwan, R.O.C

  • Hsien-Da Huang

    Roles Conceptualization, Funding acquisition, Project administration, Resources, Supervision

    huanghsienda@cuhk.edu.cn (HDH); huhuhu0609@gmail.com (YFH)

    Affiliations Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen, China, School of Life and Health Sciences, The Chinese University of Hong Kong, Shenzhen, China

Abstract

Sudden cardiac death (SCD) is an important cause of mortality worldwide. It accounts for approximately half of all deaths from cardiovascular disease. While coronary artery disease and acute myocardial infarction account for the majority of SCD in the elderly population, inherited cardiac diseases (inherited CDs) comprise a substantial proportion of younger SCD victims with a significant genetic component. Currently, the use of next-generation sequencing enables the rapid analysis to investigate relationships between genetic variants and inherited CDs causing SCD. Genetic contribution to risk has been considered an alternate predictor of SCD. In the past years, large numbers of SCD susceptibility variants were reported, but these results are scattered in numerous publications. Here, we present the SCD-associated Variants Annotation Database (SVAD) to facilitate the interpretation of variants and to meet the needs of data integration. SVAD contains data from a broad screening of scientific literature. It was constructed to provide a comprehensive collection of genetic variants along with integrated information regarding their effects. At present, SVAD has accumulated 2,292 entries within 1,239 variants by manually surveying pertinent literature, and approximately one-third of the collected variants are pathogenic/likely-pathogenic following the ACMG guidelines. To the best of our knowledge, SVAD is the most comprehensive database that can provide integrated information on the associated variants in various types of inherited CDs. SVAD represents a valuable source of variant information based on scientific literature and benefits clinicians and researchers, and it is now available on http://svad.mbc.nctu.edu.tw/.

Introduction

Sudden cardiac death (SCD) is defined as the unexpected death occurring within 1 h from the onset of symptoms in a subject with no known prior fatal condition [1, 2]. SCD is a major public health issue, accounting for approximately half of all deaths from cardiovascular diseases [3, 4]. In East Asia, particularly in China [3, 5], Japan [3, 6], and Taiwan [3, 7], the incidence of SCD per 100,000 cases per year was 41.8, 14.9, and 28.4, respectively. By contrast, the annual incidence of SCD ranges from 50 to 100 in the USA and Europe, which is relatively higher than that in East Asia [3, 8]. Sudden cardiac death in the young is a devastating event. The incidence of SCD in the younger population (< 40 years) is 1.8–2.8/100,000 individuals per year, and it makes up a significant proportion of the mortality in this age group [2, 916]. While coronary artery disease and acute myocardial infarction account for the majority of SCD in the elderly population, inherited cardiac diseases (inherited CDs) comprise a substantial proportion of younger SCD victims with a significant genetic component [14, 1723]. In half of young SCD victims, the etiology has been reported to be inherited CDs [2, 24]. Inherited cardiac diseases include cardiomyopathies (e.g., hypertrophic cardiomyopathy (HCM), dilated and restrictive cardiomyopathies, arrhythmogenic right ventricular cardiomyopathy (ARVC) and left ventricular non-compaction) and channelopathies (e.g., long QT syndrome (LQTS), Brugada syndrome (BrS), catecholaminergic polymorphic ventricular tachycardia (CPVT), idiopathic ventricular fibrillation and short-QT syndrome). Cardiomyopathy begins to dominate in older children and young adults [2527]. HCM remains the most common structural cause of SCD in the young [28], but ARVC might cause SCD in up to 25% in some countries [26]. In the United States, the most common causes of SCD in the young are HCM (~40% to 50%), arrhythmias (~20%), other cardiomyopathies (~10%), and others [29].

Identifying the genetic factors predisposing to SCD is important, and genetic biomarkers are considered alternate predictors because the majority of patients hold preserved cardiac function [4]. Owing to the recent developments in sequencing, the use of next-generation sequencing (NGS) enables the rapid analysis of many genes responsible for inherited CDs [30]. NGS allows a fast and cost-effective approach for genetic screening of a large set of genes. It is rapidly applied to clinical practice and allows scientists to investigate the genetic variants contributing to specific phenotypes when combined with large-scale annotated genetic databases [31]. At present, clinical research aims to exploit the potential of genetic variation as risk predictors or biomarkers to prevent SCD based on the clinical stages of patients and their relatives [3236]. Therefore, genetic testing is considered practicable in early diagnosis, prognostic stratification, and therapeutic interventions [36, 37]. In a 5.5-year follow-up study, it showed that the overall diagnostic yield of inherited CDs in 304 SCD families was 47%, and most diagnoses identified in the relatives were related to the diagnosis in the proband [38]. In 2015, the American College of Medical Genetics together with the Association of Molecular Pathology (ACMG-AMP) published guidelines to set standards on determining the pathogenicity of variants [39]. More importantly, a committee of the European Society of Human Genetics (ESHG) developed recommendations on how to integrate genetic testing into multidisciplinary management of SCD [40]. However, the clinical interpretation of identified variants remains a challenge because of scattered or insufficient evidence supporting their pathogenic effects.

In the past years, numerous studies aiming to explore the genetic susceptibility of non-ischemic SCD were published. Large numbers of susceptibility variants and genes have been reported to be disease-associated. However, these results are scattered in numerous publications and are sometimes inconsistent because of differences in the allele frequency among different populations [41]. A centralized information repository for a comprehensive and well-organized collection of genetic data from multiple published studies is urgently needed to provide lots of evidence to clarify the genetic predisposition to non-ischemic SCD. Here, we present the SCD-associated Variants Annotation Database (SVAD) to facilitate the interpretation of variants and to fulfill the needs of data integration for non-ischemic SCD caused by cardiomyopathies and channelopathies. In the present work, literature that mentioned the associations between genetic variants and one of the inherited CDs (HCM, ARVC, LQTS, BrS and CPVT) causing non-ischemic SCD was searched and collected. From the selected papers, we manually collected details on the type of variation and information. We also provided data from in silico prediction methods to aid the interpretation of variations with amino acid change; thus, we can classify whether the published variants are deleterious or not. SVAD currently contains approximately 2,300 entries within 1,239 distinct variants of 12 key genes associated with non-ischemic SCD, which were referenced from 232 published studies. In general, SVAD is designed to help unveil the genetic basis of SCD caused by inherited CDs.

Materials and methods

Literature collection and data integration

In our database, the associations between non-ischemic SCD caused by inherited CDs and genetic variants were provided and derived from full-text literature reading with manual curation of these genetic studies. It was presently focused on five types of inherited CD (i.e., HCM, ARVC, LQTS, BrS and CPVT) causing non-ischemic SCD, and associated literature was searched in the Entrez PubMed (http://www.ncbi.nih.gov/pubmed) using “hypertrophic cardiomyopathy”, “arrhythmogenic right ventricular cardiomyopathy”, “long QT syndrome”, “Brugada syndrome”, “catecholaminergic polymorphic ventricular tachycardia” and “Date—Publication from 2011/01/01 to 2018/03/31” as search terms. A total of 1,077 articles with available pdf files were collected. Articles and reviews mentioned about associations of genetic variation and SCD were kept for further reading by the manual screening of these publications. Initially, 4,033 entries were collected from 368 articles. Next, the collected variants located in the coding sequence (CDS) of 12 important genes related to HCM, ARVC, LQTS, BrS or CPVT (i.e., MYBPC3, MYH7, DSP, PKP2, CACNA1C, CACNB2, KCNE1, KCNE2, KCNH2, KCNQ1, SCN5A, and RYR2) were extracted for further curation and data integration.

Integrated variant information includes: associated inherited CD; located gene; chromosomal location (human reference genome version GRCh37); reference transcript ID in RefSeq; amino acid change and coding DNA change based on the reference transcript sequence; types of alteration (nonsense, missense, insertion-deletion or synonymous); the Human Genome Variation Society (HGVS) nomenclature of cDNA or protein sequence; corresponding SNP ID in dbSNP (if available) [42]; ethnicity and number of cases in each reference literature; PubMed ID of reference literature; in silico functional prediction for variants with amino acid change using the Combined Annotation Dependent Depletion (CADD) tool [43]; interpretation in ClinVar (if available) [44]; reported classification following the ACMG guideline (e.g., pathogenic, likely-pathogenic, variant of uncertain significance, likely-benign or benign) from VarSome [45]; allele frequency of each variant in different populations retrieved from the ExAC database [46], the 1000 Genome Project [47] and the Taiwan Biobank [48] (Table 1).

thumbnail
Table 1. Integrated features in SVAD.

The features were integrated from various public databases and were curated for presenting in browse page and result page.

https://doi.org/10.1371/journal.pone.0237731.t001

Independent studies usually describe genetic variation using amino acid changes or cDNA changes in a gene without indicating the reference transcript nor clearly describing the chromosomal position of the gene. To overcome this problem, all variants were described according to the HGVS nomenclature guidelines [49], the nomenclatures for variants were curated by dbSNP, ClinVar, and VarSome, and chromosomal location of variants following human reference genome version GRCh37 was used. Functional prediction by CADD can serve to improve the interpretation of genetic variants. Relationships among genetic variants and phenotypes with supporting evidence are retrieved from ClinVar. Classification of variants following the ACMG guideline can help researchers evaluate pathogenicity. Additionally, the ethnicity of cases in reference literature and allele frequency of variants in different populations were presented because the distribution of inherited CD-associated variants might be different in varied populations [41].

Results

SVAD data statistics

SVAD is now available at http://svad.mbc.nctu.edu.tw/ and will be updated annually and the updates applied in the database are reported in the “Latest news” archive on the homepage. In the current release of SVAD, a total of 2,292 entries within 1,239 variants located in the CDS of 12 key inherited CD-related genes were extracted from 232 articles (Table 2). The key genes were selected according to our ICDscreening panel (unpublished), and this panel included only the established genes with significant clinical impact, high prevalence, and clear and relevant pathogenetic mechanisms. Most collected variations were missense type, accounting for 83.2% (1,031/1,239); percentage of nonsense, indel and synonymous variations were 5.7%, 9.5% and 1.5%, respectively (Table 2). Top 5 number of entries for genes were RYR2 (970/2,292, 42.3%), SCN5A (306/2,292, 13.4%), MYH7 (263/2,292, 11.5%), MYBPC3 (251/2,292, 11.0%) and KCNQ1 (201/2,292, 8.8%), and top 5 number of collected variants for genes were RYR2 (315/1,239, 25.4%), SCN5A (226/1,239, 18.2%), MYBPC3 (165/1,239, 13.3%), MYH7 (157/1,239, 12.7%) and KCNQ1 (123/1,239, 9.9%). For MYBPC3 and PKP2, their indel variations occupied above 20% of their total variations. The number of variants with different reported classifications for each gene were shown in Fig 1A. Variants of uncertain significance accounted for 64.4% of curated variations. Pathogenic and likely-pathogenic variants were 13.3% and 19.9%, respectively. The percentage of pathogenic variants and likely-pathogenic variants was higher in KCNQ1, MYH7, MYBPC3, KCNH2, and PKP2 than in others. A high proportion of pathogenic variants was observed in MYBPC3. Distribution of variants associated with various inherited CDs were shown in Fig 1B. Among these variants, most HCM-associated variants were derived from MYBPC3 and MYH7; most ARVC-associated variants were derived from PKP2 and DSP; most LQTS-associated variants were derived from KCNQ1, KCNH2 and SCN5A; most BrS-associated variants were derived from SCN5A and CACNA1C; most CPVT-associated variants were derived from RYR2. The percentage of pathogenic/likely-pathogenic variants was high in LQTS and HCM, accounting for 51.9% and 45.5%, respectively (Fig 2). Furthermore, 413 out of 1,239 variants, approximately one-third of collected variants, were pathogenic/likely-pathogenic in SVAD. Some pathogenic/likely-pathogenic variants were mentioned many times in literature, e.g. p.Arg176Gln variation of RYR2 (14 times), p.Arg719Trp variation of MYH7 (10 times) and p.Ile4867Met variation of RYR2 (9 times). Thus, once a subject carries these pathogenic/likely-pathogenic variants, regardless of whether he has obvious symptoms of inherited CD, is likely to be at high risk of non-ischemic SCD. The number of variants associated with various inherited CDs in different populations was listed in Table 3. Indeed, the majority of research results in non-ischemic SCD focus on the Caucasian population, accounting for 42.8%. However, it was indicated that the disease-associated variants with high incidence could vary in different populations [41]. In our opinion, the evaluation of variants for potential pathogenicity in different populations should depend on the allele frequency data derived from the corresponding population, not on the general population data.

thumbnail
Fig 1.

The number of variants (A) in different reported classifications or (B) in different inherited CDs causing non-ischemic SCD. (A) A total of 1,239 variants located in the CDS of 12 key genes associated with five types of inherited CD (HCM, ARVC, LQTS, BrS and CPVT) were curated. Variants of uncertain significance accounted for 64.4% of curated variations. Pathogenic and likely-pathogenic variants were 13.3% and 19.9%, respectively. A high proportion of pathogenic variants was observed in MYBPC3. (B) In HCM, a high proportion of non-ischemic SCD-associated variants of MYBPC3 and MYH7 was observed and some variants of SCN5A and RYR2 were shown. In ARVC, variants of PKP2 and DSP accounted for high proportion. In LQTS, most non-ischemic SCD-associated variants were derived from KCNQ1, KCNH2 and SCN5A. In BrS, variants of SCN5A accounted for the majority. In CPVT, the majority of non-ischemic SCD-variants belonged to RYR2. HCM: Hypertrophic cardiomyopathy; ARVC: Arrhythmogenic right ventricular cardiomyopathy; LQTS: Long QT syndrome; BrS: Brugada syndrome; CPVT: Catecholaminergic polymorphic ventricular tachycardia.

https://doi.org/10.1371/journal.pone.0237731.g001

thumbnail
Fig 2. Distribution of reported classifications of variants in different inherited CDs causing non-ischemic SCD.

Variants of uncertain significance accounted for the majority in these inherited CDs. The percentage of pathogenic/likely-pathogenic variants was high in LQTS and HCM, accounting for 51.9% and 45.5%, respectively. HCM: Hypertrophic cardiomyopathy; ARVC: Arrhythmogenic right ventricular cardiomyopathy; LQTS: Long QT syndrome; BrS: Brugada syndrome; CPVT: Catecholaminergic polymorphic ventricular tachycardia.

https://doi.org/10.1371/journal.pone.0237731.g002

thumbnail
Table 2. The number of collected articles, curated entries and variants in 12 selected genes reported in SVAD.

https://doi.org/10.1371/journal.pone.0237731.t002

thumbnail
Table 3. Distribution of curated variants associated with various inherited CDs causing non-ischemic SCD in different populations.

https://doi.org/10.1371/journal.pone.0237731.t003

SVAD Web interface

SVAD provides a user-friendly web interface (Fig 3). It presents several search functions for users to facilitate the access of inherited CD-associated variants, including search by diseases, genes, variants, and reported classification (Fig 3A). Upon browsing, every 10 results are shown on each page, thereby providing users the opportunity to know whether a variant has been identified and enabling users to search for inherited CD-associated or pathogenic variants quickly and conveniently. Users can input keywords at the upper-right search box in browse page to quickly query for the variants of interest. Keywords should be separated by a whitespace character. For example, only the variations of MYBPC3 in HCM would be shown when the keywords “HCM MYBPC3” were inputted (Fig 3B). Here, a result page is designed to describe variants, where each variant is assigned an SVAD accession ID. The detailed genetic information of each variant, including population frequency, is shown in the result page when the user clicks the SVAD accession ID (Fig 3C).

thumbnail
Fig 3. The web interface of SVAD.

(A) Users can use keywords such as type of inherited CD abbreviation, gene name, SNP ID, nucleotide/amino acid change or reported classification to query genetic variants of interest. (B) In the browse page, users can change the ordering of results by clicking each column name of the table. At the upper-right search box in the browse page, users can input keywords to quickly query the variants of interest. (C) Detailed information of each variant is shown when clicking on the accession ID.

https://doi.org/10.1371/journal.pone.0237731.g003

Comparison with existing resources

To the best of our knowledge, SVAD is the most comprehensive database providing integrated information of variants associated with non-ischemic SCD by manually surveying pertinent literature. In SVAD, integrated information about variants associated with non-ischemic SCD is freely available, such as associated inherited CD, reported classification, clinical significance, predicted functional change, and population allele frequencies. We believe that the comprehensive collection of variant information in SCDs could valuably facilitate the interpretation of genetic data and complement the unmet clinical needs.

As compared to the ARVD/C Genetic Variants Database, which is the only database of inherited CD with variant information, it provided genetic information on only ARVC-related genes and their variants [50]. A total of 1,426 variants located in 12 genes were retrieved from 172 articles. Although it included much information to present a comprehensive view for ARVC, it did not focus on relationships among variants and non-ischemic SCD and did not take allele frequency in various populations and clinical evidence from ClinVar into account. Information of these two factors are important to variant classification. Several other databases provide information on disease-associated variations, but not specific for inherited CD or non-ischemic SCD. The Human Gene Mutation Database (HGMD) constitutes a comprehensive collection of genetic variants that are causally associated with a phenotype or disease [51]. However, there is limited and out-of-date information in the public version of the HGMD [52, 53]. Users must purchase a license of HGMD®Professional to obtain detailed and integrated information. ClinVar is a freely accessible archive of human genetic variants and interpretations of their relationships to disease and it becomes a valuable resource for clinical genetics research. Interpretations of variants are manually reviewed and curated by experts, but it takes much time for this task. There are lots of variants and diseases included in ClinVar, so that interpretations of the variants associated with non-ischemic SCD from the latest research cannot be expected to update promptly. Additionally, some problems in ClinVar are mentioned, such as classification discrepancies between ClinVar and laboratories [54, 55], and out‐of‐date interpretations of some variants [54]. VarSome integrates massive information from multiple databases to provide a comprehensive view for human variation and enables the community to freely and easily share knowledge on them [45]. If a user wants to realize details of the variants that he has collected, VarSome is a very convenient search engine and powerful database to provide information. Nevertheless, he cannot directly browse organized information about the variants associated with non-ischemic SCD. Another concern is that annotations of relationships between diseases and variants reported by users’ contribution could be insufficient owing to minor usage of users studied in non-ischemic SCD.

Discussions

Summary

This work devises a frequently updated database called SVAD by continuously surveying pertinent research articles to make the database become a major repository for linking associations of non-ischemic SCD and human genetic variants. A total of 2,292 entries within 1,239 associated genetic variants in 12 key genes were included from 232 articles. The key genes were selected according to our ICDscreening panel (unpublished), and this panel included only the established genes with significant clinical impact, high prevalence, and clear and relevant pathogenetic mechanisms. The SVAD currently represents the most comprehensive source of information regarding non-ischemic SCD-associated variants, thereby providing an overview of known genetic information. To investigate the relationships between disease and variants, information on the clinical significance of variants was retrieved from ClinVar. For elucidating the biological meanings of the reported nonsynonymous mutations, the CADD tool was implemented to provide in silico functional prediction of variants. To clarify whether a variation is a rare variant and to improve interpretation of variants in ethnically diverse populations, allele frequency data from the 1000 Genomes, ExAC and Taiwan Biobank is included. The pathogenicity of each genetic variant could be accurately emphasized when these data are integrated, and it may lead to new insights into the molecular mechanisms underlying inherited CDs in different subtypes or populations. This comprehensive collection of genetic data about non-ischemic SCD caused by inherited CD represents a valuable source of integrated information on the spectrum of disease-associated variations, thereby benefiting clinicians and researchers. Researchers and clinicians can rapidly verify whether the variation of interest has been published and obtain the supporting evidence of pathogenicity.

Limitations

There are some limitations to this work. At this first release, we focus on the 12 key genes, which are related to the five types of inherited CD (HCM, ARVC, LQTS, BrS, and CPVT) causing non-ischemic SCD, with significant clinical impact, high prevalence, and clear and relevant pathogenetic mechanisms. We are preparing for collecting articles and integrating information on the other related genes of these inherited CDs, and it is expected to complete at the next release. For the novel investigated genes, they are not included because their clinical significances, prevalence, and mechanisms might be unclear. We will evaluate the strength of supporting evidence of variants and provide an indicator for inclusion to describe their associations and importance in a further update. The inclusion of the five inherited CDs is not a constraint for our works. We will gradually link the associations of variants to other inherited CDs causing non-ischemic SCD, such as short-QT syndrome and dilated cardiomyopathy, in the next years. Presently, associations of the five types of inherited CD and variants were retrieved from the literature published from 2011 to 2018. The collection and curation of associations that appeared in literature published in previous years will be complemented as soon as possible. Additionally, a large number of related genes and a high proportion of family-specific variations in inherited CDs make it a complicated disorder [19]. Although cosegregation data is important for evaluating the pathogenicity strength of variants, it is dispersed in literature [56]. It is a complicated task and we will spend much time and effort to systematically collect, retrieve, and validate cosegregation data in numerous publications.

Perspective works

Three prospective works need to be performed in the near future. First, future work involving the proposed database should include more data about associations of genetic variants and other types of inherited CD. Second, the predicted functional status and pathogenicity of the probable inherited CD-associated variants will be regularly updated and revised. Third, to address the heterogeneity of studies, the impacts of varying allele frequency of variants in various populations should be evaluated. Bias is observed in the genetic studies of disease-associated variants because the majority of them are discovered in Caucasian populations [47, 57, 58]. Our collected data also show a similar phenomenon (Table 3). It was also indicated that the risk of SCD was also possibly influenced by race and ethnicity [59]. Besides, following the ACMG guideline, a variation is considered “benign” when its allele frequency is equal to or higher than 5% in a population [39]. Nevertheless, the discrepancy is observed in the allele frequency of variants in other understudied ethnically diverse populations [41]. Realizing differences of allele frequencies across populations could provide new insights into the pathogenicity of some specific variants, which could help in developing a scoring method for evaluating the influence of variants in various populations. In response to the rapid growth of genetic data, natural language processing techniques will be implemented to effectively screen a large number of studies to collect information about types of inherited CD, population, and the number of included samples, experimental methods, the panel of genes, disease-associated variants and cosegregation data. The established method will reduce the effort for the curators. Collecting evidence of molecular regulation from omics data is another direction to figure out the big picture of non-ischemic SCD.

Supporting information

S1 File. The list of curated genetic variants and PMID of available articles in SVAD.

https://doi.org/10.1371/journal.pone.0237731.s001

(TXT)

References

  1. 1. Zipes DP, Wellens HJ. Sudden cardiac death. Circulation. 1998;98(21):2334–51. pmid:9826323.
  2. 2. Risgaard B, Winkel BG, Jabbari R, Behr ER, Ingemann-Hansen O, Thomsen JL, et al. Burden of sudden cardiac death in persons aged 1 to 49 years: nationwide study in Denmark. Circulation Arrhythmia and electrophysiology. 2014;7(2):205–11. pmid:24604905.
  3. 3. Wong CX, Brown A, Lau DH, Chugh SS, Albert CM, Kalman JM, et al. Epidemiology of Sudden Cardiac Death: Global and Regional Perspectives. Heart, lung & circulation. 2019;28(1):6–14. pmid:30482683.
  4. 4. Zaman S, Goldberger JJ, Kovoor P. Sudden Death Risk-Stratification in 2018–2019: The Old and the New. Heart, lung & circulation. 2019;28(1):57–64. pmid:30482684.
  5. 5. Hua W, Zhang LF, Wu YF, Liu XQ, Guo DS, Zhou HL, et al. Incidence of sudden cardiac death in China: analysis of 4 regional populations. Journal of the American College of Cardiology. 2009;54(12):1110–8. pmid:19744622.
  6. 6. Kitamura T, Iwami T, Kawamura T, Nitta M, Nagao K, Nonogi H, et al. Nationwide improvements in survival from out-of-hospital cardiac arrest in Japan. Circulation. 2012;126(24):2834–43. pmid:23035209.
  7. 7. Cha WC, Lee SC, Shin SD, Song KJ, Sung AJ, Hwang SS. Regionalisation of out-of-hospital cardiac arrest care for patients without prehospital return of spontaneous circulation. Resuscitation. 2012;83(11):1338–42. pmid:22446564.
  8. 8. Murakoshi N, Aonuma K. Epidemiology of arrhythmias and sudden cardiac death in Asia. Circulation journal: official journal of the Japanese Circulation Society. 2013;77(10):2419–31. pmid:24067274.
  9. 9. Larsen MK, Christiansen SL, Hertz CL, Frank-Hansen R, Jensen HK, Banner J, et al. Targeted molecular genetic testing in young sudden cardiac death victims from Western Denmark. International journal of legal medicine. 2020;134(1):111–21. pmid:31729605.
  10. 10. Winkel BG, Risgaard B, Bjune T, Jabbari R, Lynge TH, Glinge C, et al. Gender differences in sudden cardiac death in the young-a nationwide study. BMC cardiovascular disorders. 2017;17(1):19. pmid:28061807; PubMed Central PMCID: PMC5219679.
  11. 11. Winkel BG, Risgaard B, Sadjadieh G, Bundgaard H, Haunso S, Tfelt-Hansen J. Sudden cardiac death in children (1–18 years): symptoms and causes of death in a nationwide setting. European heart journal. 2014;35(13):868–75. pmid:24344190.
  12. 12. Margey R, Roy A, Tobin S, O'Keane CJ, McGorrian C, Morris V, et al. Sudden cardiac death in 14- to 35-year olds in Ireland from 2005 to 2007: a retrospective registry. Europace: European pacing, arrhythmias, and cardiac electrophysiology: journal of the working groups on cardiac pacing, arrhythmias, and cardiac cellular electrophysiology of the European Society of Cardiology. 2011;13(10):1411–8. pmid:21798877.
  13. 13. Bardai A, Berdowski J, van der Werf C, Blom MT, Ceelen M, van Langen IM, et al. Incidence, causes, and outcomes of out-of-hospital cardiac arrest in children. A comprehensive, prospective, population-based study in the Netherlands. Journal of the American College of Cardiology. 2011;57(18):1822–8. pmid:21527156.
  14. 14. Winkel BG, Holst AG, Theilade J, Kristensen IB, Thomsen JL, Ottesen GL, et al. Nationwide study of sudden cardiac death in persons aged 1–35 years. European heart journal. 2011;32(8):983–90. pmid:21131293.
  15. 15. Papadakis M, Sharma S, Cox S, Sheppard MN, Panoulas VF, Behr ER. The magnitude of sudden cardiac death in the young: a death certificate-based review in England and Wales. Europace: European pacing, arrhythmias, and cardiac electrophysiology: journal of the working groups on cardiac pacing, arrhythmias, and cardiac cellular electrophysiology of the European Society of Cardiology. 2009;11(10):1353–8. pmid:19700472.
  16. 16. Vaartjes I, Hendrix A, Hertogh EM, Grobbee DE, Doevendans PA, Mosterd A, et al. Sudden death in persons younger than 40 years of age: incidence and causes. European journal of cardiovascular prevention and rehabilitation: official journal of the European Society of Cardiology, Working Groups on Epidemiology & Prevention and Cardiac Rehabilitation and Exercise Physiology. 2009;16(5):592–6. pmid:19587604.
  17. 17. Vatta M, Spoonamore KG. Use of genetic testing to identify sudden cardiac death syndromes. Trends in cardiovascular medicine. 2015;25(8):738–48. pmid:25864170.
  18. 18. Chanavat V, Janin A, Millat G. A fast and cost-effective molecular diagnostic tool for genetic diseases involved in sudden cardiac death. Clinica chimica acta; international journal of clinical chemistry. 2016;453:80–5. pmid:26688388.
  19. 19. Hellenthal N, Gaertner-Rommel A, Klauke B, Paluszkiewicz L, Stuhr M, Kerner T, et al. Molecular autopsy of sudden unexplained deaths reveals genetic predispositions for cardiac diseases among young forensic cases. Europace: European pacing, arrhythmias, and cardiac electrophysiology: journal of the working groups on cardiac pacing, arrhythmias, and cardiac cellular electrophysiology of the European Society of Cardiology. 2017;19(11):1881–90. pmid:29016939.
  20. 20. Jimenez-Jaimez J, Alcalde Martinez V, Jimenez Fernandez M, Bermudez Jimenez F, Rodriguez Vazquez Del Rey MDM, Perin F, et al. Clinical and Genetic Diagnosis of Nonischemic Sudden Cardiac Death. Rev Esp Cardiol (Engl Ed). 2017;70(10):808–16. pmid:28566242.
  21. 21. Semsarian C, Ingles J, Wilde AA. Sudden cardiac death in the young: the molecular autopsy and a practical approach to surviving relatives. European heart journal. 2015;36(21):1290–6. pmid:25765769.
  22. 22. Ranthe MF, Winkel BG, Andersen EW, Risgaard B, Wohlfahrt J, Bundgaard H, et al. Risk of cardiovascular disease in family members of young sudden cardiac death victims. European heart journal. 2013;34(7):503–11. pmid:23150455.
  23. 23. Kaltman JR, Thompson PD, Lantos J, Berul CI, Botkin J, Cohen JT, et al. Screening for sudden cardiac death in the young: report from a national heart, lung, and blood institute working group. Circulation. 2011;123(17):1911–8. pmid:21537007.
  24. 24. Ferrero-Miliani L, Holst AG, Pehrson S, Morling N, Bundgaard H. Strategy for clinical evaluation and screening of sudden cardiac death relatives. Fundamental & clinical pharmacology. 2010;24(5):619–35. pmid:20698891.
  25. 25. Aro AL, Chugh SS. Prevention of Sudden Cardiac Death in Children and Young Adults. Progress in pediatric cardiology. 2017;45:37–42. pmid:29056836; PubMed Central PMCID: PMC5646418.
  26. 26. Pilmer CM, Kirsh JA, Hildebrandt D, Krahn AD, Gow RM. Sudden cardiac death in children and adolescents between 1 and 19 years of age. Heart rhythm. 2014;11(2):239–45. pmid:24239636.
  27. 27. Meyer L, Stubbs B, Fahrenbruch C, Maeda C, Harmon K, Eisenberg M, et al. Incidence, causes, and survival trends from cardiovascular-related sudden cardiac arrest in children and young adults 0 to 35 years of age: a 30-year review. Circulation. 2012;126(11):1363–72. pmid:22887927.
  28. 28. Maron BJ. Sudden death in young athletes. The New England journal of medicine. 2003;349(11):1064–75. pmid:12968091.
  29. 29. Maron BJ, Doerer JJ, Haas TS, Tierney DM, Mueller FO. Sudden deaths in young competitive athletes: analysis of 1866 deaths in the United States, 1980–2006. Circulation. 2009;119(8):1085–92. pmid:19221222.
  30. 30. Pua CJ, Bhalshankar J, Miao K, Walsh R, John S, Lim SQ, et al. Development of a Comprehensive Sequencing Assay for Inherited Cardiac Condition Genes. Journal of cardiovascular translational research. 2016;9(1):3–11. pmid:26888179; PubMed Central PMCID: PMC4767849.
  31. 31. Allegue C, Coll M, Mates J, Campuzano O, Iglesias A, Sobrino B, et al. Genetic Analysis of Arrhythmogenic Diseases in the Era of NGS: The Complexity of Clinical Decision-Making in Brugada Syndrome. PloS one. 2015;10(7):e0133037. pmid:26230511; PubMed Central PMCID: PMC4521779.
  32. 32. Bauce B, Nava A, Beffagna G, Basso C, Lorenzon A, Smaniotto G, et al. Multiple mutations in desmosomal proteins encoding genes in arrhythmogenic right ventricular cardiomyopathy/dysplasia. Heart rhythm. 2010;7(1):22–9. pmid:20129281.
  33. 33. Girolami F, Ho CY, Semsarian C, Baldi M, Will ML, Baldini K, et al. Clinical features and outcome of hypertrophic cardiomyopathy associated with triple sarcomere protein gene mutations. Journal of the American College of Cardiology. 2010;55(14):1444–53. pmid:20359594.
  34. 34. Judge DP, Rouf R. Use of genetics in the clinical evaluation and management of heart failure. Current treatment options in cardiovascular medicine. 2010;12(6):566–77. pmid:21063933.
  35. 35. Ackerman MJ, Priori SG, Willems S, Berul C, Brugada R, Calkins H, et al. HRS/EHRA expert consensus statement on the state of genetic testing for the channelopathies and cardiomyopathies: this document was developed as a partnership between the Heart Rhythm Society (HRS) and the European Heart Rhythm Association (EHRA). Europace: European pacing, arrhythmias, and cardiac electrophysiology: journal of the working groups on cardiac pacing, arrhythmias, and cardiac cellular electrophysiology of the European Society of Cardiology. 2011;13(8):1077–109. pmid:21810866.
  36. 36. Priori SG, Blomstrom-Lundqvist C, Mazzanti A, Blom N, Borggrefe M, Camm J, et al. 2015 ESC Guidelines for the management of patients with ventricular arrhythmias and the prevention of sudden cardiac death: The Task Force for the Management of Patients with Ventricular Arrhythmias and the Prevention of Sudden Cardiac Death of the European Society of Cardiology (ESC). Endorsed by: Association for European Paediatric and Congenital Cardiology (AEPC). European heart journal. 2015;36(41):2793–867. pmid:26320108.
  37. 37. Sturm AC. Genetic testing in the contemporary diagnosis of cardiomyopathy. Current heart failure reports. 2013;10(1):63–72. pmid:23135967.
  38. 38. Hansen BL, Jacobsen EM, Kjerrumgaard A, Tfelt-Hansen J, Winkel BG, Bundgaard H, et al. Diagnostic yield in victims of sudden cardiac death and their relatives. Europace: European pacing, arrhythmias, and cardiac electrophysiology: journal of the working groups on cardiac pacing, arrhythmias, and cardiac cellular electrophysiology of the European Society of Cardiology. 2020. pmid:32307520.
  39. 39. Richards S, Aziz N, Bale S, Bick D, Das S, Gastier-Foster J, et al. Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Genetics in medicine: official journal of the American College of Medical Genetics. 2015;17(5):405–24. pmid:25741868; PubMed Central PMCID: PMC4544753.
  40. 40. Fellmann F, van El CG, Charron P, Michaud K, Howard HC, Boers SN, et al. European recommendations integrating genetic testing into multidisciplinary management of sudden cardiac death. European journal of human genetics: EJHG. 2019. pmid:31235869.
  41. 41. Itoh H, Dochi K, Shimizu W, Denjoy I, Ohno S, Aiba T, et al. A Common Mutation of Long QT Syndrome Type 1 in Japan. Circulation journal: official journal of the Japanese Circulation Society. 2015;79(9):2026–30. pmid:26118460.
  42. 42. Sherry ST, Ward MH, Kholodov M, Baker J, Phan L, Smigielski EM, et al. dbSNP: the NCBI database of genetic variation. Nucleic acids research. 2001;29(1):308–11. pmid:11125122; PubMed Central PMCID: PMC29783.
  43. 43. Kircher M, Witten DM, Jain P, O'Roak BJ, Cooper GM, Shendure J. A general framework for estimating the relative pathogenicity of human genetic variants. Nature genetics. 2014;46(3):310–5. pmid:24487276; PubMed Central PMCID: PMC3992975.
  44. 44. Landrum MJ, Lee JM, Riley GR, Jang W, Rubinstein WS, Church DM, et al. ClinVar: public archive of relationships among sequence variation and human phenotype. Nucleic acids research. 2014;42(Database issue):D980–5. pmid:24234437; PubMed Central PMCID: PMC3965032.
  45. 45. Kopanos C, Tsiolkas V, Kouris A, Chapple CE, Albarca Aguilera M, Meyer R, et al. VarSome: the human genomic variant search engine. Bioinformatics. 2019;35(11):1978–80. pmid:30376034; PubMed Central PMCID: PMC6546127.
  46. 46. Lek M, Karczewski KJ, Minikel EV, Samocha KE, Banks E, Fennell T, et al. Analysis of protein-coding genetic variation in 60,706 humans. Nature. 2016;536(7616):285–91. pmid:27535533; PubMed Central PMCID: PMC5018207.
  47. 47. Genomes Project C, Auton A, Brooks LD, Durbin RM, Garrison EP, Kang HM, et al. A global reference for human genetic variation. Nature. 2015;526(7571):68–74. pmid:26432245; PubMed Central PMCID: PMC4750478.
  48. 48. Lin JC, Fan CT, Liao CC, Chen YS. Taiwan Biobank: making cross-database convergence possible in the Big Data era. GigaScience. 2018;7(1):1–4. pmid:29149267; PubMed Central PMCID: PMC5774504.
  49. 49. den Dunnen JT, Dalgleish R, Maglott DR, Hart RK, Greenblatt MS, McGowan-Jordan J, et al. HGVS Recommendations for the Description of Sequence Variants: 2016 Update. Human mutation. 2016;37(6):564–9. pmid:26931183.
  50. 50. Lazzarini E, Jongbloed JD, Pilichou K, Thiene G, Basso C, Bikker H, et al. The ARVD/C genetic variants database: 2014 update. Human mutation. 2015;36(4):403–10. pmid:25676813.
  51. 51. Stenson PD, Mort M, Ball EV, Shaw K, Phillips A, Cooper DN. The Human Gene Mutation Database: building a comprehensive mutation repository for clinical and molecular genetics, diagnostic testing and personalized genomic medicine. Human genetics. 2014;133(1):1–9. pmid:24077912; PubMed Central PMCID: PMC3898141.
  52. 52. Stenson PD, Mort M, Ball EV, Evans K, Hayden M, Heywood S, et al. The Human Gene Mutation Database: towards a comprehensive repository of inherited mutation data for medical research, genetic diagnosis and next-generation sequencing studies. Human genetics. 2017;136(6):665–77. pmid:28349240; PubMed Central PMCID: PMC5429360.
  53. 53. Genest J. Access Denied: The Controversy of Commercial Genetic Databases. The Canadian journal of cardiology. 2016;32(11):1295–6. pmid:27161824.
  54. 54. Landrum MJ, Kattman BL. ClinVar at five years: Delivering on the promise. Human mutation. 2018;39(11):1623–30. pmid:30311387.
  55. 55. Gradishar W, Johnson K, Brown K, Mundt E, Manley S. Clinical Variant Classification: A Comparison of Public Databases and a Commercial Testing Laboratory. The oncologist. 2017;22(7):797–803. pmid:28408614; PubMed Central PMCID: PMC5507641.
  56. 56. Amin AS, Wilde AAM. The future of sudden cardiac death research. Progress in pediatric cardiology. 2017;45:49–54.
  57. 57. Andreasen C, Nielsen JB, Refsgaard L, Holst AG, Christensen AH, Andreasen L, et al. New population-based exome data are questioning the pathogenicity of previously cardiomyopathy-associated genetic variants. European journal of human genetics: EJHG. 2013;21(9):918–28. pmid:23299917; PubMed Central PMCID: PMC3746259.
  58. 58. Manrai AK, Funke BH, Rehm HL, Olesen MS, Maron BA, Szolovits P, et al. Genetic Misdiagnoses and the Potential for Health Disparities. The New England journal of medicine. 2016;375(7):655–65. pmid:27532831; PubMed Central PMCID: PMC5292722.
  59. 59. Reinier K, Rusinaru C, Chugh SS. Race, ethnicity, and the risk of sudden death(). Trends in cardiovascular medicine. 2019;29(2):120–6. pmid:30029848; PubMed Central PMCID: PMC6324984.