Association of an IGHV3-66 gene variant with Kawasaki disease...
AbstractIn a meta-analysis of three GWAS for susceptibility to Kawasaki disease (KD) conducted in Japan, Korea, and Taiwan and follow-up studies with a total of 11,265 subjects (3428 cases and 7837 controls), a significantly associated SNV in the immunoglobulin heavy variable gene (IGHV) cluster in 14q33.32 was identified (rs4774175; OR鈥?鈥?.20, P鈥?鈥?.0鈥壝椻€?0鈭?). Investigation of nonsynonymous SNVs of the IGHV cluster in 9335 Japanese subjects identified the C allele of rs6423677, located in IGHV3-66, as the most significant reproducible association (OR鈥?鈥?.25, P鈥?鈥?.8鈥壝椻€?0鈭?0 in 3603 cases and 5731 controls). We observed highly skewed allelic usage of IGHV3-66, wherein the rs6423677 A allele was nearly abolished in the transcripts in peripheral blood mononuclear cells of both KD patients and healthy adults. Association of the high-expression allele with KD strongly indicates some active roles of B-cells or endogenous immunoglobulins in the disease pathogenesis. Considering that significant association of SNVs in the IGHV region with disease susceptibility was previously known only for rheumatic heart disease (RHD), a complication of acute rheumatic fever (ARF), these observations suggest that common B-cell related mechanisms may mediate the symptomology of KD and ARF as well as RHD. IntroductionKawasaki disease (KD) is an acute systemic vasculitis syndrome characterized by high fever, bilateral conjunctivitis, polymorphous skin rash, reddening of lips and oral cavity, changes in extremities and nonsuppurative cervical lymphadenopathy [1] and predominantly affects infants and children younger than 5 years [2]. In many cases, KD is self-limiting. However, it causes coronary artery complications such as dilatations and aneurysms (coronary artery lesions; CALs) in 20鈥?5% of untreated patients [3]. Replacing acute rheumatic fever (ARF), KD is now the leading cause of acquired heart diseases of children in developed countries [4]. Most KD patients are treated with high dose intravenous immunoglobulin (IVIG) infusion combined with oral aspirin, which was established in the 1980s and 1990s and has become a standard treatment that is effective at resolving inflammation and reducing CALs [5,6,7]. However, the mechanism of IVIG action on KD has not been revealed, and 10鈥?5% of KD patients do not respond to the treatment and have a higher risk for CAL. Recently a series of genome-wide association studies (GWAS) revealed several definitive KD susceptibility loci [8,9,10]. However, these genetic factors can explain only a part of the etiology. Also, the reason for the high incidence among East Asian children [11], which is up to 20-fold higher than those in Western countries and is a crucial epidemiologic feature of KD, has not yet been explained. It might be attributed in part to lack of statistical power in the previous GWAS analyses that were carried out using modest sample-sizes, and therefore, many common genetic factors with relatively low penetrance may have gone undetected. In this study, to identify novel susceptibility loci for KD, we conducted a meta-analysis of GWAS from Japan, Korea, and Taiwan, the three countries with the highest incidence of KD in the world.Materials and methodsSubjects and design of association analyses in identifying novel susceptibility loci for KDIn this study, susceptibility loci for KD were screened and verified in a two-stage association analysis (Fig.聽1). Stage 1 was a whole-genome meta-analysis of results from GWAS conducted in Japan [9], Taiwan [10], and Korea [12] involving 6362 individuals. We carried out follow-up association studies using three case-control panels comprised of 1418 KD patients and 1700 controls (Japanese), 473 KD patients and 484 controls (Korean) and 261 KD patients and 567 controls (Taiwanese) independent from the subjects in the GWAS and performed meta-analyses with the Stage 1 results (Stage 2). Loci for the follow-up studies were selected based on their predicted potential to achieve P values less than 5.0鈥壝椻€?0鈭? in the meta-analyses, with the prediction made by iterative simulations of follow-up studies with virtual case and control cohorts (detailed below). To further validate the associations of rs4774175 and rs6423677 in 14q32.33, an additional 1758 KD cases and 653 controls collected in Japan were used. The number of KD cases and controls, as well as platforms in the three previous GWAS and follow-up studies in Japan, Korea, and Taiwan are summarized in Supplementary Table聽1.Fig. 1Flow of the screening of the novel susceptibility loci for KD in this studyFull size imageWhole-genome imputation and meta-analysesFor the Stage 1 analysis, each study center鈥檚 genotype data for the Illumina Human Hap550/610 or Affymetrix SNP 6.0 arrays (Supplementary Table聽1) was oriented to the forward strand of the hg19 human reference genome. Genotype data were filtered for minimum quality-control parameter cutoffs such as HWE-P鈥夆墺鈥?.0001, Call-rate 鈮?.95, and minor allele frequency (MAF)鈥夆墺鈥?.01. Each study center performed whole-genome imputation of autosomal variants with 572 East Asian haplotypes from the 1000 Genomes Project Phase 1 Version 3 (ref. [13]) as reference using pre-phasing with SHAPEIT v1 (ref. [14]) and imputation with IMPUTE2 and minimac softwares [15, 16].Imputed genotype data were analyzed by each study center in a case-control logistic regression analysis, and the output was merged in the R statistics environment (URL: https://www.R-project.org/) and filtered for variants that were polymorphic in both cases and controls (MAFcases 鈮?.01 and MAFcontrols 鈮?.01) and had info 鈮?.4 in all three studies鈥?dataset; there were 6,265,045 SNVs in the final filtered dataset. A fixed-effects meta-analysis of the three data sets鈥?beta-coefficients and standard errors was performed using the R package metafor (https://cran.r-project.org/package=metafor).Linkage disequilibrium-based region definitionEach chromosome鈥檚 SNVs were filtered for those with a meta-analysis P鈥?lt;鈥?.01 and then labelled based on linkage disequilibrium to regional 鈥渢op SNVs鈥? Briefly, SNVs with P鈥?lt;鈥?鈥壝椻€?0鈭? were sorted by P value, the top SNV identified, and any SNVs within 卤5鈥塎b that had r2鈥?gt;鈥?.1 to that top SNV assigned to its region; data on a chromosome was processed as such until no SNVs with P鈥?lt;鈥?鈥壝椻€?0鈭? were remaining. Each top SNV region was labelled based on chromosome and minimum and maximum positions of the linked SNVs. Based on that process, each region label would be unique, but regions could overlap with each other. Calculation of r2 was performed using the ld function in the R Bioconductor snpStats package using East Asian genotype data from the 1000 Genomes Phase 1 Version 3 release. P value simulationFor simplicity, we will refer to the regions of nominally linked SNVs described above as 鈥渓oci鈥? To perform the Stage 2 follow-up study efficiently, loci that had a high potential of achieving P values less than 5.0鈥壝椻€?0鈭? in a meta-analysis of Stage 1 and 2 results were selected by P value simulation as follows. For each locus, we first selected any nominally associated SNVs (P鈥?lt;鈥?.001) identified in the Stage 1 analysis, and for each SNV created a virtual set of 100,000 case genotypes and a virtual set of 100,000 control genotypes for each of the three Stage 1 case-control panels. Each virtual set was generated based on the genotype frequencies observed in that panel鈥檚 Stage 1 cases or controls at a particular SNV, and each of those virtual sets was then re-sampled in R to randomize their order. Each virtual case and control set was then sampled 100 times to create 100 virtual case-control cohorts; the numbers of cases and controls sampled from the virtual genotype sets were the case-control counts that were expected in the three collaborators鈥?follow-up studies (RIKEN in Japan: ncases鈥?鈥?300, ncontrols鈥?鈥?300; Academia Sinica in Taiwan: ncases鈥?鈥?00, ncontrols鈥?鈥?000; Asan Medical Center, University of Ulsan in Korea: ncases鈥?鈥?12, ncontrols鈥?鈥?00). For each iteration, associations of the candidate SNVs were evaluated in a meta-analysis of the virtual cohorts and the three GWAS data sets. For each candidate SNV, the frequency of observing P values of 5.0鈥壝椻€?0鈭? or smaller in 100 iterations of the simulated meta-analysis was scored as the simulation score. Loci with at least one SNV having simulation scores of 0.8 or higher were considered to be promising, and for de novo genotyping, a representative SNV was selected for which assays were designable across the different platforms employed in each research center (Invader in Japan, Sequenom MassARRAY or TaqMan in Taiwan and VeraCode GoldenGate Genotyping kit or TaqMan in Korea, respectively) (Supplementary Table聽1).Genotyping of SNVs in IGHV genesNonsynonymous SNVs in IGHV genes were genotyped basically by the Invader Assay. Primers and the probes were carefully designed in order to ensure specificity of the assay. We refrained from using multiplex PCR to avoid both expected and unexpected nonspecific amplification of DNA fragments of high sequence homology which will allow cross reaction between amplicons and probes for different loci. Sequences of the primers and probes for 18 nonsynonymous SNVs in IGHV genes and rs4774175, as well as representative genotyping results of rs6423677, are provided in Supplementary Tables聽2 and 3 and Supplementary Fig.聽1, respectively.Next-generation sequencing (NGS) of IGHV repertoiresTwo milliliters of venous blood was drawn from patients who were admitted to the hospitals for KD at four time points including (1) acute phase before receiving IVIG (3鈥? illness days), (2) 48鈥塰 after the patients became afebrile (8鈥?8 illness days), (3) the first follow-up visit to the pediatric clinic after discharge (17鈥?0 days after the disease onset), and (4) the second follow-up visit to the pediatric clinic after discharge (3鈥? months after the disease onset). Blood samples were collected into Vacutainer CPT Cell Preparation Tube (BD) and mononuclear cells were separated according to the manufacturer鈥檚 instruction. Total RNA from the mononuclear cells was extracted by using the RNeasy Mini Kit (QIAGEN). 1.0鈥壩糶 of RNA was reverse transcribed with PrimeScript (TAKARA) and the mixed oligonucleotides of random hexamer and oligo-dT primers. Isotype-specific libraries for NGS were prepared as follows. Mixed forward primers covering the framework region 1 of 7 subgroups of IGHV genes (V1-V7) [17] and reverse primers specific to each IGHC gene for IgM, IgD, IgG, and IgA (including a partial Illumina adapter sequence in the 5鈥?ends of both primers) were designed for the 1st round PCR. Sequences of the primers are provided in Supplementary Table聽4. 6-base barcode sequence and the full Illumina adapter sequence were added at 5鈥?and 3鈥?ends of the immunoglobulin amplicons in the 2nd round PCR. The barcode sequences were used to distinguish the patients and the sampling time points. The libraries were sequenced with MiSeq Reagent Kit v3 (600-cycle) (Illumina).Forward and reverse sequence reads were merged by using FLASh software [18]. Sequences unmerged due to insufficient overlapping length were excluded from subsequent analyses. Merged sequence reads were classified into isotypes and subclasses based on primer sequences by using blast software (https://blast.ncbi.nlm.nih.gov/Blast.cgi), and then quality filtering and removing the primer sequence were performed by using the FASTX-Toolkit (http://hannonlab.cshl.edu/fastx_toolkit/index.html).Repertoire analyses of immunoglobulin heavy chainsImmunoglobulin repertoires were determined by migmap-1.0.2 software (https://github.com/mikessh/migmap). Sequence reads that terminated prematurely or had a complementarity determining region (CDR) 3 that was non-canonical (i.e., lacking consensus amino acids at both ends or not fully mapped) were excluded from subsequent analyses. Clonotypes were defined by combinations of IGHV, IGH diversity (IGHD), and IGH joining (IGHJ) gene alleles. Correlation trend between the proportions of the IGH clonotypes using IGHV3-66 and the C allele counts at rs6423677 was evaluated by the Jonckheere-Terpstra test using R package PMCMR (https://cran.r-project.org/package=PMCMR). Change of each clonotype proportion from baseline (the 1st sampling point) was calculated by subtracting the baseline proportion from those at 2nd, 3rd, and 4th points. The diversity of CDR3 clonotypes was evaluated by the inverse Simpson鈥檚 index using the R package vegan (https://cran.r-project.org/package=vegan).ResultsSignificant associations in the meta-analyses of three GWAS data setsIn the Stage 1 analysis, a meta-analysis of three GWAS data sets, genome-wide level associations were seen in known susceptibility loci such as FAM167A-BLK (rs2736340; P鈥?鈥?.23鈥壝椻€?0鈭?6), ITPKC (rs28493229; P鈥?鈥?.07鈥壝椻€?0鈭?), CASP3 (rs2720377; P鈥?鈥?.66鈥壝椻€?0鈭?) and CD40 (rs1883832; P鈥?鈥?.76鈥壝椻€?0鈭?). Four top SNVs in HLA class III~ class II regions (rs2844485, rs73729123, rs7739458, and rs146650659) and rs13330932 in the 16q24.1 region also showed significant associations (Supplementary Table聽5, Supplementary Fig.聽2). From a standard test for inflation, the genomic control inflation factor 位GC鈥?鈥?.0216 (Supplementary Fig.聽2).Follow-up of analyses of 49 most promising lociIn a simulation of meta-analyses, a 鈥渟imulation score鈥?was calculated for each nominally associated SNV (P鈥夆墹鈥?.001; see Methods). Forty-nine loci were identified with one or more SNVs expected to satisfy a P value threshold of 5.0鈥壝椻€?0鈭? with simulation score of 0.8 or higher (Supplementary Table聽6). From every locus, we selected representative SNVs for genotyping in replication sample panels and then performed a meta-analysis with data from the three GWAS. For one particular SNV locus (rs181511609) that satisfied the criteria, a genotyping assay could not be designed due to the complexity of the surrounding sequence. As a result, 12 out of 48 examined loci showed significant associations (Table聽1, Supplementary Table聽7). These included four previously known susceptibility gene loci such as CASP3, FAM167A-BLK, ITPKC, and CD40, seven on chromosome 6p21 (NC_000006.11: 25.5鈥?2.78鈥塎b) and one in the immunoglobulin heavy variable gene (IGHV) region on chromosome 14q32.33. The trend of association for rs13330932 on chromosome 16q24.1 was not replicated in the follow-up sample panels from the three populations (Supplementary Table聽7).Table 1 Loci achieving P values鈥?lt;鈥?.0鈥壝椻€?0鈭? in the meta-analyses of Stage 1 and 2 dataFull size tableAssociation signals that newly achieved genome-wide significance after the meta-analyses of three GWAS and follow-up association studiesA significant association of rs2857151 located near HLA-DOB and HLA-DQB2 genes with KD susceptibility was reported in the earlier Japanese GWAS [9]. However, the present study revealed that the association statistics for rs2857151 were not consistent across the other East Asian populations (Supplementary Table聽8). Instead, 7 out of 12 groups of SNVs in the 6p21 region examined in the Stage 2 analyses showed significant association in the meta-analyses of the data sets in the three GWAS as well as in the follow-up studies (Table聽1 and Supplementary Fig.聽3A). To verify that those seven signals were associated independently from rs2857151, we calculated LD between these SNVs and rs2857151, and we also performed logistic regression analyses conditioning on rs2857151 using the Japanese Stage 2 sample set. Consistent with information from 1000 Genomes, the seven candidates were not in high LD with each other (Supplementary Fig.聽3B). The association was most significant for rs2857151, and in the conditioned logistic regression analyses, the P value for rs2071473, which was located just 19鈥塳b distal to and in marginal LD with rs2857151 (D鈥测€?鈥?.99, r2鈥?鈥?.36), became nonsignificant (P鈥?gt;鈥?.05). After conditioning, P values for the other SNVs including rs2857151 itself increased but were still significant (P鈥夆墹鈥?.05). (Supplementary Table聽9). These included SNVs with previous information on their functional significance or association with diseases such as rs2857602, where the associated allele is linked to the A allele at rs2239704 (r2鈥?鈥?.0 in the 1000 Genomes JPT population), which has been associated with reduced expression of lymphotoxin A [19], and rs7775228, which was previously associated with asthma in a Japanese population [20]. Thus, further investigation including that in each ethnic group is needed to unravel the involvement of the variants in this region in KD susceptibility.In the meta-analysis of the Stage 1 and 2 studies, a significant association was also obtained for an SNV within the chromosome 14q32.33 region represented by rs4774175 (P鈥?鈥?.0鈥壝椻€?0鈭?) (Table聽1). Five hundred and nineteen SNVs, linked with rs4774175 (r2鈥?gt;鈥?.1), and having simulation score of 0.5 or higher were distributed across a 340鈥塳b region (106.88鈥?07.22鈥塎b) of this locus where many IGHV genes as well as their pseudogenes are clustered in tandem. Similar to the HLA region, most IGHV genes harbor common nonsynonymous SNVs within them contributing to the high level of diversity observed within the immunoglobulin heavy chain repertoire. Therefore, we proceeded to analyze the nonsynonymous SNVs in that locus. Among the 519 SNVs, 18 were nonsynonymous and the others were intergenic, intronic, in the untranslated region, or synonymous SNVs. When the Japanese sample set used in the Stage 2 analysis was genotyped for these 18 nonsynonymous SNVs, rs6423677 in the IGHV3-66 gene showed the most significant association (Table聽2). As expected, some other nonsynonymous SNVs also showed a similar trend of association. However, those associations, including that of rs4774175, were considered to be explained by LD with rs6423677 because they were no longer significant after conditioning the logistic regression analyses on rs6423677 (Table聽2). In contrast, rs6423677 remained significant (P鈥?lt;鈥?.05) after adjusting for any of the other SNVs (data not shown). In a Stage 3 analysis, rs4774175 and rs6423677 were further examined in an additional Japanese sample set, and rs6423677 achieved genome-wide significance in a meta-analysis of the Japanese sample panels (Table聽3). Meanwhile, the association trend of rs6423677 was weaker in Korean and unreproducible in Taiwanese subjects (Supplementary Table聽10).Table 2 Association of the 18 nonsynonymous SNVs of IGHV genes as candidates and rs4774175Full size tableTable 3 Association of rs6423677 and KD in the Japanese case and control panelsFull size tablers6423677 genotypes and IGHV3-66 gene usage in the immunoglobulin heavy chain repertoireThe significant association of an SNV within IGHV3-66 with KD prompted us to investigate the impact of rs6423677 on the function of immunoglobulin molecules or B cell receptors (BCRs) harboring IGHV3-66 in their heavy chain variable regions. Firstly, we examined IGHV usage in different immunoglobulin isotypes (IgM, IgD, IgG, and IgA) by analyzing the immunoglobulin heavy chain repertoire in peripheral B lymphocytes from healthy Japanese individuals (n鈥?鈥?0) and patients with acute KD (n鈥?鈥?) by NGS. For every Ig isotype, we could identify IGHV3-66 clonotypes in each individual and found that the IGHV3-66 usage tended to be correlated with the number of C alleles each individual had (Fig.聽2A and Supplementary Fig.聽4). Consistent with the correlation trend, we observed significantly skewed usage of the alleles in the analyses of the five patients who were heterozygous for rs6423677, with the protective allele (A) almost completely suppressed in all immunoglobulin classes and time points examined (Supplementary Fig.聽5). Similar correlation trend was also observed between rs6423677 genotypes and relative levels of IGH transcript for IGHV3-64, the closest neighboring gene to IGHV3-66. However, the correlations were more modest than that seen for IGHV3-66 (Supplementary Fig.聽6). Paralogues or pseudogenes of IGHV3-66 which have the 鈥楢鈥?nucleotide at the corresponding position of rs6432677 as well as PCR amplification bias in library preparation caused by additional variants interfering with the primer annealing could lead to such results. However, the distinct separation pattern of the allele discrimination plot in the Invader Assay and the clear electropherogram data in the direct sequencing of PCR-amplified genomic DNA from heterozygous patients does not support that possibility (Supplementary Fig.聽1). Thus, we consider these observations not to be artificial in origin. We also negated the chance of misassignment of IGHV3-66 with the A allele at rs6423677 (IGHV3-66*03) to other alleles (*01, *02, or *04) or paralogues such as IGHV3-53 by confirming that the migmap software could correctly assign an artificially created sequence with the rs6423677 A allele as IGHV3-66*03 (data not shown).Fig. 2 Effect of rs6423677 genotypes on the relative expression level of immunoglobulin heavy chain transcripts with IGHV3-66. A The proportion of clonotypes with IGHV3-66 among all IGH transcripts displayed by rs6423677 genotype for each immunoglobulin isotype. Ten healthy adult volunteers (upper) and nine KD patients (lower) were analyzed. For KD patients, mean data of proportions at the four evaluation points were plotted. The increasing trend of the proportions according to the number of the C allele at rs6423677 was statistically evaluated with Jonckheere-Terpstra trend test. *P鈥?lt;鈥?.005, 鈥?/sup>
P鈥?lt;鈥?.01. B Time-course chan
GE of the sequence reads proportions of immunoglobulin heavy chain transcripts with IGHV3-66 of acute KD patients. Evaluation was done on four time points including (1) acute phase before receiving IVIG (3鈥? illness days), (2) 48鈥塰 after the patients became afebrile (8鈥?8 illness days), (3) the first follow-up visit to the pediatric clinic after discharge (17鈥?0 days after the disease onset) and (4) the second follow-up visit to the pediatric clinic after discharge (3鈥? months after the disease onset).In each panel, symbols and lines of red, green, and blue colors are representing KD patients with genotypes of AA, AC, and CC at rs6423677, respectively. The analyses were carried out for four immunoglobulin classes (IgM, IgD, IgG, and IgA)Full size imageA feature of the immunoglobulin repertoire of peripheral B lymphocytes from KD patientsTo obtain an insight into the role of immunoglobulins using IGHV3-66 in their heavy chain variable region in the pathogenesis of KD, we investigated how IGHV3-66 usage in the immunoglobulin heavy chain gene repertoire changed over time during the clinical courses of nine KD patients (Supplementary Table聽11). Proportions of IGH clones harboring IGHV3-66 in their variable region seemed to be stable or not to have common varying patterns for IgM, IgD, and IgA, while in two patients (patients 2 and 5), transient increases were observed for IgG at the third evaluation point (Fig.聽2B). Then, we investigated the increased clonotypes in the IgG heavy chain repertoire more precisely by defining them by the combinations of IGHV, IGHD, and IGHJ genes. We found that both patients 2 and 5 had single clonotypes with IGHV3-66 that increased 1.0% or more as a proportion of the total from the first to the third evaluation points. However, the gene combinations of them were not the same (Table聽4), and CDR3 amino acid sequences corresponding to the V-D-J combinations also differed between the patients (data not shown). Five of the remaining seven patients also had one or more IgG heavy chain clonotypes with IGHV3-66 that increased 0.1% or more at the same follow-up time point. However, cooccurring V-D-J combinations in the increased clonotypes were only seen between patients 2 and 6.Table 4 IgG heavy chain clonotypes using IGHV3-66 that increased more than 0.1% as a proportion at the third evaluation pointFull size tableOn the other hand, some of the CDR3 clonotypes which have recently been reported to dominate in the IgM heavy chain repertoire ( 0.001%) in the acute pretreatment phase of Taiwanese KD patients [21] were also prevalent at a similar time point in the Japanese KD patients in this study (Supplementary Table聽12). A distinct increase in diversity of the CDR3 clonotypes of IgM heavy chain during the convalescent phase of KD, which has also been reported in previous research of Taiwanese KD patients [21], was observed for four of the nine KD patients (Supplementary Fig.聽7).DiscussionIt has generally been considered that the immune-mediated vasculitis of KD is triggered in response to infection with some type of microorganism. This assumption was made based on factors indicative of a primary infection, such as its clinical symptoms, which included fever and skin rash, epidemiologic findings such as its peak age of onset (6 months to 1 year), along with the seasonality of the occurrence of regional outbreaks and nation-wide epidemics [2, 11]. Despite tremendous efforts, no single microorganism has been conclusively proven as the pathogen of KD and lack of information about the pathogen has been a significant obstacle to causal treatment and disease prevention.Histopathological and immunological studies have revealed activation of neutrophils, macrophages, and monocytes in the acute phase of KD [22, 23], and now the leading hypothesis for the pathophysiology of KD inflammation is an attack of the dysregulated or hyperactive innate immune system against vascular walls [24]. In addition, genetic studies using a genome-wide approach have identified several robust susceptibility loci/genes for KD [8,9,10, 12, 25,26,27,28,29,30,31,32], and an in silico prediction of responsible variants and the types of cells where they have biological significance has highlighted the importance of B cells in the pathogenesis of KD [33]. In previous literature, polyclonal activation and increase of B cells [34, 35] and detection of auto-antibodies against components of vascular endothelial cells or neutrophils in peripheral blood of acute KD patients have been documented [36, 37]. Infiltration of oligoclonal plasma cells producing IgA into the vascular wall of KD patients was also reported [38]. These previous findings have suggested the involvement of B cells in KD inflammation. However, a recent transcriptomic study revealed downregulation of genes related to BCR signaling in acute KD patients [39]. Thus, there have been both supportive and unsupportive pieces of evidence, and there is no consensus view on the role of B cells, as well as of the adaptive immune system in KD pathogenesis. In this study, we identified genetic variants within the IGHV region significantly associated with KD in the Japanese population. The immunoglobulin heavy (IGH) locus spans 1.3鈥塎b (from 106.0 to 107.3鈥塎b on NC_000014.8) of the chromosome 14q32.33 region and encompasses serially arranged IGH constant (IGHC), IGHJ, IGHD, and IGHV clusters. Because of high sequence redundancy which obstructs designing of specific genotyping assays, there is a large blank area (~0.8鈥塎b) which was not covered by the SNP arrays and therefore we lack association information about the SNVs in the area (Supplementary Fig.聽8). However, because SNVs within the gap are not in high LD (r2鈥?lt;鈥?.3) with rs6423677 or rs4774175 in the 1000 genomes JPT population (Supplementary Fig.聽8) and we did not observe association signals in the chromosomal region on the opposite side of the blank area, the association signals represented by rs6423677 or rs4774175 might be localized within the region where we could investigate in this study.In contrast to the robust association of rs6423677 in the Japanese subjects, it was not consistent in the Korean and Taiwanese subjects. Although neither significant nor consistent with the results in the current study, a marginal trend of association between SNVs within the IGHV region and linked to rs4774175 (r2鈥?鈥?.45鈥?.66 in the 1000genomes CHB population) and KD in the Taiwanese population had already been reported [28]. Findings in the previous and the current studies are strongly indicating that variants in the IGHV region are commonly involved in KD pathogenesis at least in the Japanese and Taiwanese populations. However, the robustness of association might not be uniform among the populations. Given that multiple pathogens with regionally or seasonally differing epidemic patterns could be triggering KD, it might be possible that various susceptibility genes or alleles corresponding to different antigenicity for the pathogens exist in this locus and are related to the mixed robustness of the association.Highly skewed expression of IGH transcripts with the risk-associated allele adds support to consider that IGHV3-66 is the functional target of the associated variants. The A to C nucleotide change at rs6423677 results in an amino acid alteration from cysteine to glycine within the CDR2 region of IGHV3-66. Thus, the modification might affect the affinity to some antigens of immunoglobulins carrying IGHV3-66. However, we consider that the mechanism of increased susceptibility to KD associated with the C allele might not be due to reduced host defense to particular agents because it is inconsistent with the observation that the A allele, which is protective against KD and expected to have a higher neutralizing ability in this scenario, seemed to be nearly silenced. IGHV3-66 has been recognized as one of the functional IGHV genes and purported to be utilized at frequencies of around 1% [40]. The average proportions of IGHV3-66 clonotypes in the 10 healthy adults in our study (0.88% for IgM, 0.50% for IgD, 1.0% for IgG, and 1.1% for IgA), were consistent with that previous information. It is currently unknown in what stage, i.e., somatic recombination, RNA transcription, and processing of the pre-mRNA, the A allele was excluded and resulted in the significantly skewed allelic usage to the C allele of rs6423677. It is suggestive that usage of IGHV3-64, the nearest functional gene to IGHV3-66, showed difference among genotypes at rs6423677 which was similar to that of IGHV3-66 (Supplementary Fig.聽6). One potential reason is suggested by the predicted binding of CTCF transcription repressor in the 0.4鈥塳b region encompassing rs6423677 (Supplementary Fig.聽9). CTCF has been reported to interact with multiple sites in the IGH region and plays essential roles in somatic recombination [41] of the distal area of the IGHV gene region. The association data of rs6423677 and the increased usage of IGH transcripts with the risk-associated allele are indicative of some active role of the immunoglobulin molecules as antibodies or as components of BCRs in the development of KD. One possible role of such immunoglobulins might be activation of B cells. Established KD susceptibility gene products such as B lymphoid tyrosine kinase (BLK) [8,9,10] and Inositol 1,4,5 trisphosphate 3-kinase C (ITPKC) [8, 9] are involved in BCR signaling. If IVIG acts through competing with such immunoglobulins for agents or antigens relevant to KD pathogenesis, the requirement of a high dose administration (1鈥?鈥塯/kg) of IVIG to treat KD might be reasonable because only a fraction of the IgG would contribute to the therapeutic effect, with IGHV3-66 expected to only account for up to several percent of the IVIG preparations. B cells can be activated by nonspecific binding of microbial products such as superantigens (SAgs) to BCRs. Intriguingly, B cell SAgs restricted to IGHV3 segment of immunoglobulins have been known [42]. However, considering that the innate immune system has been thought to play a central role in the KD vasculitis and that KD can develop in patients with X-linked agammaglobulinemia, who lack or have small numbers of B cells [43], the activity of B cells or immunoglobulins in KD might be relevant to initiation or enhancement of the innate immune activation but may be substitutable.Recently, a significant association of an allele of the IGHV4-61 gene (IGHV4-61*02) with susceptibility to rheumatic heart disease (RHD), which is a long-term complication of ARF, was reported [44]. IGHV4-61 is located only 36鈥塳b downstream of IGHV3-66 (Supplementary Fig.聽6). ARF develops as a sequela of Streptococcus pyogenes (S. pyogenes) infection and, similar to KD, has been recognized to affect genetically susceptible individuals [45]. Among reports of GWAS for human diseases, a genome-wide significant association of variants in the IGHV gene region has been identified only for RHD. Although S. pyogenes is not recognized as the cause of KD, considering that KD shares some characteristic symptoms such as skin rash and strawberry tongue with S. pyogenes infection, it is suggestive that the previously discussed role of B cells might be related to some underlying mechanisms of the common symptoms.In 2020, multiple series of patients with Pediatric Inflammatory Multisystem Syndrome Temporally Associated with SARS-CoV2 infection (PIMS-TS) or (MIS-C) having KD-like symptoms or increase of severe KD patients after the SARS-CoV2 epidemic were reported from the US and European countries [46, 47]. In the latest studies, overrepresentation of IGHV3-53 and IGHV3-66 in neutralizing monoclonal antibodies against the receptor binding domain of SARS-CoV2 spike were also reported [48, 49]. Given immunoglobulin with IGHV3-66 play a role in KD, it might be possible that KD-like symptoms seen in such patients are mediated by interaction between SARS-CoV2 and B cells expressing IGHV3-66.We also found some commonality between the CDR3 clonotypes that were increased in the IgM heavy chains of the Taiwanese and the Japanese KD patients (Supplementary Table聽12). IGHV3-66 was used only in one commonly increased CDR3 clonotype (Supplementary Table聽13). However, as far as can be understood from the limited number of observations, IGHV3-66 seemed not to be important in the IgM response in the acute pretreatment phase.Future characterization of the endogenous immunoglobulin molecules that are increased in KD patients utilizing information of the light chains that can be obtained simultaneously in single-cell analyses [50] will facilitate identification of the agent triggering KD as well as understanding the mechanism of action of IVIG treatment.There are limitations in this study. First, in addition to the gap above where we could not examine the association of the variants, our strategy to focus only on nonsynonymous SNVs on IGHV genes left the possibility that rs6423677 is just a proxy of the genuinely responsible variant located outside IGHV3-66. Second, we did not analyze the time-course change of the immunoglobulin heavy chain repertoire in other infectious diseases. So it is uncertain whether the upregulation of IgG heavy chain transcripts with IGHV3-66 is a specific observation for KD or not. Third, our results might not directly reflect changes of the immunoglobulins at the protein level because we lack information about the correlation between the proportions of particular IGH clones in the transcripts from B cells and in the proteins expressed on the cell surface or circulating in the serum.In conclusion, a significant association of a nonsynonymous SNV in the IGHV3-66 gene with KD was observed. Further intensified study of the association in this region and repertoire analyses of immunoglobulins in different ethnicities and subpopulations of the patients with different demographic features would give insights into both the role of B cells in the KD pathogenesis and the causal agent of the disease. References1.Kawasaki T. Acute febrile mucocutaneous syndrome with lymphoid involvement with specific desquamation of the fingers and toes in children. (Japanese) Arerugi 1967; 16: 178鈥?22. English translation by Shike H, Burns JC, Shimizu C. Pediatr Infect Dis J. 2000;21:993鈥?. Google Scholar聽 2.Makino N, Nakamura Y, Yashiro M, Sano T, Ae R, Kosami K, et al. Epidemiological observations of Kawasaki disease in Japan, 2013-2014. Pediatr Int. 2018;60:581鈥?.PubMed聽Google Scholar聽 3.Kato H, Koike S, Yamamoto M, Ito Y, Yano E. Coronary aneurysms in infants and young children with acute febrile mucocutaneous lymph node syndrome. J Pediatr. 1975;86:892鈥?.CAS聽 PubMed聽Google Scholar聽 4.Taubert KA, Rowley AH, Shulman ST. Nationwide survey of Kawasaki disease and acute rheumatic fever. J Pediatr. 1991;119:279鈥?2.CAS聽 PubMed聽Google Scholar聽 5.Furusho K, Sato K, Soeda T, Matsumoto H, Okabe T, Hirota T, et al. High-dose intravenous gammaglobulin for Kawasaki disease. Lancet. 1984;2:1055鈥?.CAS聽 PubMed聽Google Scholar聽 6.Newburger JW, Takahashi M, Burns JC, Beiser AS, Chung KJ, Duffy CE, et al. The treatment of Kawasaki syndrome with intravenous gamma globulin. N. Engl J Med. 1986;315:341鈥?.CAS聽 PubMed聽Google Scholar聽 7.Newburger JW, Takahashi M, Beiser AS, Burns JC, Bastian J, Chung KJ, et al. A single intravenous infusion of gamma globulin as compared with four infusions in the treatment of acute Kawasaki syndrome. N. Engl J Med. 1991;324:1633鈥?.CAS聽 PubMed聽Google Scholar聽 8.Khor CC, Davila S, Breunis WB, Lee YC, Shimizu C, Wright VJ, et al. Genome-wide association study identifies FCGR2A as a susceptibility locus for Kawasaki disease. Nat Genet. 2011;43:1241鈥?.CAS聽 PubMed聽Google Scholar聽 9.Onouchi Y, Ozaki K, Burns JC, Shimizu C, Terai M, Hamada H, et al. A genome-wide association study identifies three new risk loci for Kawasaki disease. Nat Genet. 2012;44:517鈥?1.CAS聽 PubMed聽Google Scholar聽 10.Lee YC, Kuo HC, Chang JS, Chang LY, Huang LM, Chen MR, et al. Two new susceptibility loci for Kawasaki disease identified through genome-wide association analysis. Nat Genet. 2012;44:522鈥?.CAS聽 PubMed聽Google Scholar聽 11.Burns JC, Kushner HI, Bastian JF, Shike H, Shimizu C, Matsubara T, et al. Kawasaki disease: a brief history. Pediatrics. 2000;106:E27.CAS聽 PubMed聽Google Scholar聽 12.Kim JJ, Hong YM, Sohn S, Jang GY, Ha KS, Yun SW, et al. A genome-wide association analysis reveals 1p31 and 2p13.3 as susceptibility loci for Kawasaki disease. Hum Genet. 2011;129:487鈥?5.PubMed聽Google Scholar聽 13.1000 Genomes Project Consortium, Abecasis GR, Auton A, Brooks LD, DePristo MA, Durbin RM, Handsaker RE, et al. An integrated map of genetic variation from 1,092 human genomes. Nature. 2012;491:56鈥?5. Google Scholar聽 14.Delaneau O, Marchini J, Zagury JF. A linear complexity phasing method for thousands of genomes. Nat Methods. 2012;9:179鈥?1.CAS聽Google Scholar聽 15.Howie B, Marchini J, Stephens M. Genotype imputation with thousands of genomes. G3. 2011;1:457鈥?0.PubMed聽Google Scholar聽 16.Howie B, Fuchsberger C, Stephens M, Marchini J, Abecasis GR. Fast and accurate genotype imputation in genome-wide association studies through pre-phasing. Nat Genet. 2012;44:955鈥?.CAS聽 PubMed聽 PubMed Central聽Google Scholar聽 17.Gao F, Lin E, Feng Y, Mack WJ, Shen Y, Wang K. Characterizing immunoglobulin repertoire from whole blood by a personal genome sequencer. PLoS One. 2013;8:e75294.CAS聽 PubMed聽 PubMed Central聽Google Scholar聽 18.Mago膷 T, Salzberg SL. FLASH: fast length adjustment of short reads to improve genome assemblies. Bioinformatics. 2011;27:2957鈥?3.PubMed聽 PubMed Central聽Google Scholar聽 19.Knight JC, Keating BJ, Kwiatkowski DP. Allele-specific repression of lymphotoxin-alpha by activated B cell factor-1. Nat Genet. 2004;36:394鈥?.CAS聽 PubMed聽Google Scholar聽 20.Hirota T, Takahashi A, Kubo M, Tsunoda T, Tomita K, Doi S, et al. Genome-wide association study identifies three new susceptibility loci for adult asthma in the Japanese population. Nat Genet. 2011;43:893鈥?.CAS聽 PubMed聽 PubMed Central聽Google Scholar聽 21.Ko TM, Kiyotani K, Chang JS, Park JH, Yin YP, Chen YT, et al. Immunoglobulin profiling identifies unique signatures in patients with Kawasaki disease during intravenous immunoglobulin treatment. Hum Mol Genet. 2018;27:2671鈥?.CAS聽 PubMed聽 PubMed Central聽Google Scholar聽 22.Takahashi K, Oharaseki T, Naoe S, Wakayama M, Yokouchi Y. Neutrophilic involvement in the damage to coronary arteries in acute stage of Kawasaki disease. Pediatr Int. 2005;47:305鈥?0.PubMed聽Google Scholar聽 23.Koga M, Ishihara T, Takahashi M, Umezawa Y, Furukawa S. Activation of peripheral blood monocytes and macrophages in Kawasaki disease: ultrastructural and immunocytochemical investigation. Pathol Int. 1998;48:512鈥?.CAS聽 PubMed聽Google Scholar聽 24.Hara T, Nakashima Y, Sakai Y, Nishio H, Motomura Y, Yamasaki S. Kawasaki disease: a matter of innate immunity. Clin Exp Immunol. 2016;186:134鈥?3.CAS聽 PubMed聽 PubMed Central聽Google Scholar聽 25.Onouchi Y, Gunji T, Burns JC, Shimizu C, Newburger JW, Yashiro M, et al. ITPKC functional polymorphism associated with Kawasaki disease susceptibility and formation of coronary artery aneurysms. Nat Genet. 2008;40:35鈥?2.CAS聽 PubMed聽Google Scholar聽 26.Onouchi Y, Ozaki K, Burns JC, Shimizu C, Hamada H, Honda T, et al. Common variants in CASP3 confer susceptibility to Kawasaki disease. Hum Mol Genet. 2010;19:2898鈥?06.CAS聽 PubMed聽 PubMed Central聽Google Scholar聽 27.Burgner D, Davila S, Breunis WB, Ng SB, Li Y, Bonnard C, et al. A genome-wide association study identifies novel and functionally related susceptibility Loci for Kawasaki disease. PLoS Genet. 2009;5:e1000319.PubMed聽 PubMed Central聽Google Scholar聽 28.Tsai FJ, Lee YC, Chang JS, Huang LM, Huang FY, Chiu NC, et al. Identification of novel susceptibility Loci for Kawasaki disease in a Han chinese population by a genome-wide association study. PLoS One. 2011;6:e16853.CAS聽 PubMed聽 PubMed Central聽Google Scholar聽 29.Khor CC, Davila S, Shimizu C, Sheng S, Matsubara T, Suzuki Y, et al. Genome-wide linkage and association mapping identify susceptibility alleles in ABCC4 for Kawasaki disease. J Med Genet. 2011;48:467鈥?2.CAS聽 PubMed聽Google Scholar聽 30.Shimizu C, Eleftherohorinou H, Wright VJ, Kim J, Alphonse MP, Perry JC, et al. Genetic variation in the SLC8A1 calcium signaling pathway is associated with susceptibility to Kawasaki disease and coronary artery abnormalities. Circ Cardiovasc Genet. 2016;9:559鈥?8.CAS聽 PubMed聽Google Scholar聽 31.Kim J, Shimizu C, Kingsmore SF, Veeraraghavan N, Levy E, Dos Santos AMR, et al. Whole genome sequencing of an African American family highlights toll like receptor 6 variants in Kawasaki disease susceptibility. PLoS One. 2017;12:e0170977.PubMed聽 PubMed Central聽Google Scholar聽 32.Kim JJ, Yun SW, Yu JJ, Yoon KL, Lee KY, Kil HR, et al. A genome-wide association analysis identifies NMNAT2 and HCP5 as susceptibility loci for Kawasaki disease. J Hum Genet. 2017;62:1023鈥?.CAS聽 PubMed聽Google Scholar聽 33.Farh KK, Marson A, Zhu J, Kleinewietfeld M, Housley WJ, Beik S, et al. Genetic and epigenetic fine mapping of causal autoimmune disease variants. Nature. 2015;518:337鈥?3.CAS聽 PubMed聽Google Scholar聽 34.Leung DY, Siegel RL, Grady S, Krensky A, Meade R, Reinherz EL, et al. Immunoregulatory abnormalities in mucocutaneous lymph node syndrome. Clin Immunol Immunopathol. 1982;23:100鈥?2.CAS聽 PubMed聽Google Scholar聽 35.Furukawa S, Matsubara T, Yabuta K. Mononuclear cell subsets and coronary artery lesions in Kawasaki disease. Arch Dis Child. 1982;67:706鈥?. Google Scholar聽 36.Leung DY, Collins T, Lapierre LA, Geha RS, Pober JS. Immunoglobulin M antibodies present in the acute phase of Kawasaki syndrome lyse cultured vascular endothelial cells stimulated by gamma interferon. J Clin Investig. 1986;77:1428鈥?5.CAS聽 PubMed聽Google Scholar聽 37.Savage CO, Tizard J, Jayne D, Lockwood CM, Dillon MJ. Antineutrophil cytoplasm antibodies in Kawasaki disease. Arch Dis Child. 1989;64:360鈥?.CAS聽 PubMed聽 PubMed Central聽Google Scholar聽 38.Rowley AH, Shulman ST, Spike BT, Mask CA, Baker SC. Oligoclonal IgA response in the vascular wall in acute Kawasaki disease. J Immunol. 2001;166:1334鈥?3.CAS聽 PubMed聽Google Scholar聽 39.Ikeda K, Yamaguchi K, Tanaka T, Mizuno Y, Hijikata A, Ohara O, et al. Unique activation status of peripheral blood mononuclear cells at acute phase of Kawasaki disease. Clin Exp Immunol. 2010;160:246鈥?5.CAS聽 PubMed聽 PubMed Central聽Google Scholar聽 40.Lange MD, Huang L, Yu Y, Li S, Liao H, Zemlin M, et al. Accumulation of VH replacement products in IgH genes derived from autoimmune diseases and anti-viral responses in human. Front Immunol. 2014;5:345.PubMed聽 PubMed Central聽Google Scholar聽 41.Guo C, Yoon HS, Franklin A, Jain S, Ebert A, Cheng HL, et al. CTCF-binding elements mediate control of V(D)J recombination. Nature. 2011;477:424鈥?0.CAS聽 PubMed聽 PubMed Central聽Google Scholar聽 42.Silverman GJ, Sasano M, Wormsley SB. Age-associated changes in binding of human B lymphocytes to a VH3-restricted unconventional bacterial antigen. J Immunol. 1993;151:5840鈥?5.CAS聽 PubMed聽Google Scholar聽 43.Behniafard N, Aghamohammadi A, Abolhassani H, Pourjabbar S, Sabouni F, Rezaei N. Autoimmunity in X-linked agammaglobulinemia: Kawasaki disease and review in the literature. Expert Rev Clin Immunol. 2012;8:155鈥?.CAS聽 PubMed聽Google Scholar聽 44.Parks T, Mirabel MM, Kado J, Auckland K, Nowak J, Rautanen A, et al. Pacific Islands rheumatic heart disease genetics network. Association between a common immunoglobulin heavy chain allele and rheumatic heart disease risk in Oceania. Nat Commun. 2017;8:14946.PubMed聽 PubMed Central聽Google Scholar聽 45.Carapetis JR, Currie BJ, Mathews JD. Cumulative incidence of rheumatic fever in an endemic region: a guide to the susceptibility of the population? Epidemiol Infect. 2000;124:239鈥?4.CAS聽 PubMed聽 PubMed Central聽Google Scholar聽 46.Riphagen S, Gomez X, Gonzalez-Martinez C, Wilkinson N, Theocharis P. Hyperinflammatory shock in children during COVID-19 pandemic. Lancet. 2020;395:1607鈥?.CAS聽 PubMed聽 PubMed Central聽Google Scholar聽 47.Whittaker E, Bamford A, Kenny J, Kaforou M, Jones CE, Shah P, et al. Clinical characteristics of 58 children with a pediatric inflammatory multisystem syndrome temporally associated With SARS-CoV-2. JAMA. 2020;324:259鈥?9.CAS聽 PubMed聽Google Scholar聽 48.Cao Y, Su B, Guo X, Sun W, Deng Y, Bao L, et al. Potent neutralizing antibodies against SARS-CoV-2 identified by high-throughput single-cell sequencing of convalescent patients鈥?B cells. Cell. 2020;182:73鈥?4.CAS聽 PubMed聽 PubMed Central聽Google Scholar聽 49.Barnes CO, West AP Jr, Huey-Tubman KE, Hoffmann MAG, Sharaf NG, Hoffman PR, et al. Structures of human antibodies bound to SARS-CoV-2 spike reveal common epitopes and recurrent features of antibodies. Cell. 2020;182:828鈥?2.CAS聽 PubMed聽 PubMed Central聽Google Scholar聽 50.DeKosky BJ, Ippolito GC, Deschner RP, Lavinder JJ, Wine Y, Rawlings BM, et al. High-throughput sequencing of the paired human immunoglobulin heavy and light chain repertoire. Nat Biotechnol. 2013;31:166鈥?.CAS聽 PubMed聽 PubMed Central聽Google Scholar聽 Download referencesAcknowledgementsThis study was supported by grants from the Millennium Project, from the Japan Kawasaki Disease Research Center (2015 to YM, and 2018 and 2019 to YO), and from the Japan Agency for Medical Research and Development (JP18ek0410039 to TH). This study was also supported by a grant from the Ministry of Health Welfare of the Republic of Korea (HI15C1575 to JKL). We are grateful to the KD patients and their family members as well as the medical staff taking care of the patients. We also thank Ms. Yoshie Kikuchi for her technical assistance.Author informationAuthor notesThese authors contributed equally: Todd A. Johnson, Yoichi Mashimo, Jer-Yuarn Wu, Dankyu YoonAffiliationsLaboratory for Medical Science Mathematics, RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa, 230-0045, JapanTodd A. Johnson聽 聽Tatsuhiko TsunodaDepartment of Public Health, Chiba University Graduate School of Medicine, Chiba, Chiba, 260-8670, JapanYoichi Mashimo,聽Akira Hata聽 聽Yoshihiro OnouchiInstitute of Biomedical Sciences, Academia Sinica, Taipei, 11529, TaiwanJer-Yuarn Wu,聽Chien-Hsiun Chen,聽Yi-Min Liu,聽Li-Ching Chang,聽Chun-Ping Chang聽 聽Yuan-Tsong ChenDivision of Allergy and Chronic Respiratory Diseases, Center for Biomedical Sciences, Korea National Institute of Health, Osong, 28160, KoreaDankyu YoonRIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa, 230-0045, JapanMichiaki KuboLaboratory for Statistical and Translational Genetics, RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa, 230-0045, JapanAtsushi TakahashiDepartment of Genomic Medicine, Research Institute, National Cerebral and Cardiovascular Center, Suita, Osaka, 564-8565, JapanAtsushi TakahashiDepartment of Biological Sciences, Graduate School of Science, the University of Tokyo, Tokyo, 113-0033, JapanTatsuhiko TsunodaDepartment of Medical Science Mathematics, Medical Research Institute, Tokyo Medical and Dental University, Tokyo, 113-8510, JapanTatsuhiko TsunodaLaboratory for Cardiovascular Genomics and Informatics, RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa, 230-0045, JapanKouichi Ozaki,聽Toshihiro Tanaka,聽Kaoru Ito聽 聽Yoshihiro OnouchiDivision for Genomic Medicine, Medical Genome Center, National Center for Geriatrics and Gerontology, Obu, Aich, 474-8511, JapanKouichi OzakiDepartment of Human Genetics and Disease Diversity, Graduate School of Medical and Dental Sciences, Tokyo Medical and Dental University, Tokyo, 113-8510, JapanToshihiro TanakaDepartment of Pediatrics, Wakayama Medical University, Wakayama, Wakayama, 641-8509, JapanHiroyuki SuzukiDepartment of Pediatrics, Tokyo Women鈥檚 Medical University Yachiyo Medical Center, Yachiyo, Chiba, 276-8524, JapanHiromichi HamadaDepartment of Management and Strategy, Clinical Research Center, National Center for Child Health and Development, Tokyo, 157-8535, JapanTohru KobayashiFukuoka Children鈥檚 Hospital, Fukuoka, Fukuoka, 813-0017, JapanToshiro HaraInstitute of Cellular and Organismic Biology, Academia Sinica, Taipei, 11529, TaiwanYi-Ching LeeDepartment of Pediatrics, Ewha Womans University College of Medicine, Seoul, 07985, KoreaYoung-Mi HongDepartment of Pediatrics, Korea University Ansan Hospital, Ansan, 15355, KoreaGi-Young JangDepartment of Pediatrics, Chung-Ang University Hospital, Seoul, 06973, KoreaSin-Weon YunDepartment of Pediatrics, Asan Medical Center, Seoul, 05505, KoreaJeong-Jin YuDepartment of Pediatrics, Daejeon St. Mary鈥檚 Hospital, Daejeon, 34943, KoreaKyung-Yil LeeAsan Institute for Life Sciences, University of Ulsan College of Medicine, Seoul, 05505, KoreaJae-Jung Kim聽 聽Jong-Keuk LeeDepartment of Statistics, Seoul National University, Seoul, 08826, KoreaTaesung ParkAuthorsTodd A. JohnsonView author publicationsYou can also search for this author in PubMed聽Google ScholarYoichi MashimoView author publicationsYou can also search for this author in PubMed聽Google ScholarJer-Yuarn WuView author publicationsYou can also search for this author in PubMed聽Google ScholarDankyu YoonView author publicationsYou can also search for this author in PubMed聽Google ScholarAkira HataView author publicationsYou can also search for this author in PubMed聽Google ScholarMichiaki KuboView author publicationsYou can also search for this author in PubMed聽Google ScholarAtsushi TakahashiView author publicationsYou can also search for this author in PubMed聽Google ScholarTatsuhiko TsunodaView author publicationsYou can also search for this author in PubMed聽Google ScholarKouichi OzakiView author publicationsYou can also search for this author in PubMed聽Google ScholarToshihiro TanakaView author publicationsYou can also search for this author in PubMed聽Google ScholarKaoru ItoView author publicationsYou can also search for this author in PubMed聽Google ScholarHiroyuki SuzukiView author publicationsYou can also search for this author in PubMed聽Google ScholarHiromichi HamadaView author publicationsYou can also search for this author in PubMed聽Google ScholarTohru KobayashiView author publicationsYou can also search for this author in PubMed聽Google ScholarToshiro HaraView author publicationsYou can also search for this author in PubMed聽Google ScholarChien-Hsiun ChenView author publicationsYou can also search for this author in PubMed聽Google ScholarYi-Ching LeeView author publicationsYou can also search for this author in PubMed聽Google ScholarYi-Min LiuView author publicationsYou can also search for this author in PubMed聽Google ScholarLi-Ching ChangView author publicationsYou can also search for this author in PubMed聽Google ScholarChun-Ping ChangView author publicationsYou can also search for this author in PubMed聽Google ScholarYoung-Mi HongView author publicationsYou can also search for this author in PubMed聽Google ScholarGi-Young JangView author publicationsYou can also search for this author in PubMed聽Google ScholarSin-Weon YunView author publicationsYou can also search for this author in PubMed聽Google ScholarJeong-Jin YuView author publicationsYou can also search for this author in PubMed聽Google ScholarKyung-Yil LeeView author publicationsYou can also search for this author in PubMed聽Google ScholarJae-Jung KimView author publicationsYou can also search for this author in PubMed聽Google ScholarTaesung ParkView author publicationsYou can also search for this author in PubMed聽Google ScholarJong-Keuk LeeView author publicationsYou can also search for this author in PubMed聽Google ScholarYuan-Tsong ChenView author publicationsYou can also search for this author in PubMed聽Google ScholarYoshihiro OnouchiView author publicationsYou can also search for this author in PubMed聽Google ScholarConsortiaKorean Kawasaki Disease Genetics Consortium, Taiwan Kawasaki Disease Genetics Consortium, Taiwan Pediatric ID Alliance, Japan Kawasaki Disease Genome ConsortiumContributionsJKL, JYW, YTC, and YO supervised the study. JKL, JYW, and YO conceived the study. JYW, TAJ, TT, JKL, and YO designed the study. TAJ, YM, JKL, JYW, and YO wrote the manuscript. AH, HS, HH, TH, and Japan Kawasaki Disease Genome Consortium collected Japanese samples. YMH, GYJ, SWY, JJY, KYL, and Korean Kawasaki Disease Genetics Consortium collected Korean samples. Taiwan Kawasaki Disease Genetics Consortium and Taiwan Pediatric ID Alliance collected Taiwanese samples. YML coordinated the multi-center collaboration in Taiwan as the project manager and collected samples and clinical information. MK performed GWAS assays for the Japanese samples. AT performed statistical analyses for the Japanese GWAS data. DY and TP performed statistical analyses for the Korean GWAS data. JJK conducted a follow-up study (Stage 2) for the Korean samples. CHC performed statistical analyses for the Taiwanese GWAS data and followed-up meta-analyses for the Taiwanese data. YCL supervised the GWAS and replication genotyping pipeline, performed the data analyses. LCC performed statistical analyses for the Taiwanese GWAS data and followed-up meta-analyses for the Taiwanese data. CPC performed genotyping and direct sequencing of Taiwanese samples. TAJ, DY, and CHC conducted the whole-genome imputation. TAJ performed P value simulation and meta-analyses. KO, TT, and KI performed genotyping and direct sequencing of the Japanese samples. YM and YO performed the NGS data analyses for the IGH repertoires.Corresponding authorsCorrespondence to Jer-Yuarn Wu or Jong-Keuk Lee or Yuan-Tsong Chen or Yoshihiro Onouchi.Ethics declarations Conflict of interest The authors declare that they have no conflict of interest. Ethical approval This study was approved by the Institutional Review Board at all involved institutes. Additional informationPublisher鈥檚 note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.Members of the Korean Kawasaki Disease Genetics Consortium, Taiwan Kawasaki Disease Genetics Consortium, Taiwan Pediatric ID Alliance, Japan Kawasaki Disease Genome Consortium are listed in the Supplementary information.Supplementary information