Introduction
Host immune defense mechanisms are coordinated by interferons (IFNs). IFNs elicit cell‐intrinsic antiviral activity as well as cell‐extrinsic inflammatory responses leading to activation and recruitment of diverse immune cells (Sadler & Williams,
2008; Schneider
et al,
2014; Wadman,
2020). The different families of IFNs, type I, II, and III engage these defense mechanisms to different degrees. Type I IFNs are widely known for their antiviral activity (Ivashkiv & Donlin,
2014; McNab
et al,
2015), but in several pathological scenarios there is evidence for a role in triggering inflammation (Makris
et al,
2017). Type II IFN is less known for generating an antiviral state, but for an enhanced activation potential state that allows macrophages to orchestrate recruitment of diverse effector cells and initiate adaptive immune responses (Ivashkiv,
2018). Like type I, type III IFNs have a primary role inducing antiviral activity, but there is less evidence for activating inflammatory mediators (Ank
et al,
2006,
2008; Ye
et al,
2019).
Both type I and type III IFNs activate the same JAK–STAT/IRF signaling pathway and transcription factor, ISGF3 (Mesev
et al,
2019; Stanifer
et al,
2019). While type I IFNs have a broad range of effectiveness due to the ubiquitous expression of the IFNAR subunits on most cell types (de Weerd & Nguyen,
2012), type III IFNs are associated with specific tissues due to the cell type‐specific expression of these cytokines and the cognate IFNλR1 receptor subunit (de Weerd & Nguyen,
2012). The human type I IFN family comprises 13 subtypes of IFN‐α, IFN‐β, IFN‐ε, IFN‐κ, and IFN‐ω. Type III IFNs comprise four members, namely IFN‐λ1, IFN‐λ2, IFN‐λ3, and IFN‐λ4. Altogether, there are 21 IFNs that activate the same transcription factor. This remarkable diversity of ligands activating the same transcription factor raises the question of whether their biological roles are overlapping or distinct. Studies have shown that IFN‐α subtypes share the capacity to elicit antiviral states but differ in their cytostatic effects in a manner that correlates with their affinity for the IFNAR receptor (Jaks
et al,
2007; Schreiber & Piehler,
2015). Among the type I IFNs, IFN‐β has been shown to be particularly potent due to its high binding affinity and sustained life span of the IFN‐IFNAR complex (Kalie
et al,
2008; Schreiber & Piehler,
2015). Among the members of type III IFNs, IFN‐λ3 has the most potent antiviral activity (Dellgren
et al,
2009; Bolen
et al,
2014).
While the roles of type I and III IFNs in cell‐intrinsic immunity have been well‐defined, their regulation of cell‐extrinsic immune functions has been less well characterized. A link between type I IFNs and inflammation has been described in various infections and may involve immune cell recruitment via the expression of monocyte‐recruiting chemokines (Davidson
et al,
2014; Channappanavar
et al,
2016; Makris
et al,
2017; Israelow
et al,
2020). For type III IFNs, the evidence for initiating cell‐extrinsic inflammation functions is less clear (Jordan
et al,
2007; Galani
et al,
2017; Hemann
et al,
2019; Read
et al,
2019). In clinical settings, type I IFNs have been used to treat hepatitis B and COVID‐19, but its adoption as an effective therapeutic has been hampered by its adverse side effects (e.g., flu‐like symptoms and fatigue) associated with inflammatory cytokines (Rijckborst & Janssen,
2010; Reder & Feng,
2014;
2021). In contrast, type III IFN treatments have limited adverse effects, comparable to that of placebo groups (
2021; Feld
et al,
2021; Jagannathan
et al,
2021).
The differential propensity of type I and III IFNs to induce inflammation could be due to cell type‐specific expression of cognate receptors, or differences in signaling responses. In neutrophils, which respond to both type I and III IFNs, a type I IFN‐specific inflammatory gene expression response was reported but not characterized mechanistically (Galani
et al,
2017). However, for lung epithelial cells, key to initiating immune responses during respiratory infections, no such information is available. Importantly, both type I and III IFNs play a role in their cellular responses, as an influenza A viral challenge in primary mouse tracheal epithelial cells deficient in either IFNAR or IFNλR1 resulted in similar gene expression programs (Crotta
et al,
2013). Hence, there is a need to investigate whether and how IFN‐type I and III induce distinct responses within the same cell type.
The molecular mechanisms of signal transduction are well understood. Upon binding their cognate receptors, type I and III IFNs activate the same kinases JAK1 and TYK2, which activate the same transcription factor, ISGF3. ISGF3 is responsible for inducing the expression of hundreds of IFN‐stimulated genes (ISGs; Stark & Darnell,
2012; Schneider
et al,
2014; Mesev
et al,
2019). ISGF3 is a trimer composed of the active, phosphorylated forms of STAT1 and STAT2 along with IRF9. The active trimer translocates to the nucleus where it binds to the promoters of ISGs to induce gene expression. While ISGF3 is the primary transcription factor induced by type I and III IFNs, others such as GAF, MAPK/ERK, and NFκB have also been implicated as IFN‐inducible (Pfeffer,
2011; Pervolaraki
et al,
2017; Mazewski
et al,
2020; Kishimoto
et al,
2021). A mathematical model of the IFN signaling pathway has been established and used to investigate how signaling is affected by prior IFN exposure or conditioning (Maiwald
et al,
2010; Kok
et al,
2020). However, whether and how ISGF3 signaling is distinct in response to type I and III IFN stimulation has not been quantitatively examined.
Here, we investigated whether type I IFN‐β and III IFN‐λ3 generate distinct gene expression responses in lung epithelial cells. We identified a type I IFN‐specific gene expression program that is characterized by inflammatory response mediators. Iterating between quantitative experimentation and mathematical modeling, we investigated the mechanism and report that the dynamic control of a single transcription factor, ISGF3, allows for the distinction. A secondary, potentiated phase of ISGF3 is responsible for the inflammatory gene expression program. We show that the potentiated ISGF3 activation phase is enabled by a stimulus‐contingent positive feedback loop that is engaged at high doses and sustained durations of type I IFN‐β. Our results indicate that stimulus‐specific dynamic control of ISGF3 elicits distinct immune gene expression responses, revealing the importance of transcription network feedback control in the JAK–STAT/IRF signaling system for healthy immunity.
Discussion
Here, we report that in lung epithelial cells type I IFN‐β induces an antiviral gene expression program similar to type III IFN‐λ3 and also a gene expression program that includes the prominent inflammatory instigators IL6, TRAIL, and CCRL2. Using experimental and mathematical modeling, we show that the type I IFN‐specific gene expression program is regulated by the temporal dynamics of ISGF3. By adapting a mathematical model of the IFN signaling pathway to epithelial cells, we predicted and then experimentally confirmed the dose and duration dependency of the ISGF3 temporal dynamics necessary for the type I IFN‐induced inflammatory gene expression program. We also found that the ISGF3‐induced expression of its components, STAT2 and IRF9, constitutes positive feedback loops that are important for regulating ISGF3 temporal dynamics and the gene activation of inflammatory instigators.
Our study emphasizes that the IFN gene expression response is not monolithic but depends on the type, dose, and duration of stimulus exposure. Previous studies have alluded to this by observing that type I and III IFN‐induced gene expression in hepatocytes shows different kinetics (Bolen
et al,
2014; Jilg
et al,
2014). Using IFNAR and IFNλR overexpressing cells, it was found that IFN‐type‐specific kinetic differences were not due to the IFN dose or receptor abundance (Pervolaraki
et al,
2018), but the underlying mechanisms remained obscure. Here, we also identified a group of genes (whose biological functions are primarily in cell‐intrinsic innate defenses) in epithelial cells that show differential kinetics in response to type I and III IFNs. And we identified a group of genes (whose biological functions include cell‐extrinsic inflammation‐induced immune responses) that show substantially different magnitudes such that they may be described as being expressed stimulus‐selectively, specific to high‐dose IFN‐β exposure. We did not find a repressed gene expression program with either IFN stimulus although IFN‐repressed genes (IRepG) have been reported in other experimental systems (Trilling
et al,
2013; Megger
et al,
2017).
We then addressed the underlying mechanism for type I IFN‐specific gene expression. Our results suggest that the stimulus specificity in gene expression responses may be accounted for by stimulus‐specific dynamics of the JAK–STAT/IRF signaling network and need not involve additional pathways that were previously implicated, such as MAPK or NFκB (Pfeffer,
2011; Pervolaraki
et al,
2017; Mitchell
et al,
2019; Mazewski
et al,
2020). We provide evidence that type I and III IFN stimulation results in distinct temporal dynamics of ISGF3; while initial activation is similar, only IFN‐β‐induced ISGF3 shows a potentiated phase. The response selectivity of cell‐extrinsic inflammation‐induced immune genes due to differences in ISGF3 temporal dynamics could be recapitulated with a mathematical model by postulating a degree of ultra‐sensitivity in ISGF3‐promoter interactions. The fact that these genes have a larger distance between their transcription start sites and STAT1 binding sites and thus may require DNA looping during the activation process suggests a plausible explanation for ultrasensitive, nonlinear dose–responses (Kuhlman
et al,
2007; Earnest
et al,
2013; Hou
et al,
2018). Nevertheless, our analysis also revealed substantial heterogeneity within the gene expression response, such that there may be multiple regulatory mechanisms. For example, a coherent feed‐forward loop with ISGF3 and ISGF3‐induced IRF1 may play a role (Forero
et al,
2019). Indeed, a finer gene expression analysis and more IFN stimulation conditions will likely identify additional gene expression patterns, as ultimately every immune response gene is probably subject to unique regulatory control mechanisms (Tong
et al,
2016; Wang
et al,
2021).
Our study addresses the molecular mechanisms by which IFN‐β‐induced IGSF3 temporal dynamics are encoded. Prior mechanistic studies of IFN subtype differences have focused on IFN‐receptor affinity measurements: IFN‐β has high affinity (IFNAR2: 0.2 nM, IFNAR1: 50 nM; Kalie
et al,
2008; Schreiber & Piehler,
2015), while type III IFN‐λ3 binding affinities for its cognate receptors are weaker (IL10Rβ: 850 nM, IFNλR1: 16 μM). This could play a role in the differential cellular responses as engineered interferon variants can have differential innate immune vs inflammatory effects (Mendoza
et al,
2017; Yen
et al,
2022). However, kinetic properties of receptor trafficking, receptor turnover and recycling, ligand half‐life, and autoregulation mediated by feedback loops may be just as relevant. While not the focus of this work, negative regulators (e.g., SOCS1, SOCS3, and USP18) of the IFN signaling pathway likely impact the inactivation phases of the ISGF3 temporal dynamics (Kubo
et al,
2003; Malakhova
et al,
2006; Blumer
et al,
2017). Additional analyses of the negative regulatory mechanisms may reveal stimulus‐specific differences.
Our iterative systems biology analysis using a quantitative dynamical systems model enabled our studies of ISGF3 regulatory mechanisms (Kok
et al,
2020) and emphasizes the importance of the feedback loops in tailoring the gene expression response to type, dose, and duration of exposure. The critical role of IRF9 in determining the magnitude of ISGF3 was previously described (Maiwald
et al,
2010), but here we provide evidence that autoregulation of ISGF3 mediated by IRF9 and STAT2 results in a positive feedback loop that shapes the stimulus‐specific dynamics of ISGF3. Positive feedback loops due to positive autoregulation in transcriptional network motifs are known to determine response times, signal sensitivity and amplification, and system bistability (Xiong & Ferrell,
2003; Mitrophanov & Groisman,
2008; Mitrophanov
et al,
2010). We demonstrate that autoregulation of ISGF3 results in positive feedback loop characteristics but also functions as a filter to distinguish stimulus duration and dose, akin to a coherent feed‐forward loop (Alon,
2007). This occurs because active ISGF3 activates the expression of ISGF3 components (inactive ISGF3) that then require continued kinase activity to produce more active ISGF3 to induce the potentiated phase. A stimulus‐contingent positive feedback loop governs in principle other immune response transcription factors, such as NFκB, which induces the expression of its components RelA, cRel, and p50, as well as AP1. This induces the expression of its components Fos and Jun. Our study here may prompt an examination of their dynamical properties to determine whether they also function as filters of dose and duration potentially enabling the stimulus‐specific activation of target genes. Finally, other feedback loops may also play a role in specifying the dynamic control of ISGF3, particularly at later times beyond the potentiated phase described here.
Our work characterizes a role for type I IFN‐β in cell‐extrinsic inflammation through induction of inflammatory instigator genes in epithelial cells. We report that the gene expression of inflammatory regulators IL‐6, TNFSF10, the gene that encodes for TNF‐related apoptosis‐inducing ligand (TRAIL), and CCRL2 is tightly controlled and dependent on potentiated activity of ISGF3. Previous reports have described a link between type I IFNs and IL‐6 and TNFSF10 in respiratory illnesses (Hadjadj
et al,
2020; Galani
et al,
2021), as the latter induces epithelial cell apoptosis (Chaperot
et al,
2006; Herold
et al,
2008; Davidson
et al,
2014). CCRL2 is a nonsignaling transmembrane receptor that presents chemerin, which recruits leukocytes. The detrimental effects of inflammation are prevented in CCRL2‐deficient mice (Schioppa
et al,
2020).
Precise control of IFN responses is critically important, as misregulation of IFNs can result in immunopathogenesis and autoimmunity or exacerbate the pathogenesis of respiratory viral infections (Baechler
et al,
2003; Banchereau & Pascual,
2006; Blanco‐Melo
et al,
2020; Hadjadj
et al,
2020; Galani
et al,
2021). While high IFN‐λ1 levels were associated with lower SARS‐CoV‐2 viral loads and faster clearance in a cohort of COVID‐19 patients (Galani
et al,
2021), a higher IFN‐β‐to IFN‐λ1 ratio was linked to longer hospitalization. Similarly, in a SARS‐CoV‐1 infection model late activation of type I IFN contributed to lung pathology and morbidity (Channappanavar
et al,
2016). This highlights the need to further explore type I IFN cell‐extrinsic roles during immunopathogenesis.
Misregulation of IFNs and their signaling responses leading to disease has also been linked to genetic variants. Inborn errors of immunity (IEI) have been identified for genetic variants of STAT1 and IRF9 with mutations causing either gain or loss of function (Tangye
et al,
2020). Other pathway regulators have also been shown to have variants causing IEI such as JAK1, TYK2, IFNAR1, and IFNAR2 (Tangye
et al,
2020; Zhang
et al,
2020; Staels
et al,
2021). While IEI have been thought to be rare, common single‐nucleotide polymorphisms (SNPs) have also been found in TYK2 and IFNAR1 and are associated with improved protection from infection as well as increased risk of autoimmunity (Sugrue
et al,
2021). The prevalence of such SNPs may account for a wide range of IFN pathway responses among individuals (Su
et al,
2008; Choudhury
et al,
2014). It will be of interest to delineate how such variants affect the engagement of the described positive feedback loop. Furthermore, the feedback expression of IRF9 and STAT2 may be subject to epigenetic regulation, such that exposure history could modulate pathway regulation and lead to highly individual immune responses. As the inflammatory gene cluster discriminates between modest and high levels of ISGF3, we would expect that there is more variation in the IFN‐inflammatory response axis than the antiviral defense functions of IFN.
Our work suggests that ISGF3 dynamic control may be key to understanding the plethora of specific IFN response functions. It suggests strategies to studying these further, leveraging mathematical modeling to develop increasingly reliable simulations that may be a tool for evaluating the potential impact of individual genetic and epigenetic differences for a new generation of precision medicine.
Materials and Methods
Cell culture
MLE‐12 lung epithelial cells were obtained from ATCC (#CRL‐2110) and cultured in DMEM with 4.5 g/l glucose, l‐glutamine, and sodium pyruvate (Corning #10‐013‐CV) supplemented with 10% fetal bovine serum, 100 IU/ml penicillin, 100 μg/ml streptomycin (Corning #30‐002‐CI), and 2 mM l‐glutamine (Corning #25‐005‐CI) at 37°C at 5% CO2. For stimulations, MLE‐12 cells were incubated with 1 U/ml or 10 U/ml recombinant mouse IFN‐β (PBL Assay Science #12401‐1) and 100 ng/ml recombinant mouse IL‐28B/IFN‐λ3 (R&D Systems #1789‐ML‐025/CF). For select experiments, cells were also incubated with 10 μg/ml cycloheximide (Sigma‐Aldrich #C7698) or a vehicle (ethanol) control.
Protein expression analysis by immunoblotting
Protein from extracts was measured (BioRad DC™ Protein Assay Kit II #5000112), and equal amounts of protein were loaded into a 4–15% gradient polyacrylamide gel (BioRad TGX™ Precast Midi Protein Gel #5671085) and proteins separated based on molecular weight using electrophoresis at 110 V. Proteins were transferred onto a PVDF membrane (Fisher Scientific Immobilon®‐P IPVH00010) at 100 V for 1 h. The membrane was incubated for 1 h in 5% bovine serum albumin (BSA; Millipore Sigma #A3059) followed by overnight incubation at 4°C in a primary antibody solution. The following antibodies were used in this study: phospho‐STAT1 (pY701.4A) mouse monoclonal IgG2aĸ targeting to phosphorylated Tyr701 (Santa Cruz Biotechnology #sc‐136229), STAT1 p84/p91 (E‐23) rabbit polyclonal IgG with epitope matching near the C terminus (Santa Cruz Biotechnology #sc‐346), STAT1 p84/p91 rabbit polyclonal IgG antibody (Cell Signaling #9172), phospho‐STAT2 (Tyr689) rabbit polyclonal IgG antibody with epitope matching at phosphorylated Tyr689 (EMD Millipore #07–224), STAT2 (D9J7L) rabbit monoclonal IgG antibody with epitope corresponding near Leu706 (Cell Signaling #72604), IRF9, clone 6F1‐H5, mouse monoclonal IgG2aĸ antibody with epitope matching the N terminus (EMD Millipore #MABS1920), nuclear matrix protein p84 [EPR5662(2)] rabbit monoclonal antibody mapping with aa350‐450 (abcam #ab131268), and αTubulin (B‐7) mouse monoclonal IgG2aĸ antibody raised against epitope with aa149‐448 (Santa Cruz Biotechnology #sc‐5286). The membrane was washed in TBS‐T (0.5% Tween‐20) and incubated in a secondary antibody solution for 1 h at room temperature. The following horseradish peroxidase (HRP)‐conjugated secondary antibodies were used for this study: mouse anti‐rabbit IgG (L27A9) monoclonal antibody (Cell Signaling #5127), goat anti‐rabbit IgG polyclonal antibody (Cell Signaling #7074S), horse anti‐mouse IgG polyclonal antibody (Cell Signaling #7076), mouse anti‐rabbit IgG monoclonal antibody (Santa Cruz Biotechnology #sc2357), and bovine anti‐mouse IgG polyclonal antibody (Santa Cruz Biotechnology #sc2380). The protein bands were resolved by chemiluminescence (Thermo Fisher Scientific SuperSignal West Pico and Femto Chemiluminescent Substrate #34080 and 34095).
Electrophoretic mobility shift assay
The protocol was used as described previously (Hoffmann
et al,
2002; Werner
et al,
2005). In brief, nuclear fractions were collected by extraction using a Tris (250 mM Tris, 60 mM KCl, 1 mM EDTA) buffer solution. Equal amounts of nuclear protein were incubated for 30 min in a binding buffer (10 mM Tris‐Cl, 50 mM NaCl, 10% glycerol, 1% NP‐40, 1 mM EDTA, 0.1 mg/ml Poly(deoxyinosinic‐deoxycytidylic) acid sodium salt) with 0.01 pmol P‐32 radiolabeled oligo containing the ISRE consensus sequence (ISG15 loci; 5′‐GATCCTCGGGAAAGGGAAACCTAAACTGAAGCC‐3′; 5′‐GGCTTCCAGTTTAGGTTTCCCTTTCCCGAGGATC‐3′) or a NFY binding sequence (5′‐GATTTTTTCCTGATTGGTTAAA‐3′; 5′‐ACTTTTAACCAATCAGGAAAAA‐3′). Samples were loaded onto a 5% polyacrylamide gel (5% glycerol and TGE buffer [24.8 mM Tris base, 190 mM glycine, 1 mM EDTA]). Bands were resolved using electrophoresis at 200 V. Gels were dried and imaged using an Amersham Typhoon 5 laser scanner (GE Healthcare).
Gene expression analysis by RT–qPCR
RNA was collected from cells lysed in TRIzol Reagent (Fisher Scientific #15–596‐018) and isolated using the Direct‐zol RNA Extraction Kit (Zymo Research #R2052). Equal amounts of RNA were reverse transcribed into cDNA using the Iscript™ Reverse Transcription Supermix (BioRad #1708841). Equal volumes of diluted (1:8) cDNA were loaded for qPCR analysis along with the SsoAdvanced Universal SYBR Green Supermix (BioRad #1725272) and 0.5 μM forward and reverse primers for target genes (listed below). Amplification curve thresholds were determined and selected for each primer set.
mSTAT1 (5′‐GGCCTCTCATTGTCACCGAA‐3′; 5′‐TGAATGTGATGGCCCCTTCC‐3′);
mSTAT2 (5′‐GTAGAAACCCAGCCCTGCAT‐3′; 5′‐CTTGTTGCCCTTTCCTGCAC‐3′);
mIRF9 (5′‐TCTTTGTTCAGCGCCTTTGC‐3′; 5′‐CTGCTCCATCTGCACTGTGA‐3′);
mSOCS1 (5′‐CAACGGAACTGCTTCTTCGC‐3′; 5′‐AGCTCGAAAAGGCAGTCGAA‐3′);
mSOCS3 (5′‐CTTTTCTTTGCCACCCACGG‐3′; 5′‐CCGTTGACAGTCTTCCGACA‐3′);
mUSP18 (5′‐CTCACATGTTTGTTGGGTCACC‐3′; 5′‐TGAAATGCAGCAGACAAGGG‐3′);
mTNFSF10 (5′‐GATGAAGCAGCTGCAGGACAAT‐3′; 5′‐CTGCAAGCAGGGTCTGTTCAAG‐3′);
mCD274 (5′‐TCGCCTGCAGATAGTTCCCAAA‐3′; 5′‐GTAAACGCCCGTAGCAAGTGAC‐3′);
mCCRL2 (5′‐GAGCAAGGACAGCCTCCGAT‐3′; 5′‐CCACTGTTGTCCAGGTAGTCGT‐3′);
mGBP5 (5′‐TGCTGACATGAGCTTCTTCCCA‐3′; 5′‐TCATCGCTACCTTGCTTCAGCT‐3′);
mCGAS (5′‐TGGGCACAAAAGTGAGGACCAA‐3′; 5′‐CGCCAGGTCTCTCCTTGAAAACT‐3′);
mMYD88 (5′‐TCTCCAGGTGTCCAACAGAAGC‐3′; 5′‐TGCAAGGGTTGGTATAGTCGCA‐3′);
mTLR2 (5′‐GGGGCTTCACTTCTCTGCTT‐3′; 5′‐AGCATCCTCTGAGATTTGACG‐3′);
mIL6 (5′‐ACCAGAGGAAATTTTCAATAGGC‐3′; 5′‐TGATGCACTTGCAGAAAACA‐3′);
mGAPDH (5′‐AGCTTGTCATCAACGGGAAG‐3′; 5′‐TTTGATGTTAGTGGGGTCTCG‐3′).
RNA sequencing and data analysis
RNA was collected from cells lysed in TRIzol Reagent (Fisher Scientific #15–596‐018) and isolated using the Direct‐zol RNA Extraction Kit (Zymo Research #R2052). RNA libraries were generated using the KAPA Stranded mRNA‐Sequencing Kit with KAPA mRNA Capture Beads (Roche #07962207001). Samples were enriched for poly(A) + mRNA strands using mRNA Capture Beads. The captured mRNA was synthesized into complimentary DNA (cDNA). cDNA libraries were ligated using Illumina TruSeq single index adapters (Roche #KK8700), amplified, and quantified using a Qubit dsDNA Broad Range Assay Kit (Life Technologies #Q32850) and a Qubit 2.0 fluorometer. Sequencing was conducted on an Illumina HiSeq 2500 with single‐end 50 bp reads through the UCLA Broad Stem Cell Research Center. Sequencing reads were aligned and mapped onto the mm10 genome. The EdgeR package (Bioconductor 3.7) was used for differential gene expression analysis of raw read counts, which were normalized using trimmed mean of the mean (TMM) and filtered by the number of mapped reads and gene length (RPKM) of greater than 2. Normalized counts were applied to a negative binomial generalized linear model, which was used to calculate biological coefficients of variation. Pairwise dispersions between the normalized counts were calculated using the Cox‐Reid Likelihood profile method. The differential expression testing was initiated using the likelihood ratio test (LRT), a form of ANOVA testing. Using the TREAT Method and Benjamini–Hochberg multiple test correction, significantly induced genes were selected based on differences in RPKM values at each time point compared with values in unstimulated conditions using a false discovery rate (FDR) of less than 0.05 based on replicate data and fold change greater than 2. PCA plots were generated using the prcomp function in R and plotted with ggplot2 (Gómez‐Rubio,
2017). Correlation plots were plotted using ggplot2. Heatmaps were generated using the pheatmap package.
ChIP sequencing and data analysis
The protocol was used as described previously (Kishimoto
et al,
2021). In brief, cells were fixed with 2 mM disuccinimidyl glutarate for 30 min followed by a 10‐min incubation in 1% formaldehyde. Cells are lysed (Buffer 1: 50 mM HEPES‐KOH, 140 mM NaCl, 1 mM EDTA, 10% glycerol, 0.5% NP‐40, 0.25% Triton X‐100, and 1X protease inhibitor cocktail (Thermo Fisher Scientific); Buffer 2: 10 mM Tris–HCl, 200 mM NaCl, 1 mM EDTA, 0.5 mM EGTA, and 1X protease inhibitor cocktail; Buffer 3: 10 mM Tris–HCl, 100 mM NaCl, 1 mM EDTA, 0.5 mM EGTA, 0.1% Na Deoxycholate, 0.5% N‐lauroylsarcosine, and 1X protease inhibitor cocktail) and sonicated to shear the chromatin. The DNA‐bound complexes were incubated with STAT1 p84/p91 rabbit polyclonal IgG antibody (Cell Signaling #9172) overnight at 4°C. STAT1 was immunoprecipitated by incubating for 5 h at 4°C with Protein‐G DynaBeads (Invitrogen #10004D). DNA bound to STAT1 complexes was extracted using 1% SDS and 0.6 mg/ml of Proteinase K (New England Biolabs #P81075) at 60°C overnight, followed by purification with Agencourt AMPure XP solid‐phase reversible immobilization (SPRI) beads (Beckman Coulter #A63881). DNA was quantified using a Qubit dsDNA High Sensitivity Assay Kit (Life Technologies #Q32851) and a Qubit 2.0 fluorometer. Libraries were prepared using NEBNext Ultra II DNA Library Prep Kit NEBNext Multiplex Oligos (New England Biolabs) according to the manufacturer instructions and sequenced with a length of 50 bp on an Illumina HiSeq 2500 at the UCLA Broad Stem Cell Research Center. Sequencing data were aligned to mm10 genome as described previously (Kishimoto
et al,
2021). MACS2 version 2.1.0 was used to identify peaks for each sample using pooled input samples as control with FDR < 0.01 and extension size of average fragment length (Zhang
et al,
2008). STAT1 motifs were identified by searching the HOMER database for position‐weighted matrix files of STAT/GAS and ISRE/IRF motifs (Heinz
et al,
2010). ISRE/IRF motifs included motifs for T1ISRE, ISRE, IRF2, IRF1, IRF3, and IRF8. STAT/GAS motifs included motifs for STAT1, STAT3 + IL‐21, STAT3, STAT4, and STAT5. A variant GAS motif in the promoter of Irf1 (GATTTCCCCGAATG) known to be bound by STAT1 was also included in the search as a GAS motif (Decker
et al,
1991). Peak‐to‐gene distances were calculated with bedtools, using the TSS of the longest isoform (Quinlan & Hall,
2010).
Gene ontology analysis
The maximum RPKM value and gene names for the selected 188 genes in cluster 1 were analyzed using the Panther Classification System Database DOI:
https://doi.org/10.5281/zenodo.4495804 Released 2021‐02‐01 (Mi
et al,
2019). The genes were mapped to the mm10 genome and analyzed using the statistical overrepresentation test, which determines overrepresentation of gene ontology annotations in cluster 1 compared with all expressed genes using the Fisher's exact test and FDR correction. A list of GO biological process terms and IDs as well as associated genes was generated. Each GO term was categorized into 10 groups based on function. To determine whether a function was enriched in a gene cluster the fraction of GO terms for each biological function compared with the total number of terms for the cluster was calculated and plotted as a pie graph. Similar analysis was conducted on the 157 genes in cluster 2.
Lentiviral cloning, transfection, and infection
The cloning strategy for the lentiCRISPRv2 plasmid was used (Sanjana
et al,
2014; Shalem
et al,
2014). Briefly, the following oligos were annealed and cloned into a BsmBI (Thermo Fisher Scientific #FD0454)‐digested lentiCRISPRv2 vector: STAT2 promoter mutant (5′‐CACCGAGGGAAAGGAAACTGAAACC‐3′; 5′‐AAACGGTTTCAGTTTCCTTTCCCTC‐3′); IRF9 promoter mutant (5′‐CACCGACTCAGACCACGTGGTTTCT‐3′; 5′‐AAACAGAAACCACGTGGTCTGAGTC‐3′). Plasmids were transformed into One Shot™ Stbl3™ Chemically Competent bacteria (Thermo Fisher Scientific #C737303). Single colonies were selected and inoculated in LB Broth. Plasmids were isolated from expanded bacterial colonies using the ZymoPURE™ II Plasmid Kit (Zymo Research #D4203). Ten μg of isolated plasmid was transfected into HEK293T cells using Lipofectamine® 2000 Transfection Reagent (Invitrogen #11668–019). Two days following transfection, viral particles were isolated from the HEK293T‐conditioned media. MLE‐12 lung epithelial cells were infected with the purified viral particles for 24 h, followed by 2 μg/ml puromycin antibiotic selection.
Sanger and amplicon sequencing
Sequence mutations in the IRF9 and STAT2 promoter mutant cells were confirmed using Sanger sequencing. In brief, 50 ng of genomic DNA (gDNA) was amplified using PCR with 0.5 μM of forward and reverse primers: STAT2 promoter mutant (5′‐CCTATCCTAAGGGACGCAGG‐3′; 5′‐GCCCAACTAAAGTCTTAGCC‐3′); IRF9 promoter mutant (5′‐CAAGGTGCTACTGCTGACTG‐3′; 5′‐TTTCTGATCTCTCCTCCCC‐3′). PCR amplicons were resolved on 2% agarose gels and DNA extracted using the QIAquick Gel Extraction Kit (Qiagen #28706). Ten ng amplicons with 5 μM primers were submitted for sanger sequencing to Laragen, Inc. To quantify the number of sequence variants in the IRF9 and STAT2 promoter mutant cells, PCR amplicons (500 ng) were submitted for Amplicon‐EZ sequencing to Genewiz®.
Mathematical modeling of interferon‐induced gene expression
An ordinary differential equation (ODE) mathematical model was built to simulate RNA abundance levels as a function of transcription factor binding and RNA degradation. Briefly, the median RPKM values for all genes within each cluster from the RNA sequencing data were calculated for each time point. The temporal gene expression trajectory of the median values was generated using a modified Akima interpolant fit (MATLAB). The model was composed of four parameters (e.g., kact, activation rate; kdeg, mRNA degradation rate, Ka, association constant, and n, Hill coefficient). The model was parameterized by model fitting to the gene expression trajectories normalized to max peak. For optimization, the simplex search method (e.g., fminsearch) was implemented using an objective function of the sum of RMSD values calculated at each time point. Fifty multiple random seeds for the initial parameter values were used to optimize over a larger parameter space. The parameter optimization was run 5 times, and the top 10 parameter sets with the lowest RMSD values were averaged and used as the optimized model parameter values. Heatmaps and line graphs were plotted using MATLAB.
Mathematical modeling of the interferon signaling network
We used the previously published model of interferon signaling (Kok
et al,
2020), with minor modifications. We replaced the equation for the active receptor complex formation by
instead of
to be able to simulate the model in presence of cycloheximide which blocks protein synthesis, leading to SOCS protein decay, ultimately converging to 0. To better fit our data, we moved the delay term prior to mRNA synthesis instead prior to protein synthesis.
Parameterization
There are 41 molecular species in the mathematical model (Table
EV1). The model parameters (Table
EV2) were fit to the data, optimizing parameter value as well as which parameter to change iteratively 3 by 3 in order to adjust the minimal number of parameters allowing for a good fit. Parameters were selected at each iteration by the algorithm based on optimizing the best fit. For comparison, a particle swarm optimization algorithm (740 particles) was used to fit all parameters concurrently.
Pseudocode
1.
Initialize set of indices I−1 = {}
2.
For k ranging from 0 to …
b.
For run ranging from 1 to 20
i.
Optimize the cost function for θ = (i
3k + 1, i
3k + 2, i
3k + 3, v
1, …, v
3k + 3) using particle swarm optimization with a number of particles equals to 10× the size of θ (= 10 × (3 k + 6)).
•
i3k + 1, i3k + 2, i3k + 3 correspond to the indices of additional parameters to change,
•
v1, …, v3k corresponds to the values for previously chosen parameters, that is, with indices from Ik‐1,
•
v3k + 1, v3k + 2, v3k + 3 corresponds to the values of parameters i3k + 1, i3k + 2, i3k + 3.
ii.
If cost
run < Cost
k:
•
Ik = Ik‐1 ∪ {i3k + 1, i3k + 2, i3k + 3}
c.
If Cost
k > Cost
k‐1
i.
Run step b. with more particles number: 20× size of θ.
The cost function was the sum of the mean square error of each individual experiment, as follows:
with
and with
ysim corresponding to simulated outputs for a specific parameter set.
Implementation
The modeling and parameter optimization were done in MATLAB R2018a using the particleswarm function implemented in MATLAB's optimization toolbox.