Section 4 Case 2

We illustrate how to perform integrative analysis combining genetic interactions and gene expression data; such strategy is computationally promising given the current technological limits in experimentally generating tissue-specific interactions on a genome scale, particularly for humans. One way doing so is to trim the generic genetic interactions to node genes expressed in a specific tissue. Such trimming allows identification of a genetic interaction network in the whole blood (Step 2), from which a subnetwork with a desired number of interconnected genes is further identified that tend to be highly expressed (Step 3). We also demonstrate how to highlight the subnetwork within the parent network (Step 4). For the interpretation of the subnetwork identified, we illustrate how to perform phenotype enrichment analysis using mammalian phenotype ontology, a tree-like structure containing well-defined terms that are used to annotate mouse knock-out phenotypes (Step 6).

Step 1: Load the packages and import human genetic interaction data as well as gene expression data (see Materials).

Step 2. Identify a genetic interaction network in the whole blood (FIGURE 4.1).

Network visualisation of genetic interactions containing 756 nodes/genes (expressed in the human whole blood) and 725 edges (notably, not all interconnected). Nodes sized by degree (i.e. the number of interacting neighbors).

FIGURE 4.1: Network visualisation of genetic interactions containing 756 nodes/genes (expressed in the human whole blood) and 725 edges (notably, not all interconnected). Nodes sized by degree (i.e. the number of interacting neighbors).

Step 3: Further identify a subnetwork with highly expressed genes (FIGURE 4.2).

Illustration of a subnetwork identified from the parent network, ensuring the subnetwork has a desired number (here ~30) of interconnected nodes/genes that tend to be highly expressed in the whole blood. Nodes colored by the median expression level, that is, transcripts per million (TPM).

FIGURE 4.2: Illustration of a subnetwork identified from the parent network, ensuring the subnetwork has a desired number (here ~30) of interconnected nodes/genes that tend to be highly expressed in the whole blood. Nodes colored by the median expression level, that is, transcripts per million (TPM).

Step 4. Highlight the subnetwork within the parent network.

The parent network highlighted with the subnetwork. The layout (node coordinates) preserved.

FIGURE 4.3: The parent network highlighted with the subnetwork. The layout (node coordinates) preserved.

Step 5. Display expression levels for genes in the subnetwork (FIGURE 4.4).

Boxplot of genes in the subnetwork, showing expression distribution across the whole blood samples.

FIGURE 4.4: Boxplot of genes in the subnetwork, showing expression distribution across the whole blood samples.

Step 6. Perform phenotype enrichment analysis for genes in the subnetwork (FIGURE 4.5).

Circular illustration of mouse phenotypes enriched in subnetwork genes. Based on mammalian phenotype ontology used for annotating mouse knockout genes.

FIGURE 4.5: Circular illustration of mouse phenotypes enriched in subnetwork genes. Based on mammalian phenotype ontology used for annotating mouse knockout genes.