Science - USA (2022-04-08)

(Maropa) #1

and genes where this localized mutation pat-
tern occurred.
Finally, we explored whether characteriz-
ing mutation events around tissue-specific genes
could offer insights into tumor biology. We
hypothesized that these events might be con-
nected to the cell of origin from which a tumor
developed, given that these genes exhibited (i)
tissue-specific expression (Fig. 4D), (ii) lower
expression in tumor cells than in normal cells
(Fig. 4, E and F, and fig. S36, D to F), and (iii)
physiological roles in their respective tissues
(fig. S26, B and C). Consistent with this hy-
pothesis, many tissue-specific genes were het-
erogeneously expressed in single-cell data
from normal tissues (fig. S37, A and B), par-
ticularly those harboring mutation events (fig.
S37, C and D). For instance, in single-cell ex-
pression data for liver ( 31 ), most tissue-specific
genes with mutation events were differentially
expressed (87.5%; 35/40) between cells from


different histological zones (Fig. 4, G to I, and
fig. S38, A to D) compared with 15.5% for ar-
bitrary genes expressed in the liver (P< 0.001,
Fisher’s exact test). Similarly, in single-cell ex-
pression data for kidney ( 32 ), all tissue-specific
genes with mutation events were expressed
in a specific cell type (proximal tubule cells,
100%; 5/5) (fig. S38, E and F) compared with
26.4% for arbitrary, heterogeneously expressed
genes (P= 0.001, Fisher’s exact test). Likewise,
papillary and clear-cell kidney tumors, which
originate from proximal tubule cells, carried
mutations around tissue-specific genes more
frequently than chromophobe kidney tumors
that originate from collecting-duct epithelial
cells ( 33 ) (60.9 versus 14.0%;P< 0.001, Fisher’s
exact test) (fig. S38G).
Our analyses thus established a general, recip-
rocal link among a localized mutation pattern
in tumor genomes, tissue-specific expression
in bulk expression data, and heterogeneous

expression in single-cell data of the related
normal tissue. Therefore, the localized muta-
tion pattern around tissue-specific genes may
reflect a potential imprint of the characteristic
expression program of the cell type from which
a tumor originated (Fig. 4, G to I, and figs. S37
and S38), which could be of use in diagnostics.

Evaluation of mutation events in promoter and
enhancer regions
We next used the following analyses to further
assess the noncoding mutation events in reg-
ulatory promoter and enhancer regions.

Transcription factor binding sites
We used a permutation test to identify recur-
rent mutations that changed transcription
factor binding motifs in the JASPAR database
( 34 ) (see the materials and methods). This
test revealed that mutations changed binding
motifs in 15.1% (11/73) of our findings in the

Dietleinet al.,Science 376 , eabg5601 (2022) 8 April 2022 6 of 12


0

50

(^100) 1kb
chr5 1.1 TERT 1.4 1.5
Bladder 1kb
chr5 1.1 TERT 1.4 1.5
Kidney 1kb
chr5 1.1 TERT 1.4 1.5
Liver 1kb
chr5 1.1 TERT 1.4 1.5
Thyroid
0
50
(^100) 1kb
chr17 MIR21 57.95 58.0
Gastric 1kb
chr17 MIR21 57.95 58.0
Lung 1kb
RAD51B 69.369.55
Bladder 1kb
RAD51B 69.369.55
Breast
0
50
(^100) 1kb
chr21 BTG3 19.0 19.05
Bladder 1kb
chr22 XBP1 29.2 29.21 29.22
Breast 1kb
chr6BACH2 91.191.2
Leukemia 1kb
chr15 B2M 45.0245.0345.04
Lung
0
50
(^100) 1kb
chr14 CDCA4 105.5 105.52
Bladder 1kb
chr9 RMRP 35.659 35.66
Breast 1kb
chr3 187.4 BCL6 187.475
Leukemia 1kb
chr10KLF6 3.84 3.85 3.86
Lung
0
50
(^100) 1kb
chr1PIK3C2B 204.5 204.6
Bladder 1kb
chr19 MED16 0.95 1.0
Liver 1kb
chr2 136.86CXCR4 136.88
Leukemia 1kb
chr9 SNAPC3 15.5 15.6 15.7
Lung
0
50
(^100) 1kb
chr12 MGP 15.05
Breast 1kb
chr4 ALB 74.3 74.35
Liver 1kb
chr4 ADH1B 100.275100.3
Liver 1kb
chr3 148.45CPB1 148.6 148.7
Pancreas
0
50
(^100) 1kb
chr10 LIPF 90.45 90.5
Gastric 1kb
chr10 CYP2C8 96.9 96.95
Liver 1kb
chr18 MIR12256.12 56.125 56.13
Liver 1kb
chr19 51.34 KLK3 51.37 51.38
Prostate
0
50
(^100) 1kb
chr2 SFTPB 85.985.9185.92
Lung 1kb
chr7 CYP3A5 99.3 99.4
Liver 1kb
chr3 133.4TF 133.55133.6133.65
Liver 1kb
chr8 133.0 TG 135.0
Thyroid
noncoding mutation events in promoter and enhancer regions
noncoding mutation events around tissue-specific genes
position around significantly mutated genomic region (Mbp)
mutation rate (% max)
mutation rate (% max)
A
B
Fig. 3. Categories of mutation events exhibit different mutation patterns.Positional clustering of mutations (y-axis, percentage of maximum) plotted
against genomic positions (x-axis) around mutation events that fall into regulatory regions [(A), orange] or overlap with tissue-specific genes [(B), teal]. Genomic
boundaries of the closest gene are marked at the bottom of each plot, and white arrowheads mark the direction of its transcription.
RESEARCH | RESEARCH ARTICLE

Free download pdf