file name prefix is x. The following are a preferred set of file
name conventions. Having standard file names makes the man-
agement of files easier.x.fastq.gz: zipped raw data x_trim3.fastq: after removing the 3^0
end adapter sequencesx_trim3_nodup.fastq: after removing PCR
duplicatesx_trim3_nodup_bcnn.fastq: split libraries with barcode
nn, such as 01, 02...x_trim_nodup_bcnn.fastq: after removing 5^0PARIS data (fastq)Chimerc.out.sam
Chimeric.out.junctionDG, XG, and NG groupsPARIS data (fastq)Aligned.out.sam
(non-gapped and gapped)Gapped primary readsRemove adapters
Remove PCR duplicates
Split librariesSTAR map
to genomesRemove secondary
alignments and non-
gapped reads
Remove spliced reads
and define read groupsInter-molecular groups Intra-molecular groups (sam
file and read group)IGV visualizationPhylogenetic analysis:
alignment basedRNA-RNA interactionsChimerc.out.sam
Chimeric.out.junctionSTAR map to curated
RNA references
Do not use normal
aligned readsVisualize interactionsAlternative structuresPhylogenetic analysis:
direct comparisonFig. 4The PARIS analysis pipeline, modified from the PARIS paper [21]. Major analyses outlined here include
DG and NG assembly, visualization of RNA structure data and models in IGV, two approaches of phylogenetic
analysis, analysis of alternative structures, identification and visualization of RNA–RNA interactions
74 Zhipeng Lu et al.