# download sequences and coordinates of grape and peach
python -m jcvi.apps.fetch phytozome Vvinifera,Ppersica
# convert the GFF to BED file
python -m jcvi.formats.gff bed --type=mRNA --key=ID --primary_only Vvinifera_145_Genoscope.12X.gene.gff3.gz -o grape.bed
python -m jcvi.formats.gff bed --type=mRNA --key=ID --primary_only Ppersica_298_v2.1.gene.gff3.gz -o peach.bed
# clean headers to remove description fields from Phytozome FASTA files
python -m jcvi.formats.fasta format Vvinifera_145_Genoscope.12X.cds.fa.gz grape.cds
python -m jcvi.formats.fasta format Ppersica_298_v2.1.cds.fa.gz peach.cds
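Look closely and you will also notice that one of the three syntenic regions often shows a stronger signal than the others; it corresponds to the orthologous region between the two genomes. What if we only want these 1:1 orthologous regions? We simply rerun the earlier comparison with the option --cscore=.99. The C-score of a hit is the ratio of its LAST score to the best score of any hit involving either the query or the subject gene. A C-score cutoff of 0.99 therefore filters the LAST hits down to reciprocal best hits (RBH).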
# remove old file
rm grape.peach.last.filtered grape.peach.anchors
python -m jcvi.compara.catalog ortholog grape peach --cscore=.99
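To make the effect of the C-score cutoff concrete, here is a minimal Python sketch. It is not JCVI's actual implementation: the function name `filter_by_cscore`, the `(query, subject, score)` tuple layout, the gene names, and the scores are all hypothetical, and it assumes the C-score is a hit's score divided by the larger of the best scores seen for its query and its subject.

```python
from collections import defaultdict


def filter_by_cscore(hits, cutoff=0.99):
    """Keep hits whose C-score is at or above the cutoff.

    hits: list of (query, subject, score) tuples (hypothetical layout).
    Assumed definition: C-score = score / max(best score of the query,
    best score of the subject).
    """
    best_query = defaultdict(float)
    best_subject = defaultdict(float)
    # First pass: record the best score seen for every query and subject.
    for query, subject, score in hits:
        best_query[query] = max(best_query[query], score)
        best_subject[subject] = max(best_subject[subject], score)

    # Second pass: keep only hits that are (nearly) the best for both genes.
    kept = []
    for query, subject, score in hits:
        cscore = score / max(best_query[query], best_subject[subject])
        if cscore >= cutoff:
            kept.append((query, subject, score))
    return kept


# Made-up hits between two grape genes and two peach genes.
hits = [
    ("grapeA", "peachA", 500.0),
    ("grapeA", "peachB", 300.0),
    ("grapeB", "peachA", 200.0),
    ("grapeB", "peachB", 450.0),
]
rbh = filter_by_cscore(hits)
for query, subject, score in rbh:
    print(query, subject, score)
```

With a cutoff close to 1, a hit survives only when it is the top-scoring hit for both of its genes, which is why the result behaves like a reciprocal best hit set.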
g*GSVIVT01012028001 GSVIVT01000604001 ppa011886m ppa008534m 392 +
GSVIVT01010441001 GSVIVT01000970001 ppa022891m ppa001358m 115 -
...
# g: green; r: red; etc.
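The block lines above can be read programmatically. The following Python sketch splits one line into its optional color prefix and six columns. The function name `parse_simple_line` and the dictionary keys are hypothetical, and the sketch assumes that the color code is everything before the first `*` and that the columns are whitespace-separated; it does not reproduce JCVI's own parser.

```python
def parse_simple_line(line):
    """Split one block line into an optional color code and its six columns."""
    color = None
    if "*" in line:
        # Assumption: the color code is whatever precedes the first "*".
        color, line = line.split("*", 1)
    a_start, a_end, b_start, b_end, score, orientation = line.split()
    return {
        "color": color,
        "a_start": a_start,
        "a_end": a_end,
        "b_start": b_start,
        "b_end": b_end,
        "score": int(score),
        "orientation": orientation,
    }


colored = parse_simple_line(
    "g*GSVIVT01012028001 GSVIVT01000604001 ppa011886m ppa008534m 392 +"
)
plain = parse_simple_line(
    "GSVIVT01010441001 GSVIVT01000970001 ppa022891m ppa001358m 115 -"
)
print(colored["color"], colored["score"], colored["orientation"])
print(plain["color"], plain["score"], plain["orientation"])
```

Going the other way, highlighting a block amounts to prepending a color code and `*` to the start of its line.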
Multiple species
layout
# y, xstart, xend, rotation, color, label, va, bed
.7, .2, .4, 45, , Grape, top, grape.bed
.5, .2, .8, 0, , Peach, top, peach.bed
.7, .4, .8, -45, , Cacao, bottom, cacao.bed
# edges
e, 0, 1, grape.peach.anchors.simple
e, 1, 2, peach.cacao.anchors.simple
# y, xstart, xend, rotation, color, label, va, bed
.5, .025, .625, 60, , Grape, top, grape.bed
.2, .2, .8, 0, , Peach, top, peach.bed
.5, .375, .975, -60, , Cacao, top, cacao.bed
# edges
e, 0, 1, grape.peach.anchors.simple
e, 1, 2, peach.cacao.anchors.simple
e, 0, 2, grape.cacao.anchors.simple
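A layout file like the ones above is easy to get wrong by hand, for example with an edge that points at a track that does not exist. The Python sketch below parses a layout into tracks and edges and checks the edge indices. The function name `parse_layout` and the dictionary keys are hypothetical; the column order is taken from the header comment in the layout, and edge indices are assumed to be 0-based positions in the track list.

```python
def parse_layout(text):
    """Parse layout text into a list of track dicts and a list of edges."""
    tracks, edges = [], []
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        fields = [f.strip() for f in line.split(",")]
        if fields[0] == "e":
            # Edge line: e, track index, track index, anchors file
            _, i, j, anchors = fields
            edges.append((int(i), int(j), anchors))
        else:
            y, xstart, xend, rotation, color, label, va, bed = fields
            tracks.append({
                "y": float(y),
                "xstart": float(xstart),
                "xend": float(xend),
                "rotation": float(rotation),
                "color": color or None,  # an empty field means no color given
                "label": label,
                "va": va,
                "bed": bed,
            })
    # Sanity check: every edge must refer to an existing track.
    for i, j, anchors in edges:
        if not (0 <= i < len(tracks) and 0 <= j < len(tracks)):
            raise ValueError(f"edge {anchors} refers to a missing track")
    return tracks, edges


layout_text = """\
# y, xstart, xend, rotation, color, label, va, bed
.5, .025, .625, 60, , Grape, top, grape.bed
.2, .2, .8, 0, , Peach, top, peach.bed
.5, .375, .975, -60, , Cacao, top, cacao.bed
# edges
e, 0, 1, grape.peach.anchors.simple
e, 1, 2, peach.cacao.anchors.simple
e, 0, 2, grape.cacao.anchors.simple
"""
tracks, edges = parse_layout(layout_text)
for i, j, anchors in edges:
    print(tracks[i]["label"], "-", tracks[j]["label"], ":", anchors)
```

Each edge line links two tracks by their position in the track list, so the third edge (0 and 2) is what connects Grape directly to Cacao in the triangular layout.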