How can reference sequence files be constructed for uncommon species?_Q&A_MobiVision_Software_Support_

Constructing a reference genome sequence file for particularly uncommon species can be challenging due to the lack of available reference genomes or genome annotation data. Here are some potentially useful methods:

1. Based on Known Related Species: Youcan use the genome sequence of a closely related species, utilizing genomesequence alignment and assembly techniques to construct the referencegenome sequence for the target species. This method requires sufficientsimilarity in genome sequences and adequate alignment depth.

2. RNA-Seq Data Assembly: If RNA-Seqdata for the target species is available, it can be used to assembletranscriptome sequences, which can then be used with alignment tools likeBLAST and STAR for genome assembly and annotation. This method is suitablewhen a full genome sequence is not required.

3. Subgenome Annotation: If no genomedata is available, subgenome annotation methods can be considered. Thisinvolves using alignment tools like BLAST and HMMER to map the genomeannotation information from known species to the target species' genome,inferring the composition and structure of the target species' genome.

4. Third-Generation Sequencing-Assisted Assembly: Consider using third-generation sequencing technologies, suchas Oxford Nanopore and PacBio SMRT, which produce long reads that aid inbetter genome assembly and annotation.

For species with available genome FA files and genome annotation GTF or GFF files, the mk_vdj_ref command can be used for construction. For specific construction methods, please refer to the corresponding page link.

Support Center