3.1 Genome assembly
The genome size of C. japonica was estimated to be 504 Mb by flow
cytometry, which is consistent with the estimate from k-mer analysis
(k = 35) of Illumina short reads (Figure 2a). The heterozygosity is
0.47%.
With 169.37 Gb of PacBio long reads, we assembled a genome of
584,506,556 bp with a contig N50 of 12 Mb. A total of 771 contigs were
obtained, the longest being 24,253,087 bp.
BUSCO analysis with the insecta_odb10 dataset showed that 95.6% of the
BUSCO genes were complete, suggesting that the assembled genome is of
high quality and suitable for further analysis (Table 1). A total of
97.15% of the assembled genome sequence was anchored to 31 chromosomes
by Hi-C scaffolding, with a scaffold N50 of 20,239,873 bp (Figure 2b).
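
For readers unfamiliar with the N50 statistic reported above, the following is a minimal sketch of how it is commonly computed: sequences are sorted from longest to shortest, and N50 is the length of the sequence at which the running total first reaches half of the total assembly length. The function name `n50` and the toy lengths are illustrative assumptions; they are not taken from the assembly pipeline or data used in this study.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths (in bp).

    N50 is the length L such that sequences of length >= L together
    cover at least half of the total assembly length.
    """
    if not lengths:
        raise ValueError("lengths must not be empty")
    ordered = sorted(lengths, reverse=True)   # longest first
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Hypothetical toy contig lengths, not values from this assembly.
toy_contigs = [80, 70, 50, 40, 30, 20, 10]
print(n50(toy_contigs))  # total = 300, half = 150; 80 + 70 reaches 150, so N50 = 70
```

Applied to the full list of contig or scaffold lengths of an assembly (for example, parsed from a FASTA index), the same function would give the contig or scaffold N50, respectively.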