The full-length genome sequence of HNJZ-S1
Fragments of the HNJZ-S1 genome were amplified using specific primers along with 3’ and 5’ RACE kits. The full-length genome sequence was 11689 bp in length with base composition 19.57% T, 26.16% C, 28.08% A and 26.19% G. The genomic composition and the size of the ORFs were basically consistent with GETV strains collected in GenBank. The genome included a 5’ untranslated region (UTR), a 3’ UTR and two ORFs in between. Seventy-eight nucleotides at the 5’ end and 401 nucleotides at the 3’ end were UTRs; the 7407 nucleotides following the 5’ UTR encoded four non-structural proteins (NSP1, NSP2, NSP3 and NSP4), and the 3759 nucleotides before the 3’ UTR encoded five structural proteins (C, E3, E2, 6K and E1). The 44 nucleotides in between the two ORFs were non-coding connecting regions (Table 1 ). The whole sequence was uploaded into the GenBank with accession number KY3638.