Acknowledgements
The authors wish to thank a number of individuals who contributed to
this study. Laboratory assistance was provided by Clare O’Connell,
Michael Kiso, Jack Lemke and Evan Miller. Nancy Rotzel and Katie Murphy
are thanked for performing the sequencing runs at the Center for
Conservation Genomics, Smithsonian Conservation Biology Institute, and
the Laboratory of Analytical Biology, National Museum of Natural
History, Smithsonian Institution, respectively. Several museums allowed
for destructive sampling of specimens; the Museum of Vertebrate Zoology,
University of California Berkeley, Chris Conroy, Jim Patton, Eileen
Lacey and Michael Nachman, Los Angeles County Museum, Jim Dines and
Kayce Bell, and the Humboldt State University Vertebrate Museum, Alyssa
Semerdjian, Nick Kerhoulas and Allison Bronson, and the University of
Michigan Museum of Zoology, Cody Thompson. We also express gratitude to
Beatrice Hahn and Jesse Connell for their assistance implementing the
CHIIMP pipeline. This study was funded by MTRH’s discretionary funds, as
well as a Grants-in-Aid award from the American Society of Mammalogists,
Sigma Xi Grants-in-Aid of Research award (G201903158734905) and the
Humboldt State University Department of Biology Master’s Student Grant.
References:
Bailey, C. A., McLain, A. T., Paquette, S. R., McGuire, S. M., Shore, G.
D., & Lei, R. (2015). Evaluating the genetic diversity of three
endangered lemur species (Genus: Propithecus) from northern Madagascar.Journal of Primatology , 5 , 132.
Bandelt, H. J., Forster, P., & Röhl, A. (1999). Median-joining networks
for inferring intraspecific phylogenies. Molecular Biology and
Evolution , 16 (1), 37–48. doi:
10.1093/oxfordjournals.molbev.a026036
Barbian, H. J., Connell, A. J., Avitto, A. N., Russell, R. M., Smith, A.
G., Gundlapally, M. S., … Wroblewski, E. E. (2018). CHIIMP: An
automated high-throughput microsatellite genotyping platform reveals
greater allelic diversity in wild chimpanzees. Ecology and
Evolution , 8 (16), 7946–7963.
Bilska, K., & Szczecińska, M. (2016). Comparison of the effectiveness
of ISJ and SSR markers and detection of outlier loci in conservation
genetics of Pulsatilla patens populations. PeerJ , 4 ,
e2504.
Blagoderov, V., Kitching, I. J., Livermore, L., Simonsen, T. J., &
Smith, V. S. (2012). No specimen left behind: industrial scale
digitization of natural history collections. ZooKeys , (209), 133.
Boleda, M. D., Briones, P., Farres, J., Tyfield, L., & Pi, R. (1996).
Experimental design: a useful tool for PCR optimization.BioTechniques , 21 (7), 134–140.
Campana, M. G., Lister, D. L., Whitten, C. M., Edwards, C. J., Stock,
F., Barker, G., & Bower, M. A. (2012). Complex relationships between
mitochondrial and nuclear DNA preservation in historical DNA extracts.Archaeometry , 54 (1), 193–202.
Crawford, A. M., Kappes, S. M., Paterson, K. A., deGotari, M. J., Dodds,
K. G., Freking, B. A., … Beattie, C. W. (1998). Microsatellite
evolution: testing the ascertainment bias hypothesis. Journal of
Molecular Evolution , 46 (2), 256–260.
Darby, B. J., Erickson, S. F., Hervey, S. D., & Ellis-Felege, S. N.
(2016). Digital fragment analysis of short tandem repeats by
high-throughput amplicon sequencing. Ecology and Evolution ,6 (13), 4502–4512.
De Barba, M., Miquel, C., Lobréaux, S., Quenette, P. Y., Swenson, J. E.,
& Taberlet, P. (2017). High-throughput microsatellite genotyping in
ecology: improved accuracy, efficiency, standardization and success with
low-quantity and degraded DNA. Molecular Ecology Resources ,17 (3), 492–507.
Demboski, J. R., Jacobsen, B. K., & Cook, J. A. (1998). Implications of
cytochrome b sequence variation for biogeography and conservation of the
northern flying squirrels (Glaucomys sabrinus) of the Alexander
Archipelago, Alaska. Canadian Journal of Zoology , 76 (9),
1771–1777.
den Tex, R.-J., Maldonado, J. E., Thorington, R., & Leonard, J. A.
(2010). Nuclear copies of mitochondrial genes: another problem for
ancient DNA. Genetica , 138 (9–10), 979–984. doi:
10.1007/s10709-010-9481-9
Duan, C., Li, D., Sun, S., Wang, X., & Zhu, Z. (2014). Rapid
development of microsatellite markers for Callosobruchus chinensis using
Illumina paired-end sequencing. PloS One , 9 (5).
Ellis, J. S., Gilbey, J., Armstrong, A., Balstad, T., Cauwelier, E.,
Cherbonnel, C., … Crozier, W. (2011). Microsatellite
standardization and evaluation of genotyping error in a large
multi-partner research programme for conservation of Atlantic salmon
(Salmo salar L.). Genetica , 139 (3), 353–367.
Estoup, A., Jarne, P., & Cornuet, J.-M. (2002). Homoplasy and mutation
model at microsatellite loci and their consequences for population
genetics analysis. Molecular Ecology , 11 (9), 1591–1604.
Fisher, P. J., Gardner, R. C., & Richardson, T. E. (1996). Single locus
microsatellites isolated using 5′ anchored PCR. Nucleic Acids
Research , 24 (21), 4369–4371.
Glenn, T. C., Nilsen, R. A., Kieran, T. J., Sanders, J. G.,
Bayona-Vásquez, N. J., Finger, J. W., … Louha, S. (2019).
Adapterama I: universal stubs and primers for 384 unique dual-indexed or
147,456 combinatorially-indexed Illumina libraries (iTru & iNext).PeerJ , 7 , e7755.
Glenn, T. C., & Schable, N. A. (2005). Isolating microsatellite DNA
loci. In Methods in enzymology (Vol. 395, pp. 202–222).
Elsevier.
Griffiths, S. M., Fox, G., Briggs, P. J., Donaldson, I. J., Hood, S.,
Richardson, P., … Preziosi, R. F. (2016). A Galaxy-based
bioinformatics pipeline for optimised, streamlined microsatellite
development from Illumina next-generation sequencing data.Conservation Genetics Resources , 8 (4), 481–486.
Grimaldi, M.-C., & Crouau-Roy, B. (1997). Microsatellite allelic
homoplasy due to variable flanking sequences. Journal of Molecular
Evolution , 44 (3), 336–340.
Haberl, M., & Tautz, D. (1999). Comparative allele sizing can produce
inaccurate allele size differences for microsatellites. Molecular
Ecology , 8 (8), 1347–1349.
Hawkins, M. T., Hofman, C. A., Callicrate, T., McDonough, M. M.,
Tsuchiya, M. T., Gutiérrez, E. E., … Maldonado, J. E. (2016).
In-solution hybridization for mammalian mitogenome enrichment: Pros,
cons and challenges associated with multiplexing degraded DNA.Molecular Ecology Resources , 16 (5), 1173–1188.
Hawkins, M. T., Leonard, J. A., Helgen, K. M., McDonough, M. M.,
Rockwood, L. L., & Maldonado, J. E. (2016). Evolutionary history of
endemic Sulawesi squirrels constructed from UCEs and mitogenomes
sequenced from museum specimens. BMC Evolutionary Biology ,16 (1), 80.
Hofreiter, M., Serre, D., Poinar, H., Kuch, M., & Paabo, S. (2001).
Ancient DNA. Nature Reviews Genetics , 2 , 353–359.
Irwin, D. M., & Kocher, T. D. (1991). Evolution of the cytochromeb gene
of mammals. Journal of Molecular Evolution , 32 (2),
128–144.
Jónsson, H., Ginolhac, A., Schubert, M., Johnson, P. L., & Orlando, L.
(2013). mapDamage2. 0: fast approximate Bayesian estimates of ancient
DNA damage parameters. Bioinformatics , 29 (13), 1682–1684.
Katoh, K., & Standley, D. M. (2013). MAFFT Multiple Sequence Alignment
Software Version 7: Improvements in Performance and Usability.Molecular Biology and Evolution , 30 (4), 772–780. doi:
10.1093/molbev/mst010
Kearse, M., Moir, R., Wilson, A., Stones-Havas, S., Cheung, M.,
Sturrock, S., … Thierer, T. (2012). Geneious Basic: an integrated
and extendable desktop software platform for the organization and
analysis of sequence data. Bioinformatics , 28 (12),
1647–1649.
Kiesow, A. M., Wallace, L. E., & Britten, H. B. (2011).
Characterization and isolation of five microsatellite loci in northern
flying squirrels, Glaucomys sabrinus (Sciuridae, Rodentia).Western North American Naturalist , 71 (4), 553–556.
Lane, M. A. (1996). Roles of Natural History Collections. Annals
of the Missouri Botanical Garden , 83 (4), 536–545. doi:
10.2307/2399994
Langmead, B., & Salzberg, S. L. (2012). Fast gapped-read alignment with
Bowtie 2. Nature Methods , 9 (4), 357–359. doi:
10.1038/nmeth.1923
Li, B., & Kimmel, M. (2013). Factors influencing ascertainment bias of
microsatellite allele sizes: impact on estimates of mutation rates.Genetics , 195 (2), 563–572.
Lian, C. L., Wadud, M. A., Geng, Q., Shimatani, K., & Hogetsu, T.
(2006). An improved technique for isolating codominant compound
microsatellite markers. Journal of Plant Research , 119 (4),
415–417.
Lister, A. M., & Group, C. C. R. (2011). Natural history collections as
sources of long-term datasets. Trends in Ecology & Evolution ,26 (4), 153–154.
Martin, M. (2011). Cutadapt removes adapter sequences from
high-throughput sequencing reads. EMBnet.Journal , 17 (1),
10. doi: 10.14806/ej.17.1.200
Matheson, C. D., Marion, T. E., Hayter, S., Esau, N., Fratpietro, R., &
Vernon, K. K. (2009). Removal of metal ion inhibition encountered during
DNA extraction and amplification of copper-preserved archaeological bone
using size exclusion chromatography. American Journal of Physical
Anthropology: The Official Publication of the American Association of
Physical Anthropologists , 140 (2), 384–391.
McDonough, M. M., Parker, L. D., Rotzel McInerney, N., Campana, M. G.,
& Maldonado, J. E. (2018). Performance of commonly requested
destructive museum samples for mammalian genomic studies. Journal
of Mammalogy , 99 (4), 789–802.
McKee, A. M., Spear, S. F., & Pierson, T. W. (2015). The effect of
dilution and the use of a post-extraction nucleic acid purification
column on the accuracy, precision, and inhibition of environmental DNA
samples. Biological Conservation , 183 , 70–76.
Miller, M. P., Knaus, B. J., Mullins, T. D., & Haig, S. M. (2013).
SSR_pipeline: A bioinformatic infrastructure for identifying
microsatellites from paired-end Illumina high-throughput DNA sequencing
data. Journal of Heredity , 104 (6), 881–885.
Miller, W., Drautz, D. I., Janecka, J. E., Lesk, A. M., Ratan, A.,
Tomsho, L. P., … Qi, J. (2009). The mitochondrial genome sequence
of the Tasmanian tiger (Thylacinus cynocephalus). Genome
Research , 19 (2), 213–220.
Morin, P. A., Manaster, C., Mesnick, S. L., & Holland, R. (2009).
Normalization and binning of historical and multi-source microsatellite
data: overcoming the problems of allele size shift with allelogram.Molecular Ecology Resources , 9 (6), 1451–1455.
O’reilly, P. T., Canino, M. F., Bailey, K. M., & Bentzen, P. (2000).
Isolation of twenty low stutter di-and tetranucleotide microsatellites
for population analyses of walleye pollock and other gadoids.Journal of Fish Biology , 56 (5), 1074–1086.
Oshida, T., Lin, L. K., Masuda, R., & Yoshida, M. C. (2000).
Phylogenetic relationships among Asian species of Petaurista (Rodentia,
Sciuridae), inferred from mitochondrial cytochrome b gene sequences.Zoologica Science , 17 (1), 123–128.
Paabo, S., Poinar, H., Serre, D., Jaenicke-Despres, V., Hebler, J.,
Rohland, N., … Hofreiter, M. (2004). Genetic Analyses from
Ancient DNA. Annual Review of Genetics , 38 (645–679).
Piggott, M. P., Bellemain, E., Taberlet, P., & Taylor, A. C. (2004). A
multiplex pre-amplification method that significantly improves
microsatellite amplification and error rates for faecal DNA in limiting
conditions. Conservation Genetics , 5 (3), 417–420.
Pimentel, J. S., Carmo, A. O., Rosse, I. C., Martins, A. P., Ludwig, S.,
Facchin, S., … Kalapothakis, E. (2018). High-throughput
sequencing strategy for microsatellite genotyping using neotropical fish
as a model. Frontiers in Genetics , 9 , 73.
Pontiroli, A., Travis, E. R., Sweeney, F. P., Porter, D., Gaze, W. H.,
Mason, S., … Wellington, E. M. H. (2011). Pathogen quantitation
in complex matrices: a multi-operator comparison of DNA extraction
methods with a novel assessment of PCR inhibition. PloS One ,6 (3).
Regnaut, S., Lucas, F. S., & Fumagalli, L. (2006). DNA degradation in
avian faecal samples and feasibility of non-invasive genetic studies of
threatened capercaillie populations. Conservation Genetics ,7 (3), 449–453.
Rizzi, E., Lari, M., Gigli, E., De Bellis, G., & Caramelli, D. (2012).
Ancient DNA studies: new perspectives on old samples. Genetics
Selection Evolution , 44 (1), 21.
Robertson, J. M., & Walsh-Weller, J. (1998). An introduction to PCR
primer design and optimization of amplification reactions. InForensic DNA profiling protocols (pp. 121–154). Springer.
Rohland, N., & Reich, D. (2012). Cost-effective, high-throughput DNA
sequencing libraries for multiplexed target capture. Genome
Research , gr.128124.111. doi: 10.1101/gr.128124.111
Šarhanová, P., Pfanzelt, S., Brandt, R., Himmelbach, A., & Blattner, F.
R. (2018). SSR-seq: Genotyping of microsatellites using next-generation
sequencing reveals higher level of polymorphism as compared to
traditional fragment size scoring. Ecology and Evolution ,8 (22), 10817–10833.
Schmieder, R., & Edwards, R. (2011). Quality control and preprocessing
of metagenomic datasets. Bioinformatics (Oxford, England) ,27 (6), 863–864. doi: 10.1093/bioinformatics/btr026
Shapiro, B. (2012). Ancient DNA: Methods and Protocols . Retrieved
from http://library.wur.nl/WebQuery/clc/1989945
Silva, P. I., Martins, A. M., Gouvea, E. G., Pessoa-Filho, M., &
Ferreira, M. E. (2013). Development and validation of microsatellite
markers for Brachiaria ruziziensis obtained by partial genome assembly
of Illumina single-end reads. Bmc Genomics , 14 (1), 17.
Smith, A. B., Santos, M. J., Koo, M. S., Rowe, K. M., Rowe, K. C.,
Patton, J. L., … Moritz, C. (2013). Evaluation of species
distribution models by resampling of sites surveyed a century ago by
Joseph Grinnell. Ecography , 36 (9), 1017–1031.
Thatte, P., Joshi, A., Vaidyanathan, S., Landguth, E., & Ramakrishnan,
U. (2018). Maintaining tiger connectivity and minimizing extinction into
the next century: Insights from landscape genetics and
spatially-explicit simulations. Biological Conservation ,218 , 181–191.
Tibihika, P. D., Curto, M., Dornstauder-Schrammel, E., Winter, S.,
Alemayehu, E., Waidbacher, H., & Meimberg, H. (2019). Application of
microsatellite genotyping by sequencing (SSR-GBS) to measure genetic
diversity of the East African Oreochromis niloticus. Conservation
Genetics , 20 (2), 357–372.
Untergasser, A., Cutcutache, I., Koressaar, T., Ye, J., Faircloth, B.
C., Remm, M., & Rozen, S. G. (2012). Untergasser, Andreas, et al.
”Primer3—new capabilities and interfaces. Nucleic Acids
Research , 40 (15), e115.
Vartia, S., Villanueva-Cañas, J. L., Finarelli, J., Farrell, E. D.,
Collins, P. C., Hughes, G. M., … Cross, T. F. (2016). A novel
method of microsatellite genotyping-by-sequencing using individual
combinatorial barcoding. Royal Society Open Science , 3 (1),
150565.
Wang, C., & Rosenberg, N. A. (2012). MicroDrop: a program for
estimating and correcting for allelic dropout in nonreplicated
microsatellite genotypes version 1.01. See Https://Web. Stanford.
Edu/Group/Rosen/Berglab/Microdrop. Html .
Weiß, C. L., Schuenemann, V. J., Devos, J., Shirsekar, G., Reiter, E.,
Gould, B. A., … Burbano, H. A. (2016). Temporal patterns of
damage and decay kinetics of DNA retrieved from plant herbarium
specimens. Royal Society Open Science , 3 (6), 160239.
White, L. C., Mitchell, K. J., & Austin, J. J. (2018). Ancient
mitochondrial genomes reveal the demographic history and phylogeography
of the extinct, enigmatic thylacine (Thylacinus cynocephalus).Journal of Biogeography , 45 (1), 1–13.
Williams, J. F. (1989). Optimization strategies for the polymerase chain
reaction. Biotechniques , 7 (7), 762–769.
Williams, S. L. (1999). Destructive preservation: A review of the
effect of standard preservation practices on the future use of natural
history collections .
Yuan, S. (2020). PHYLOGENETIC AND POPULATION GENETIC ANALYSIS OF
THE HUMBOLDT’S FLYING SQUIRREL USING HIGH-THROUGHPUT SEQUENCING DATA .
Humboldt State University, Arcata, CA.
Zhan, L., Paterson, I. G., Fraser, B. A., Watson, B., Bradbury, I. R.,
Nadukkalam Ravindran, P., … Bentzen, P. (2017). MEGASAT:
automated inference of microsatellite genotypes from sequence data.Molecular Ecology Resources , 17 (2), 247–256.
Zittlau, K. A., Davis, C. S., & Strobeck, C. (2000). Characterization
of microsatellite loci in northern flying squirrels (Glaucomys
sabrinus). Molecular Ecology , 9 (6), 826–827.
Figures:
Figure 1: Scatter plot of average quality of PCR replicates following QC
from prinseq-lite. The replicates for each specimen are shown across the
x axis, and the % of ‘good’ reads are shown on the y-axis. Samples are
sorted by type: tissue, high quality museum specimen and low quality
museum specimens. Individuals of the same type are separated by a dashed
line.
Figure 2: A flow chart of best practices for sample analysis performed
here, with particular emphasis on how to assess low quality museum
specimens. Note: many steps detailed under ‘CHIIMP Output’ can be
manipulated when running the pipeline.
Tables:
Table 1: Summary of samples used in this study. Color coded throughout
as green= tissue sample, blue = high quality museum specimen (also
referred to as HQMS hereafter), and red = low quality museum specimens
(LQMS). Samples were acquired from approved destructive sampling
requests from several national museum collections as follows: HSU=
Humboldt State University Vertebrate Museum, UMMZ= University of
Michigan Museum of Zoology, MVZ= Museum of Vertebrate Zoology,
University of California Berkeley, and LACM= Los Angeles County Museum.
Table 2: Primers used for microsatellite and mitochondrial cytochrome b
amplification. Microsatellites were characterized by the length
(S=short, under 150 bp, M=medium, 150-200 bp, and L= long, over 200 bp)
and repeat motif as simple (standard dinucleotide repeat motif in all
included microsatellites) and complex (where repeat motifs were
interrupted by variable repeat units- only seen in GLSA12 here). The
reverse primer for mitochondrial cytochrome b was newly designed
for this study as previously published internal cytochrome bprimers did not amplify in G. oregonensis .
Table 3. Quality of individually prepared libraries as assessed by
prinseq-lite. The sample name and replicate number are indicated in the
Sample ID field, and the ‘all’ row includes the replicate counts when
bioinformatically summed together. The ‘combined’ ID (also shown in
bold) represents the second library prep where all microsatellite
replicates + cytochrome b were pooled and run through the CHIIMP
pipeline. Only quality data from the forward read (R1) is shown here.
Samples retain the same color coding for tissue= green, HQMS= blue and
LQMS= red. Average quality is shown per replicate, read counts and the
percentage of reads passing prinseq-lite quality filtering are shown
across all microsatellite replicates as well. Range was calculated from
each individual replicate and excluded bioinformatically summed and
combined library prep data.
Table 4: Summary of recovered genotypes from the CHIIMP pipeline
(Barbian et al. 2018). All recovered genotypes are provided, any areas
where a ‘-’ is found indicates no recovered genotype from that
replicate. The accuracy across each amplification was calculated as well
as the average accuracy per microsatellite and across sample type
(tissue, HQMS and LQMS). The bioinformatically combined dataset as well
as the pooled dataset genotypes are also provided. CHIIMP output
provides various metrics on quality and as such a * represents where
possible PCR stutter was removed, a ▵ represents where PCR artifacts
were removed and ❖ represents where more than two prominent sequences
were found. Genotypes shown in red italics were generated from less than
500 raw reads and should not be used in downstream analyses.
Table 5: Descriptive Statistics of samples sorted by type, either
‘tissue’, ‘HQMS’ or ‘LQMS’.
Table 6: MicroDrop results for bioinformatically combined datasets with
a comparison of the raw, initial CHIMP output to the final, processed
data. Both locus specific and individual rates of estimated allelic
dropout are provided.
Appendices
Appendix 1: PCR Amplification success of the five included
microsatellites and mitochondrial cytochrome b gene. Success was
determined by the presence of a band in the expected size range from an
agarose gel. Note: HSU 8180 represents a tissue sample so only two PCR
replicates of each marker were performed.
Appendix 2: Coverage of cytochrome b across all samples,
bowtie 2 v 2.3.0 was used to map reads and Geneious Prime calculated the
included quality metrics. Quality metrics provided from the 300 bp
fragment amplified in all samples except HSU 8180 for which the entire
CDS (1,140 bp) was amplified and analyzed.
Appendix 3: Mitochondrial minimum spanning network.