Abstract

Simulating family or unrelated genotype data across multiple markers is a task commonly encountered by researchers in many disciplines, including human statistical genetics and plant-based mapping studies. We developed sim1000G, a new integrated and easy-to-use R package that simulates multiple genetic markers in unrelated individuals or families based on a regression framework. The package can capture allele frequencies and both short- and long-range linkage disequilibrium (LD) patterns in human, animal, or plant studies, starting from a raw variant file. Currently, sim1000G is one of the few packages that is completely integrated within R and can simulate both unrelated and family data for multiple markers.