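Normalized the data set by dividing each count by the total number for that gender, so the two groups can be compared as proportions rather than raw counts. The two distributions look similar to each other, but I can't say that for sure before running the analysis.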
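
A minimal sketch of that normalization step, assuming the data are stored as per-gender counts for each category. The function name `normalize_by_group`, the category labels, and the numbers are all made up for illustration and are not taken from the actual data set.

```python
def normalize_by_group(counts):
    """Turn raw counts per group into proportions that sum to 1 within each group.

    `counts` is assumed to map group name -> {category: count}.
    """
    normalized = {}
    for group, category_counts in counts.items():
        total = sum(category_counts.values())
        if total == 0:
            # Avoid dividing by zero for a group with no observations
            raise ValueError(f"group {group!r} has no observations")
        normalized[group] = {
            category: n / total for category, n in category_counts.items()
        }
    return normalized


# Hypothetical counts, for illustration only
example_counts = {
    "female": {"A": 30, "B": 50, "C": 20},
    "male": {"A": 60, "B": 110, "C": 30},
}

example_props = normalize_by_group(example_counts)

for group, props in example_props.items():
    print(group, {category: round(p, 3) for category, p in props.items()})
```

After this step each group's values sum to 1, so groups of different sizes are on the same scale. Similar-looking proportions are only a visual impression; they are not a substitute for the analysis itself.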