All of our networks were  implemented in and trained with PyTorch \cite{paszke2017automatic} with 150 iterations of a dataset of binary and decimal digits from 1 to 1024.