Importantly, our key observations, including those on vernacularization, are supported by similar trends across multiple independently maintained bibliographies. This exemplifies the power of such an approach in uncovering broad patterns in knowledge production that are robust to occasional inaccuracies in the data. While documentation and polishing continue, we have made all source code openly available, so that every detail of the data processing can be independently investigated and verified. Obtaining valid conclusions depends on efficient and reliable harmonization and augmentation of the raw entries. This paper demonstrates how such challenges can be overcome by specifically tailored data analytical ecosystems that provide scalable tools for data processing and analysis.