Today we are creating more data than ever before, and the volume is growing at an exponential rate. This information can be used for purposes that were unforeseen when the data was first collected. Advances in technology and computing power have been critical in making sense of the data that is available globally. The growth of distributed databases, where a logically unified database is stored across several platforms instead of a single platform, allows for highly scalable parallel processing of vast amounts of data. For many applications, this can reduce processing time substantially, in some cases by orders of magnitude.