Fig. 1
From: Predicting onset of complications from diabetes: a graph based approach

Flowchart of the data cleaning and standardization process, which (1) combines and quality-checks (QC) the raw data files; (2) combines variables, then standardizes and cleans them using a dictionary specific to the data source; and (3) removes outlier values and normalizes the variables using a universal clinical parameter dictionary.
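
The three stages in the caption can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the record layout, the names `SOURCE_DICT` and `CLINICAL_RANGES`, the HbA1c bounds, and the choice of range-based min-max scaling for the normalization step are all assumptions made for the example.

```python
# Hypothetical source-specific dictionary: raw variable name -> standard name.
SOURCE_DICT = {"HbA1c": "hba1c_percent", "hba1c_pct": "hba1c_percent"}

# Hypothetical universal clinical dictionary: standard name -> (low, high).
# The bounds are illustrative only, not clinical guidance.
CLINICAL_RANGES = {"hba1c_percent": (4.0, 14.0)}


def combine_and_qc(files):
    """Stage 1: concatenate raw files; drop rows lacking an ID and exact duplicates."""
    seen, combined = set(), []
    for records in files:
        for rec in records:
            key = tuple(sorted(rec.items()))
            if rec.get("patient_id") is None or key in seen:
                continue
            seen.add(key)
            combined.append(rec)
    return combined


def standardize(records, source_dict):
    """Stage 2: map source variables onto standard names and coerce values to float."""
    out = []
    for rec in records:
        clean = {"patient_id": rec["patient_id"]}
        for raw_name, value in rec.items():
            std_name = source_dict.get(raw_name)
            if std_name is None or clean.get(std_name) is not None:
                continue  # unknown variable, or a valid value is already stored
            try:
                clean[std_name] = float(value)
            except (TypeError, ValueError):
                clean[std_name] = None  # unparseable entries become missing
        out.append(clean)
    return out


def remove_outliers_and_normalize(records, clinical_ranges):
    """Stage 3: blank values outside the dictionary range, then scale to [0, 1]."""
    out = []
    for rec in records:
        scaled = {"patient_id": rec["patient_id"]}
        for name, (low, high) in clinical_ranges.items():
            value = rec.get(name)
            if value is None or not (low <= value <= high):
                scaled[name] = None
            else:
                scaled[name] = (value - low) / (high - low)
        out.append(scaled)
    return out


# Two made-up raw files with differing variable names for the same measurement.
file_a = [
    {"patient_id": "p1", "HbA1c": "9.0"},
    {"patient_id": "p1", "HbA1c": "9.0"},   # exact duplicate
    {"patient_id": None, "HbA1c": "6.1"},   # missing ID
]
file_b = [
    {"patient_id": "p2", "hba1c_pct": "55"},   # outside the assumed range
    {"patient_id": "p3", "hba1c_pct": "n/a"},  # not numeric
]

combined = combine_and_qc([file_a, file_b])
result = remove_outliers_and_normalize(
    standardize(combined, SOURCE_DICT), CLINICAL_RANGES
)
```

In this sketch, out-of-range and unparseable values are kept as missing (`None`) rather than dropping the whole record, so downstream steps can decide how to handle them; the caption itself does not specify which approach was used.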