Center for Data Driven Discovery in Biomedicine
Giovanni Santia is a Data Engineer responsible for processing and storing clinical data for a range of projects, including the Gabriella Miller Kids First Data Resource Center. He manages data according to each dataset’s protocols and organizes it as it arrives from disparate sources. Once data is received, Giovanni cleans it and transforms it into a standardized format so that the research community can make the best future use of it. This cleaning process draws on a wide variety of techniques, from standard data munging to machine learning and data science methods.
Giovanni earned a BS in both math and physics from the University of Michigan before completing an MS in math at Wayne State University. He is currently pursuing a PhD in data science at Drexel University. He feels fortunate to be able to use data science to help other people, especially children.