During this hour we will prepare a real-life data set for analysis, taking the raw data and building the analysis set step by step.
We will pause twice to explore some of the implementation details of padr’s workhorses, the functions pad and thicken. Ample time will be reserved to ask questions, both during and after the presentation.
Please see the full list of NHS-R Conference Workshops available here
Edwin got his MsC in applied statistics at Leiden University and has been working as a data scientist for almost a decade now. For the past five years he has been employed by funda, the main housing platform in the Netherlands. In his daily job he focuses on developing, productionizing and maintaining machine learning models and other data products, as well as performing ad hoc data analysis to answer business questions. At work he uses a mix of R, python and sql. In 2017 he released the padr package on CRAN, which has accrued 1.4 million downloads over the last 6 years.