3 Models

Think of your model(s) as the physical location of raw data and metadata. More formally, an entire field of data engineering and data science is devoted to database design and modeling. An excellent way to get your feet wet is to learn about database normalization and to look at some entity-relationship diagrams. You will quickly realize the limitations of your Frankenstein multi-tab Excel sheet.

3.1 What makes a good model?

3.2 Love the data you have

Data is everywhere, but you only have access to some of it. Getting access is often time-consuming. Once you have access to data, you should consistently assess its strengths, weaknesses, opportunities, and threats (SWOT). In the name of deliverables, you have to have endpoints. Your productivity will be judged by the consistent quality and quantity of your work.

3.3 Data sources

3.4 EMR data (Relational)

3.5 SQL database

3.6 Transactional vs Analytical Data

OLTP (online transaction processing) and OLAP (online analytical processing)
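As a rough illustration of the difference, the sketch below uses Python's built-in sqlite3 module with an in-memory database. The `encounter` table, its columns, and the values are hypothetical and exist only for this example. Transactional work tends to touch one record at a time, while analytical work summarizes many records at once.

```python
import sqlite3

# In-memory database; the table and its contents are hypothetical.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE encounter ("
    "encounter_id INTEGER PRIMARY KEY, clinic TEXT, charge REAL)"
)

# Transactional (OLTP-style): record one encounter at a time,
# each insert committed in its own small transaction.
new_encounters = [("cardiology", 200.0), ("cardiology", 150.0), ("oncology", 400.0)]
for clinic, charge in new_encounters:
    with conn:  # commits on success, rolls back on error
        conn.execute(
            "INSERT INTO encounter (clinic, charge) VALUES (?, ?)",
            (clinic, charge),
        )

# Analytical (OLAP-style): one query that summarizes across many rows.
summary = conn.execute(
    "SELECT clinic, COUNT(*), SUM(charge) "
    "FROM encounter GROUP BY clinic ORDER BY clinic"
).fetchall()
print(summary)

conn.close()
```

In practice the two workloads usually live in different systems; a single SQLite file is used here only to keep the sketch self-contained.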

3.7 Data Structure

3.7.1 Wide/Horizontal/De-Normalized Data

Think of a typical Excel sheet: every row is a person and every column is a piece of information about them.
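A minimal sketch of what a wide table looks like, written with Python's standard csv module. The column names (`person_id`, `age`, `sex`, `sbp_visit1`, `sbp_visit2`) and the values are made up for illustration. Note how a repeated measurement needs one column per visit.

```python
import csv
import io

# Hypothetical wide table: one row per person, one column per attribute.
# Repeated measurements (blood pressure at each visit) become extra columns.
wide_rows = [
    {"person_id": 1, "age": 54, "sex": "F", "sbp_visit1": 128, "sbp_visit2": 131},
    {"person_id": 2, "age": 61, "sex": "M", "sbp_visit1": 142, "sbp_visit2": 138},
]

# Write the rows out as CSV text, the way a spreadsheet export would look.
buffer = io.StringIO()
writer = csv.DictWriter(buffer, fieldnames=list(wide_rows[0].keys()))
writer.writeheader()
writer.writerows(wide_rows)
wide_csv = buffer.getvalue()
print(wide_csv)
```

Adding a third visit to this layout means adding a new column to every row, which is one reason wide tables become hard to maintain.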

3.7.2 Long/Vertical/Normalized Data

The data are split into separate, related tables.
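One way to picture separate related tables is the sketch below, which uses Python's built-in sqlite3 module. The `person` and `measurement` tables, their columns, and the values are hypothetical. Person-level facts are stored once, each measurement gets its own row, and the two tables are linked by `person_id`.

```python
import sqlite3

# In-memory database; table names, columns, and values are hypothetical.
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE person (
        person_id INTEGER PRIMARY KEY,
        sex TEXT
    );
    CREATE TABLE measurement (
        measurement_id INTEGER PRIMARY KEY,
        person_id INTEGER NOT NULL REFERENCES person(person_id),
        visit INTEGER NOT NULL,
        sbp INTEGER
    );
    """
)

# Person-level information is stored once per person.
conn.executemany(
    "INSERT INTO person (person_id, sex) VALUES (?, ?)",
    [(1, "F"), (2, "M")],
)

# Each measurement is its own row (long format), keyed back to the person.
conn.executemany(
    "INSERT INTO measurement (person_id, visit, sbp) VALUES (?, ?, ?)",
    [(1, 1, 128), (1, 2, 131), (2, 1, 142), (2, 2, 138)],
)

# Join the related tables to get one row per person per visit.
long_rows = conn.execute(
    """
    SELECT p.person_id, p.sex, m.visit, m.sbp
    FROM person AS p
    JOIN measurement AS m ON m.person_id = p.person_id
    ORDER BY p.person_id, m.visit
    """
).fetchall()
for row in long_rows:
    print(row)

conn.close()
```

With this layout, a new visit is just another row in `measurement`; no columns need to change.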

3.8 Metadata

3.8.1 Coded data

3.9 File Types

3.9.1 csv

3.9.2 xlsx

3.9.3 json

3.10 REDCap

3.11 Other Examples

3.11.1 Qualtrics

3.11.2 Google forms

3.13 Quality checks