Plan
- What’s the first thing to do when you are ready to enter data?
- Plan a data structure follows tidy data rules
- Ideally plan before you collect data and match datasheets to entry format
Where to enter data
- Spreadsheet
- Text file
- Database
-
Form (web or GUI databases)
- All of these are reasonable options
- When used properly spreadsheets or forms for databases can provide additional protection against bad data being entered
- Be careful of data conversion issues from spreadsheets
Show posts/papers on dates and gene names
Quality Assurance
- Stopping bad data from ever being entered
-
Set rules about what values can be entered into a column
- Select an area of cells, most often a column
Data -> Data Validation- Choose the data type:
Whole numbers(only that type can be entered) - Set limitations
- Use
Datato set the type of limitation:between - Then use additional boxes to provide specifics:
1and10
- Use
- Add a message to explain what goes in a cell in
Input Message -
Add a useful error message using
Error Alert - Make lists of choices
Allow=List- Enter list values in
Sources: DM, DO, DS, PP, PM
Demo a data entry form in Excel or Google Forms
Quality Control
- Looking for bad data that has already been entered
- Sort
- Graph
- Check for realistic ranges of values
