Designing a Database for the Cod Stomach Data Set

Paul Murrell


Table of Contents

Introduction
Variables
Data format
Exercises

Introduction

The data are measurements of the stomach contents of Atlantic cod. The data set described here is provided as a colon-delimited ASCII text file. It is a small sample from a larger data set that records more stomach contents and more variables.

Variables

The data set contains the following variables:

region -
shipType - The type of ship (e.g., commercial versus research).
shipID - Unique ship identifier.
trip - Trip identifier; unique within a region.
set - Set identifier; unique within a particular ship on a particular trip. A set is, e.g., one set of a fishing net.
fishID - Unique fish identifier.
fishLength - Length of a fish (mm).
preyMass - Weight of lump from cod stomach (grams).
preyType - Species of lump from cod stomach. The special value Empty means that no lumps were found in the stomach. Rare species are gathered together under the special label Other.

Data format

The data set is provided as a colon-delimited ASCII text file called cod.txt.

The file CodMeta.xml provides a StatDataML description of the data set.
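
As a rough illustration of the colon-delimited layout, the following Python sketch reads such a file with the standard csv module. The function name read_cod is hypothetical, and the sketch assumes that the first line of the file holds the variable names; if it does not, the names can be passed in explicitly through fieldnames. All values are returned as text, so fishLength and preyMass would still need converting to numbers.

```python
import csv

def read_cod(lines, fieldnames=None):
    """Parse colon-delimited lines into a list of dicts, one per record.

    lines      -- any iterable of text lines (an open file works)
    fieldnames -- column names to use; if None, the first line is
                  assumed (not confirmed by the source) to be a header
    """
    reader = csv.DictReader(lines, fieldnames=fieldnames, delimiter=":")
    return list(reader)

# Possible usage, assuming cod.txt is in the working directory:
#
#     with open("cod.txt", newline="") as f:
#         rows = read_cod(f)
#     print(rows[0]["preyType"])
```

Reading each record into a dict keyed by variable name keeps the later design work independent of the column order in the file.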

Exercises

Q: Designing a database

This data set is provided in a plain text format. The task in this exercise is to produce a database design that stores these data in third normal form.
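
A third normal form design rests on knowing which variables determine which others. One way to explore this is to test candidate dependencies against the records themselves. The Python sketch below is a minimal helper for that purpose, not a solution to the exercise. The function name determines is hypothetical, and rows is assumed to be a list of dicts keyed by variable name.

```python
def determines(rows, key_cols, dependent_col):
    """Return True if, in these rows, every combination of values in
    key_cols is always paired with a single value of dependent_col."""
    seen = {}
    for row in rows:
        key = tuple(row[c] for c in key_cols)
        value = row[dependent_col]
        # setdefault stores the first value seen for this key and
        # returns the stored value on every later occurrence
        if seen.setdefault(key, value) != value:
            return False  # same key, two different dependent values
    return True

# Possible usage (rows is assumed to hold the parsed records):
#
#     determines(rows, ["fishID"], "fishLength")
#     determines(rows, ["region", "trip"], "shipID")
```

A False result rules a dependency out. A True result only shows that the sample is consistent with the dependency, so the variable descriptions still have to justify it before it is built into the design.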