About Datasets

The Physics Practicals have a standard format for all datasets. Most locally written software automatically can read and write datasets using this format. Examples of locally written sofrtware include CreateDataset, ViewDataset, and PolynomialFit. Here we describe what that format is.

The dataset file is text.

The first line of the file is the title of the dataset.
The second line of the file names the variables of the data. The names are separated by tabs. In the examples below we represent a tab with: <TAB>
The third and subsequent lines of the file contain the data. Each datapoint is on a separate line and the values are separated by tabs.

Thus, the dataset can be edited with a text editor or a spreadsheet program such as Excel. There are four cases for the number of variables in the dataset.

One Variable

If only one value is given for each datapoint, it is the dependent (i.e. y) variable. In this case the values of the independent (x) variable are assumed to be 1, 2, 3, … in order.

Here is an example of such a dataset:

Balonium decay values
Counts per second
50
32
27
15
11
8

Two Variables

In this case the first column contains the values for the independent (x) variable and the second column the values for the dependent (y) variables. For example:

Student collected data on pressure-temperature values
Pressure (cm Hg)<TAB>Temperature (C)
65<TAB>-10
75<TAB>17
86<TAB>42

Three Variables

If there are three variables, the third one is the uncertainty in the dependent (y) variable.

Thermocouple Calibration Data
Temp (C)<TAB>Voltage (Volts)<TAB>errV
0<TAB>-0.89<TAB>0.05
5<TAB>-0.69<TAB>0.05
10<TAB>-0.53<TAB>0.05

Four Variables

Now there are explicit uncertainties in both coordinates of the data. The first column contains the name and values of the independent (x) variable, the second column contains the name and values of the uncertainty in the independent variable, the third column contains the name and values of the dependent (y) variable and the fourth column the name and values of the uncertainty in the dependent variable.

Pearson's Data with York's Weights
X<TAB>errX<TAB>Y<TAB>errY
0<TAB>0.0316<TAB>5.9<TAB>1
0.9<TAB>0.0316<TAB>5.4<TAB>0.746
1.8<TAB>0.0447<TAB>4.4<TAB>0.5

This document was written by David M. Harrison, Dept. of Physics, Univ. of Toronto in December 2007.