1. The file RIMS.SYD contains three variables and 10 cases. Each case represents a ceramic vessel; the variable DIAMETER is the rim diameter in cm, VOLUME is the vessel volume in liters, and SIZE is the floor area in square meters of the house in which the vessel was found.
2. The dataset WARRIOR.SYD contains real data on Mississippian settlements in the Black Warrior valley. SITE$ is the site name. TYPE$ records whether a site is a "village" (without mounds) or a "center" (with at least one mound). SIZE is the extent of the site's midden area in acres. ACRES records the number of acres of arable land within a 0.6-mile walk of the site. BUSHELS is an estimate of the annual productivity of that farmland in thousands of bushels of corn.
3. Using the data in KNOLL.SYD, plot CONCHS against LEVEL and fit a lowess curve to the resulting scatter. (You can do this by using the SMOOTH=LOWESS option of the PLOT command, or the "LOWESS" smoother in the scatterplot options dialog box.) How does changing the F parameter (i.e., the TENSION option) affect the results?
4. Using the data in POINTS.SYD, measure the degree of association between the variables FACETS$ and CONTEXT$. Compute phi, phi/phi_max (Cole's C7), and Yule's Q, and compare the results. (You can use the Statistics | Tables | Crosstabs | Two-way procedure to construct the contingency table and compute some of the coefficients; note the Statistics button in the lower right corner of the dialog box.) What do these results mean?
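For readers who want to check the coefficients in exercise 4 by hand, the following is a minimal Python sketch of the three measures for a 2x2 table with cells a, b (first row) and c, d (second row). It is not part of SYSTAT, and the function names are made up for this illustration. The cell counts are hypothetical and are not taken from POINTS.SYD. Here phi_max is taken to be the most extreme phi attainable with the observed row and column totals held fixed, which is one common way of defining the ratio and is assumed here.

```python
import math


def phi(a, b, c, d):
    """Phi coefficient for the 2x2 table [[a, b], [c, d]]."""
    denom = math.sqrt((a + b) * (c + d) * (a + c) * (b + d))
    if denom == 0:
        raise ValueError("phi is undefined when a row or column total is zero")
    return (a * d - b * c) / denom


def yule_q(a, b, c, d):
    """Yule's Q: (ad - bc) / (ad + bc)."""
    denom = a * d + b * c
    if denom == 0:
        raise ValueError("Q is undefined when ad + bc is zero")
    return (a * d - b * c) / denom


def phi_over_phi_max(a, b, c, d):
    """Phi divided by the most extreme phi the marginal totals allow.

    Assumption: phi_max is found by pushing cell a as far as the fixed
    row and column totals permit, in the direction of the observed
    association. Phi and phi_max share a denominator, so the ratio
    reduces to a ratio of (ad - bc) values.
    """
    n = a + b + c + d
    row1, col1 = a + b, a + c
    det = a * d - b * c  # algebraically equal to a*n - row1*col1
    if det >= 0:
        a_extreme = min(row1, col1)          # largest possible a
    else:
        a_extreme = max(0, row1 + col1 - n)  # smallest possible a
    det_extreme = a_extreme * n - row1 * col1
    if det_extreme == 0:
        raise ValueError("ratio is undefined when a row or column total is zero")
    return det / abs(det_extreme)


if __name__ == "__main__":
    # Hypothetical counts for illustration only (not from POINTS.SYD).
    tables = {
        "moderate association": (12, 5, 4, 9),
        "one empty cell": (8, 0, 4, 8),
    }
    for label, cells in tables.items():
        print(label, cells)
        print("  phi         = %.3f" % phi(*cells))
        print("  phi/phi_max = %.3f" % phi_over_phi_max(*cells))
        print("  Yule's Q    = %.3f" % yule_q(*cells))
```

Running the sketch on a table with one empty cell is a useful comparison: Q and phi/phi_max both reach their limit there, while phi does not unless the marginal totals are also balanced.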
Datasets for this exercise (right-click to download):