The boxplot below displays the distribution of all gold medal-winning heights for the women's high jump, \(\textit{Wgold}\), in metres, for the 19 Olympic Games held from 1948 to 2020. --- 2 WORK AREA LINES (style=lined) --- --- 1 WORK AREA LINES (style=lined) --- --- 3 WORK AREA LINES (style=lined) --- --- 2 WORK AREA LINES (style=lined) ---
Data Analysis, GEN1 2024 VCAA 5 MC
The number of siblings of each member of a class of 24 students was recorded.
The results are displayed in the table below.
\begin{array}{|c|c|c|c|c|c|c|c|c|c|c|c|}
\hline
\rule{0pt}{2.5ex} \quad 2 \quad \rule[-1ex]{0pt}{0pt} & \quad 1 \quad & \quad 3 \quad & \quad 2 \quad & \quad 1 \quad & \quad 1 \quad & \quad 1 \quad & \quad 4 \quad & \quad 1 \quad & \quad 1 \quad & \quad 1 \quad & \quad 1 \quad \\
\hline
\rule{0pt}{2.5ex} 1 \rule[-1ex]{0pt}{0pt} & 2 & 1 & 2 & 2 & 1 & 3 & 4 & 2 & 2 & 3 & 1 \\
\hline
\end{array}
A boxplot was constructed to display the spread of the data.
Which one of the following statements about this boxplot is correct?
- There are no outliers.
- The value of the interquartile range (IQR) is 1.5
- The value of the median is 1.5
- All of the five-number summary values are whole numbers.
CORE, FUR1 2021 VCAA 5 MC
CORE, FUR2 2020 VCAA 3
In a study of the association between BMI and neck size, 250 men were grouped by neck size (below average, average and above average) and their BMI recorded.
Five-number summaries describing the distribution of BMI for each group are displayed in the table below along with the group size.
The associated boxplots are shown below the table.
- What percentage of these 250 men are classified as having a below average neck size? (1 mark)
- What is the interquartile range (IQR) of BMI for the men with an average neck size? (1 mark)
- People with a BMI of 30 or more are classified as being obese.
- Using this criterion, how many of these 250 men would be classified as obese? Assume that the BMI values were all rounded to one decimal place. (1 mark)
- Do the boxplots support the contention that BMI is associated with neck size? Refer to the values of an appropriate statistic in your response. (2 marks)
CORE, FUR2-NHT 2019 VCAA 2
The five-number summary below was determined from the sleep time, in hours, of a sample of 59 types of mammals.
\begin{array} {|l|c|}
\hline
\rule{0pt}{2.5ex} \ \ \ \textbf{Statistic} \rule[-1ex]{0pt}{0pt} & \textbf{Sleep time (hours)} \\
\hline
\rule{0pt}{2.5ex} \text{minimum} \rule[-1ex]{0pt}{0pt} & \text{2.5} \\
\hline
\rule{0pt}{2.5ex} \text{first quartile} \rule[-1ex]{0pt}{0pt} & \text{8.0} \\
\hline
\rule{0pt}{2.5ex} \text{median} \rule[-1ex]{0pt}{0pt} & \text{10.5} \\
\hline
\rule{0pt}{2.5ex} \text{third quartile} \rule[-1ex]{0pt}{0pt} & \text{13.5} \\
\hline
\rule{0pt}{2.5ex} \text{maximum} \rule[-1ex]{0pt}{0pt} & \text{20.0} \\
\hline
\end{array}
CORE, FUR2 2019 VCAA 2
The parallel boxplots below show the maximum daily temperature and minimum daily temperature, in degrees Celsius, for 30 days in November 2017.
- Use the information in the boxplots to complete the following sentences.
For November 2017
i. | the interquartile range for the minimum daily temperature was |
|
°C (1 mark) |
ii. | the median value for maximum daily temperature was |
|
°C higher than the |
median value for minimum daily temperature (1 mark) |
iii. | the number of days on which the maximum daily temperature was less than the median value for |
minimum daily temperature was |
|
(1 mark) |
- The temperature difference between the minimum daily temperature and the maximum daily temperature in November 2017 at this location is approximately normally distributed with a mean of 9.4 °C and a standard deviation of 3.2 °C.
Determine the number of days in November 2017 for which this temperature difference is expected to be greater than 9.4 °C. (1 mark)
CORE, FUR2 2018 VCAA 1
The data in Table 1 relates to the impact of traffic congestion in 2016 on travel times in 23 cities in the United Kingdom (UK).
The four variables in this data set are:
- city — name of city
- congestion level — traffic congestion level (high, medium, low)
- size — size of city (large, small)
- increase in travel time — increase in travel time due to traffic congestion (minutes per day).
- How many variables in this data set are categorical variables? (1 mark)
- How many variables in this data set are ordinal variables (1 mark)
- Name the large UK cities with a medium level of traffic congestion. (1 mark)
- Use the data in Table 1 to complete the following two-way frequency table, Table 2. (2 marks)
- What percentage of the small cities have a high level of traffic congestion? (1 mark)
Traffic congestion can lead to an increase in travel times in cities. The dot plot and boxplot below both show the increase in travel time due to traffic congestion, in minutes per day, for the 23 UK cities.
- Describe the shape of the distribution of the increase in travel time for the 23 cities. (1 mark)
- The data value 52 is below the upper fence and is not an outlier.
Determine the value of the upper fence. (1 mark)
CORE, FUR2 2017 VCAA 2
The back-to-back stem plot below displays the wingspan, in millimetres, of 32 moths and their place of capture (forest or grassland).
- Which variable, wingspan or place of capture, is a categorical variable? (1 mark)
- Write down the modal wingspan, in millimetres, of the moths captured in the forest. (1 mark)
- Use the information in the back-to-back stem plot to complete the table below. (2 marks)
- Show that the moth captured in the forest that had a wingspan of 52 mm is an outlier. (2 marks)
- The back-to-back stem plot suggests that wingspan is associated with place of capture.
Explain why, quoting the values of an appropriate statistic. (2 marks)
CORE, FUR2 2016 VCAA 2
A weather station records daily maximum temperatures
- The five-number summary for the distribution of maximum temperatures for the month of February is displayed in the table below.
- The boxplots below display the distribution of maximum daily temperature for the months of May and July.
- Describe the shapes of the distributions of daily temperature (including outliers) for July and for May. (1 mark)
- Determine the value of the upper fence for the July boxplot. (1 mark)
- Using the information from the boxplots, explain why the maximum daily temperature is associated with the month of the year. Quote the values of appropriate statistics in your response. (1 mark)
- Describe the shapes of the distributions of daily temperature (including outliers) for July and for May. (1 mark)
CORE, FUR2 2008 VCAA 3
The arm spans (in cm) were also recorded for each of the Years 6, 8 and 10 girls in the larger survey. The results are summarised in the three parallel box plots displayed below.
- Complete the following sentence.
The middle 50% of Year 6 students have an arm span between _______ and _______ cm. (1 mark)
- The three parallel box plots suggest that arm span and year level are associated.
Explain why. (1 mark)
- The arm span of 110 cm of a Year 10 girl is shown as an outlier on the box plot. This value is an error. Her real arm span is 140 cm. If the error is corrected, would this girl’s arm span still show as an outlier on the box plot? Give reasons for your answer showing an appropriate calculation. (2 marks)
CORE, FUR2 2011 VCAA 1
The stemplot in Figure 1 shows the distribution of the average age, in years, at which women first marry in 17 countries.
- For these countries, determine
- the lowest average age of women at first marriage (1 mark)
- the median average age of women at first marriage (1 mark)
The stemplot in Figure 2 shows the distribution of the average age, in years, at which men first marry in 17 countries.
- For these countries, determine the interquartile range (IQR) for the average age of men at first marriage. (1 mark)
- If the data values displayed in Figure 2 were used to construct a boxplot with outliers, then the country for which the average age of men at first marriage is 26.0 years would be shown as an outlier.
Explain why is this so. Show an appropriate calculation to support your explanation. (2 marks)
CORE, FUR1 2015 VCAA 6-7 MC
The following information relates to Parts 1 and 2.
In New Zealand, rivers flow into either the Pacific Ocean (the Pacific rivers) or the Tasman Sea (the Tasman rivers).
The boxplots below can be used to compare the distribution of the lengths of the Pacific rivers and the Tasman rivers.
Part 1
The five-number summary for the lengths of the Tasman rivers is closest to
- `32, 48, 64, 76, 108`
- `32, 48, 64, 76, 180`
- `32, 48, 64, 76, 322`
- `48, 64, 97, 169, 180`
- `48, 64, 97, 169, 322`
Part 2
Which one of the following statements is not true?
- The lengths of two of the Tasman rivers are outliers.
- The median length of the Pacific rivers is greater than the length of more than 75% of the Tasman rivers.
- The Pacific rivers are more variable in length than the Tasman rivers.
- More than half of the Pacific rivers are less than 100 km in length.
- More than half of the Tasman rivers are greater than 60 km in length.