The boxplot below displays the distribution of all gold medal-winning heights for the women's high jump, \(\textit{Wgold}\), in metres, for the 19 Olympic Games held from 1948 to 2020. --- 2 WORK AREA LINES (style=lined) --- --- 1 WORK AREA LINES (style=lined) --- --- 3 WORK AREA LINES (style=lined) --- --- 2 WORK AREA LINES (style=lined) ---
Data Analysis, GEN2 2024 VCAA 1
Table 1 lists the Olympic year, \(\textit{year}\), and the gold medal-winning height for the men's high jump, \(\textit{Mgold}\), in metres, for each Olympic Games held from 1928 to 2020. No Olympic Games were held in 1940 or 1944, and the 2020 Olympic Games were held in 2021. Table 1 \begin{array}{|c|c|} --- 1 WORK AREA LINES (style=lined) --- --- 2 WORK AREA LINES (style=lined) --- --- 3 WORK AREA LINES (style=lined) --- --- 0 WORK AREA LINES (style=lined) --- --- 0 WORK AREA LINES (style=lined) --- --- 4 WORK AREA LINES (style=lined) ---
\hline \quad \textit{year} \quad & \textit{Mgold}(m) \\
\hline 1928 & 1.94 \\
\hline 1932 & 1.97 \\
\hline 1936 & 2.03 \\
\hline 1948 & 1.98 \\
\hline 1952 & 2.04 \\
\hline 1956 & 2.12 \\
\hline 1960 & 2.16 \\
\hline 1964 & 2.18 \\
\hline 1968 & 2.24 \\
\hline 1972 & 2.23 \\
\hline 1976 & 2.25 \\
\hline 1980 & 2.36 \\
\hline 1984 & 2.35 \\
\hline 1988 & 2.38 \\
\hline 1992 & 2.34 \\
\hline 1996 & 2.39 \\
\hline 2000 & 2.35 \\
\hline 2004 & 2.36 \\
\hline 2008 & 2.36 \\
\hline 2012 & 2.33 \\
\hline 2016 & 2.38 \\
\hline 2020 & 2.37 \\
\hline
\end{array}
Data Analysis, GEN1 2024 VCAA 5 MC
The number of siblings of each member of a class of 24 students was recorded.
The results are displayed in the table below.
\begin{array}{|c|c|c|c|c|c|c|c|c|c|c|c|}
\hline
\rule{0pt}{2.5ex} \quad 2 \quad \rule[-1ex]{0pt}{0pt} & \quad 1 \quad & \quad 3 \quad & \quad 2 \quad & \quad 1 \quad & \quad 1 \quad & \quad 1 \quad & \quad 4 \quad & \quad 1 \quad & \quad 1 \quad & \quad 1 \quad & \quad 1 \quad \\
\hline
\rule{0pt}{2.5ex} 1 \rule[-1ex]{0pt}{0pt} & 2 & 1 & 2 & 2 & 1 & 3 & 4 & 2 & 2 & 3 & 1 \\
\hline
\end{array}
A boxplot was constructed to display the spread of the data.
Which one of the following statements about this boxplot is correct?
- There are no outliers.
- The value of the interquartile range (IQR) is 1.5
- The value of the median is 1.5
- All of the five-number summary values are whole numbers.
CORE, FUR2 2021 VCAA 1
In the sport of heptathlon, athletes compete in seven events.
These events are the 100 m hurdles, high jump, shot-put, javelin, 200 m run, 800 m run and long jump.
Fifteen female athletes competed to qualify for the heptathlon at the Olympic Games.
Their results for three of the heptathlon events – high jump, shot-put and javelin – are shown in Table 1.
- Write down the number of numerical variables in Table 1. (1 mark)
- Complete Table 2 below by calculating the mean height jumped for the high jump, in metres, by the 15 athletes. Write your answer in the space provided in the table. (1 mark)
- In shot-put, athletes throw a heavy spherical ball (a shot) as far as they can.
Athlete number six, Jamilia, threw the shot 14.50 m.
Calculate Jamilia's standardised score (`z`).
Round your answer to one decimal place. (1 mark) - In the qualifying competition, the heights jumped in the high jump are expected to be approximately normally distributed.
Chara's jump in this competition would give her a standardised score of `z = –1.0`
Use the 68–95–99.7% rule to calculate the percentage of athletes who would be expected to jump higher than Chara in the qualifying competition. (1 mark)
- The boxplot below was constructed to show the distribution of high jump heights for all 15 athletes in the qualifying competition.
Explain why the boxplot has no whisker at its upper end. (1 mark) - For the javelin qualifying competition (refer to Table 1), another boxplot is used to display the distribution of athlete's results.
An athlete whose result is displayed as an outlier at the upper end of the plot is considered to be a potential medal winner in the event.
What is the minimum distance that an athlete needs to throw the javelin to be considered a potential medal winner? (2 marks)
CORE, FUR2 2020 VCAA 2
The neck size, in centimetres, of 250 men was recorded and displayed in the dot plot below.
- Write down the modal neck size, in centimetres, for these 250 men. (1 mark)
- Assume that this sample of 250 men has been drawn at random from a population of men whose neck size is normally distributed with a mean of 38 cm and a standard deviation of 2.3 cm.
- i. How many of these 250 men are expected to have a neck size that is more than three standard deviations above or below the mean?
- Round your answer to the nearest whole number. (1 mark)
- ii. How many of these 250 men actually have a neck size that is more than three standard deviations above or below the mean? (1 mark)
- The five-number summary for this sample of neck sizes, in centimetres, is given below.
`qquad`
Use the five-number summary to construct a boxplot, showing any outliers if appropriate, on the grid below. (2 marks)
CORE, FUR1-NHT 2019 VCAA 4 MC
CORE, FUR1-NHT 2019 VCAA 1-2 MC
The histogram and boxplot shown below both display the distribution of the birth weight, in grams, of 200 babies.
Part 1
The shape of the distribution of the babies’ birth weight is best described as
- positively skewed with no outliers.
- negatively skewed with no outliers.
- approximately symmetric with no outliers.
- positively skewed with outliers.
- approximately symmetric with outliers.
Part 2
The number of babies with a birth weight between 3000 g and 3500 g is closest to
- 30
- 32
- 37
- 74
- 80
CORE, FUR2 2018 VCAA 1
The data in Table 1 relates to the impact of traffic congestion in 2016 on travel times in 23 cities in the United Kingdom (UK).
The four variables in this data set are:
- city — name of city
- congestion level — traffic congestion level (high, medium, low)
- size — size of city (large, small)
- increase in travel time — increase in travel time due to traffic congestion (minutes per day).
- How many variables in this data set are categorical variables? (1 mark)
- How many variables in this data set are ordinal variables (1 mark)
- Name the large UK cities with a medium level of traffic congestion. (1 mark)
- Use the data in Table 1 to complete the following two-way frequency table, Table 2. (2 marks)
- What percentage of the small cities have a high level of traffic congestion? (1 mark)
Traffic congestion can lead to an increase in travel times in cities. The dot plot and boxplot below both show the increase in travel time due to traffic congestion, in minutes per day, for the 23 UK cities.
- Describe the shape of the distribution of the increase in travel time for the 23 cities. (1 mark)
- The data value 52 is below the upper fence and is not an outlier.
Determine the value of the upper fence. (1 mark)
CORE, FUR1 2017 VCAA 1-3 MC
The boxplot below shows the distribution of the forearm circumference, in centimetres, of 252 people.
Part 1
The percentage of these 252 people with a forearm circumference of less than 30 cm is closest to
- `text(15%)`
- `text(25%)`
- `text(50%)`
- `text(75%)`
- `text(100%)`
Part 2
The five-number summary for the forearm circumference of these 252 people is closest to
- `\ \ \ 21,\ 27.4,\ 28.7,\ 30,\ 34`
- `\ \ \ 21,\ 27.4,\ 28.7,\ 30,\ 35.9`
- `24.5,\ 27.4,\ 28.7,\ 30,\ 34`
- `24.5,\ 27.4,\ 28.7,\ 30,\ 35.9`
- `24.5,\ 27.4,\ 28.7,\ 30,\ 36`
Part 3
The table below shows the forearm circumference, in centimetres, of a sample of 10 people selected from this group of 252 people.
The mean, `barx`, and the standard deviation, `s_x`, of the forearm circumference for this sample of people are closest to
- `barx = 1.58qquads_x = 27.8`
- `barx = 1.66qquads_x = 27.8`
- `barx = 27.8qquads_x = 1.58`
- `barx = 27.8qquads_x = 1.66`
- `barx = 27.8qquads_x = 2.30`
CORE, FUR2 2016 VCAA 2
A weather station records daily maximum temperatures
- The five-number summary for the distribution of maximum temperatures for the month of February is displayed in the table below.
- The boxplots below display the distribution of maximum daily temperature for the months of May and July.
- Describe the shapes of the distributions of daily temperature (including outliers) for July and for May. (1 mark)
- Determine the value of the upper fence for the July boxplot. (1 mark)
- Using the information from the boxplots, explain why the maximum daily temperature is associated with the month of the year. Quote the values of appropriate statistics in your response. (1 mark)
- Describe the shapes of the distributions of daily temperature (including outliers) for July and for May. (1 mark)
CORE, FUR1 2015 VCAA 8 MC
CORE, FUR1 2009 VCAA 4-6 MC
The percentage histogram below shows the distribution of the fertility rates (in average births per woman) for 173 countries in 1975.
Part 1
In 1975, the percentage of these 173 countries with fertility rates of 4.5 or greater was closest to
A. `12text(%)`
B. `35text(%)`
C. `47text(%)`
D. `53text(%)`
E. `65text(%)`
Part 2
In 1975, for these 173 countries, fertility rates were most frequently
A. less than 2.5
B. between 1.5 and 2.5
C. between 2.5 and 4.5
D. between 6.5 and 7.5
E. greater than 7.5
Part 3
Which one of the boxplots below could best be used to represent the same fertility rate data as displayed in the percentage histogram?
CORE, FUR1 2008 VCAA 1-4 MC
The box plot below shows the distribution of the time, in seconds, that 79 customers spent moving along a particular aisle in a large supermarket.
Part 1
The longest time, in seconds, spent moving along this aisle is closest to
A. `40`
B. `60`
C. `190`
D. `450`
E. `500`
Part 2
The shape of the distribution is best described as
A. symmetric.
B. negatively skewed.
C. negatively skewed with outliers.
D. positively skewed.
E. positively skewed with outliers.
Part 3
The number of customers who spent more than 90 seconds moving along this aisle is closest to
A. `7`
B. `20`
C. `26`
D. `75`
E. `79`
Part 4
From the box plot, it can be concluded that the median time spent moving along the supermarket aisle is
A. less than the mean time.
B. equal to the mean time.
C. greater than the mean time
D. half of the interquartile range.
E. one quarter of the range.
CORE, FUR1 2014 VCAA 6 MC
CORE, FUR1 2012 VCAA 5 MC
The temperature of a room is measured at hourly intervals throughout the day.
The most appropriate graph to show how the temperature changes from one hour to the next is a
A. boxplot.
B. stem plot.
C. histogram.
D. time series plot.
E. two-way frequency table.
CORE, FUR1 2010 VCAA 1-3 MC
To test the temperature control on an oven, the control is set to 180°C and the oven is heated for 15 minutes.
The temperature of the oven is then measured. Three hundred ovens were tested in this way. Their temperatures were recorded and are displayed below using both a histogram and a boxplot.
Part 1
A total of 300 ovens were tested and their temperatures were recorded.
The number of these temperatures that lie between 179°C and 181°C is closest to
A. `40`
B. `50`
C. `70`
D. `110`
E. `150`
Part 2
The interquartile range for temperature is closest to
A. `1.3°text(C)`
B. `1.5°text(C)`
C. `2.0°text(C)`
D. `2.7°text(C)`
E. `4.0°text(C)`
Part 3
Using the 68–95–99.7% rule, the standard deviation for temperature is closest to
A. `1°text(C)`
B. `2°text(C)`
C. `3°text(C)`
D. `4°text(C)`
E. `6°text(C)`