Helping math teachers bring statistics to life

Stats Medic.png

What is a Sampling Distribution? Day 1

Chapter 7 - day 1 - lesson 7.1, all chapters, learning targets.

Distinguish between a parameter and a statistic.

Create a sampling distribution using all possible samples from a small population.

Distinguish among the distribution of a population, the distribution of a sample, and the sampling distribution of a statistic.

Activity: What was the average for the Chapter 6 Test?

jpg.jpg

Answer Key:

In this Activity, students will be trying to estimate the mean test score for a population using a the mean calculated from a sample. We start with a very simple and unrealistic population of 4 students. We do this to help students build the idea that a sampling distribution contains allof the possible samples from the population (easy to do with such a small population). Tomorrow we will be more realistic and look at the actual population of all AP Stats students.

Where are we headed?

Noti ce the organization of this Chapter.

Section 7.1 is an introduction to sampling distributions, which includes sampling distributions for proportions and sampling distributions for means. Actually it includes sampling distributions for any statistic.

Section 7.2, we investigate the shape, center, and variability of the sampling distribution of a sample proportion.

Section 7.3, we investigate the shape, center, and variability of the sampling distribution of a sample mean.

The Activity uses a sampling distribution for a sample mean. The Check Your Understanding problem uses a sampling distribution for a sample proportion.

Have I seen this before?

This is not our students first experience with sampling distributions. We have intentionally given them previous experiences in preparation for today’s lesson. In Chapter 4, we took samples of  5 words from from Beyonce’s Crazy in Love  in order to estimate the mean word length. We also took  samples of Justin Timberlake fans  to find the mean enjoyment level. Hopefully you made dotplot posters for these activities and you can refer back to them in this Chapter.

Notation matters.

Starting right now, we are going to be crazy about using the correct notation. Notation is wonderful because we can show several ideas at once (is this value from a sample or a population?, is this value a mean or a proportion?).

homework section 7 2 stats

Population distribution, distribution of a sample, or a sampling distribution?

All three of these distributions can be represented with a dotplot in the Activity. In a population distribution (#1), each dot represents one individual from the population (and we have a dot for every individual). In a distribution of a sample, each dot represents one individual from the population (but we don’t have every individual…only a sample of 2). In a sampling distribution (#4), each dot represents a sample from the population and a mean calculated from that sample.The common error that students make is to use the term “sample distribution” when they mean “sampling distribution”. A sample distribution is the distribution of values for one sample. A sampling distribution represents many, many samples.

Logo for LOUIS Pressbooks: Open Educational Resources from the Louisiana Library Network

Chapter 2: Descriptive Statistics

Chapter 2 Homework

Homework from 2.1.

Student grades on a chemistry exam were: 77, 78, 76, 81, 86, 51, 79, 82, 84, 99

  • Construct a stem-and-leaf plot of the data.
  • Are there any potential outliers? If so, which scores are they? Why do you consider them outliers?

[link] contains the 2010 obesity rates in U.S. states and Washington, DC.

  • Use a random number generator to randomly pick eight states. Construct a bar graph of the obesity rates of those eight states.
  • Construct a bar graph for all the states beginning with the letter “A.”
  • Construct a bar graph for all the states beginning with the letter “M.”
  • Number the entries in the table 1–51 (Includes Washington, DC; Numbered vertically)
  • Arrow over to PRB
  • Press 5:randInt(
  • Enter 51,1,8)

Eight numbers are generated (use the right arrow key to scroll through the numbers). The numbers correspond to the numbered states (for this example: {47 21 9 23 51 13 25 4}. If any numbers are repeated, generate a different number by using 5:randInt(51,1)). Here, the states (and Washington DC) are {Arkansas, Washington DC, Idaho, Maryland, Michigan, Mississippi, Virginia, Wyoming}.

Corresponding percentages are {30.1, 22.2, 26.5, 27.1, 30.9, 34.0, 26.0, 25.1}.

A bar graph showing 8 states on the x-axis and corresponding obesity rates on the y-axis.

Homework from 2.2

Suppose that three book publishers were interested in the number of fiction paperbacks adult consumers purchase per month. Each publisher conducted a survey. In the survey, adult consumers were asked the number of fiction paperbacks they had purchased the previous month. The results are as follows:

  • Find the relative frequencies for each survey. Write them in the charts.
  • Using either a graphing calculator, computer, or by hand, use the frequency column to construct a histogram for each publisher’s survey. For Publishers A and B, make bar widths of one. For Publisher C, make bar widths of two.
  • In complete sentences, give two reasons why the graphs for Publishers A and B are not identical.
  • Would you have expected the graph for Publisher C to look like the other two graphs? Why or why not?
  • Make new histograms for Publisher A and Publisher B. This time, make bar widths of two.
  • Now, compare the graph for Publisher C to the new graphs for Publishers A and B. Are the graphs more similar or more different? Explain your answer.

Often, cruise ships conduct all on-board transactions, with the exception of gambling, on a cashless basis. At the end of the cruise, guests pay one bill that covers all onboard transactions. Suppose that 60 single travelers and 70 couples were surveyed as to their on-board bills for a seven-day cruise from Los Angeles to the Mexican Riviera. Following is a summary of the bills for each group.

  • Fill in the relative frequency for each group.
  • Construct a histogram for the singles group. Scale the x -axis by 💲50 widths. Use relative frequency on the y -axis.
  • Construct a histogram for the couples group. Scale the x -axis by 💲50 widths. Use relative frequency on the y -axis.
  • List two similarities between the graphs.
  • List two differences between the graphs.
  • Overall, are the graphs more similar or different?
  • Construct a new graph for the couples by hand. Since each couple is paying for two individuals, instead of scaling the x -axis by 💲50, scale it by 💲100. Use relative frequency on the y -axis.
  • How did scaling the couples graph differently change the way you compared it to the singles graph?
  • Based on the graphs, do you think that individuals spend the same amount, more or less, as singles as they do person by person as a couple? Explain why in one or two complete sentences.
  • See [link] and [link] .

This is a histogram that matches the supplied data supplied for singles. The x-axis shows the total charges in intervals of 50 from 50 to 350, and the y-axis shows the relative frequency in increments of 0.05 from 0 to 0.3.

  • Both graphs have a single peak.
  • Both graphs use class intervals with width equal to 💲50.
  • The couples graph has a class interval with no values.
  • It takes almost twice as many class intervals to display the data for couples.
  • Answers may vary. Possible answers include: The graphs are more similar than different because the overall patterns for the graphs are the same.
  • Check student’s solution.
  • Both graphs display 6 class intervals.
  • Both graphs show the same general pattern.
  • Answers may vary. Possible answers include: Although the width of the class intervals for couples is double that of the class intervals for singles, the graphs are more similar than they are different.
  • Answers may vary. Possible answers include: You are able to compare the graphs interval by interval. It is easier to compare the overall patterns with the new scale on the Couples graph. Because a couple represents two individuals, the new scale leads to a more accurate comparison.
  • Answers may vary. Possible answers include: Based on the histograms, it seems that spending does not vary much from singles to individuals who are part of a couple. The overall patterns are the same. The range of spending for couples is approximately double the range for individuals.

Twenty-five randomly selected students were asked the number of movies they watched the previous week. The results are as follows.

  • Construct a histogram of the data.
  • Complete the columns of the chart.

Use the following information to answer the next two exercises: Suppose one hundred eleven people who shopped in a special T-shirt store were asked the number of T-shirts they own costing more than 💲19 each.

A histogram showing the results of a survey. Of 111 respondents, 5 own 1 t-shirt costing more than 💲19, 17 own 2, 23 own 3, 39 own 4, 25 own 5, 2 own 6, and no respondents own 7.

The percentage of people who own at most three T-shirts costing more than 💲19 each is approximately:

  • Cannot be determined

If the data were collected by asking the first 111 people who entered the store, then the type of sampling is:

  • simple random
  • convenience

Following are the 2010 obesity rates by U.S. states and Washington, DC.

Construct a bar graph of obesity rates of your state and the four states closest to your state. Hint: Label the x -axis with the states.

Answers will vary.

Homework from 2.3

The median age for U.S. blacks currently is 30.9 years; for U.S. whites it is 42.3 years.

Six hundred adult Americans were asked by telephone poll, “What do you think constitutes a middle-class income?” The results are in [link] . Also, include left endpoint, but not the right endpoint.

  • What percentage of the survey answered “not sure”?
  • What percentage think that middle-class is from 💲25,000 to 💲50,000?
  • Should all bars have the same width, based on the data? Why or why not?
  • How should the <20,000 and the 100,000+ intervals be handled? Why?
  • Find the 40 th and 80 th percentiles
  • Construct a bar graph of the data
  • 1 – (0.02+0.09+0.19+0.26+0.18+0.17+0.02+0.01) = 0.06
  • 0.19+0.26+0.18 = 0.63

40 th percentile will fall between 30,000 and 40,000

80 th percentile will fall between 50,000 and 75,000

Given the following box plot:

This is a horizontal box plot graphed over a number line from 0 to 13. The first whisker extends from the smallest value, 0, to the first quartile, 2. The box begins at the first quartile and extends to the third quartile, 12. A vertical, dashed line is drawn at median, 10. The second whisker extends from the third quartile to largest value, 13.

  • which quarter has the smallest spread of data? What is that spread?
  • which quarter has the largest spread of data? What is that spread?
  • find the interquartile range ( IQR ).
  • are there more data in the interval 5–10 or in the interval 10–13? How do you know this?
  • need more information

The following box plot shows the U.S. population for 1990, the latest available year.

A box plot with values from 0 to 105, with Q1 at 17, M at 33, and Q3 at 50.

  • Are there fewer or more children (age 17 and under) than senior citizens (age 65 and over)? How do you know?
  • 12.6% are age 65 and over. Approximately what percentage of the population are working age adults (above age 17 to age 65)?
  • more children; the left whisker shows that 25% of the population are children 17 and younger. The right whisker shows that 25% of the population are adults 50 and older, so adults 65 and over represent less than 25%.

Homework from 2.4

In a survey of 20-year-olds in China, Germany, and the United States, people were asked the number of foreign countries they had visited in their lifetime. The following box plots display the results.

This shows three box plots graphed over a number line from 0 to 11. The box plots match the supplied data, and compare the countries' results. The China box plot has a single whisker from 0 to 5. The Germany box plot's median is equal to the third quartile, so there is a dashed line at right edge of box. The America box plot does not have a left whisker.

  • In complete sentences, describe what the shape of each box plot implies about the distribution of the data collected.
  • Have more Americans or more Germans surveyed been to over eight foreign countries?
  • Compare the three box plots. What do they imply about the foreign travel of 20-year-old residents of the three countries when compared to each other?

Given the following box plot, answer the questions.

This is a box plot graphed over a number line from 0 to 150. There is no first, or left, whisker. The box starts at the first quartile, 0, and ends at the third quartile, 80. A vertical, dashed line marks the median, 20. The second whisker extends the third quartile to the largest value, 150.

  • Think of an example (in words) where the data might fit into the above box plot. In 2–5 sentences, write down the example.
  • What does it mean to have the first and second quartiles so close together, while the second to third quartiles are far apart?
  • Answers will vary. Possible answer: State University conducted a survey to see how involved its students are in community service. The box plot shows the number of community service hours logged by participants over the past year.
  • Because the first and second quartiles are close, the data in this quarter is very similar. There is not much variation in the values. The data in the third quarter is much more variable or spread out. This is clear because the second quartile is so far away from the third quartile.

Given the following box plots, answer the questions.

This shows two box plots graphed over number lines from 0 to 7. The first whisker in the data 1 box plot extends from 0 to 2. The box begins at the first quartile, 2, and ends at the third quartile, 5. A vertical, dashed line marks the median at 4. The second whisker extends from the third quartile to the largest value, 7. The first whisker in the data 2 box plot extends from 0 to 1.3. The box begins at the first quartile, 1.3, and ends at the third quartile, 2.5. A vertical, dashed line marks the medial at 2. The second whisker extends from the third quartile to the largest value, 7.

  • Data 1 has more data values above two than Data 2 has above two.
  • The data sets cannot have the same mode.
  • For Data 1 , there are more data values below four than there are above four.
  • For which group, Data 1 or Data 2, is the value of “7” more likely to be an outlier? Explain why in complete sentences.

A survey was conducted of 130 purchasers of new BMW 3 series cars, 130 purchasers of new BMW 5 series cars, and 130 purchasers of new BMW 7 series cars. In it, people were asked the age they were when they purchased their car. The following box plots display the results.

This shows three box plots graphed over a number line from 25 to 80. The first whisker on the BMW 3 plot extends from 25 to 30. The box begins at the first quartile, 30 and ends at the third quartile, 41. A vertical, dashed line marks the median at 34. The second whisker extends from the third quartile to 66. The first whisker on the BMW 5 plot extends from 31 to 40. The box begins at the first quartile, 40, and ends at the third quartile, 55. A vertical, dashed line marks the median at 41. The second whisker extends from 55 to 64. The first whisker on the BMW 7 plot extends from 35 to 41. The box begins at the first quartile, 41, and ends at the third quartile, 59. A vertical, dashed line marks the median at 46. The second whisker extends from 59 to 68.

  • In complete sentences, describe what the shape of each box plot implies about the distribution of the data collected for that car series.
  • Which group is most likely to have an outlier? Explain how you determined that.
  • Compare the three box plots. What do they imply about the age of purchasing a BMW from the series when compared to each other?
  • Look at the BMW 5 series. Which quarter has the smallest spread of data? What is the spread?
  • Look at the BMW 5 series. Which quarter has the largest spread of data? What is the spread?
  • Look at the BMW 5 series. Estimate the interquartile range (IQR).
  • Look at the BMW 5 series. Are there more data in the interval 31 to 38 or in the interval 45 to 55? How do you know this?
  • Each box plot is spread out more in the greater values. Each plot is skewed to the right, so the ages of the top 50% of buyers are more variable than the ages of the lower 50%.
  • The BMW 3 series is most likely to have an outlier. It has the longest whisker.
  • Comparing the median ages, younger people tend to buy the BMW 3 series, while older people tend to buy the BMW 7 series. However, this is not a rule, because there is so much variability in each data set.
  • The second quarter has the smallest spread. There seems to be only a three-year difference between the first quartile and the median.
  • The third quarter has the largest spread. There seems to be approximately a 14-year difference between the median and the third quartile.
  • IQR ~ 17 years
  • There is not enough information to tell. Each interval lies within a quarter, so we cannot tell exactly where the data in that quarter is concentrated.
  • The interval from 31 to 35 years has the fewest data values. Twenty-five percent of the values fall in the interval 38 to 41, and 25% fall between 41 and 64. Since 25% of values fall between 31 and 38, we know that fewer than 25% fall between 31 and 35.

Twenty-five randomly selected students were asked the number of movies they watched the previous week. The results are as follows:

Construct a box plot of the data.

Homework from 2.5

The most obese countries in the world have obesity rates that range from 11.4% to 74.6%. This data is summarized in the following table.

  • What is the best estimate of the average obesity percentage for these countries?
  • The United States has an average obesity rate of 33.9%. Is this rate above average or below?
  • How does the United States compare to other countries?

[link] gives the percent of children under five considered to be underweight. What is the best estimate for the mean percentage of underweight children?

The mean percentage, [latex]\overline{x}=\frac{1328.65}{50}=26.75[/latex]

Homework from 2.6

The median age of the U.S. population in 1980 was 30.0 years. In 1991, the median age was 33.1 years.

  • What does it mean for the median age to rise?
  • Give two reasons why the median age could rise.
  • For the median age to rise, is the actual number of children less in 1991 than it was in 1980? Why or why not?

Homework from 2.7

Use the following information to answer the next nine exercises: The population parameters below describe the full-time equivalent number of students (FTES) each year at Lake Tahoe Community College from 1976–1977 through 2004–2005.

  • μ = 1000 FTES
  • median = 1,014 FTES
  • σ = 474 FTES
  • first quartile = 528.5 FTES
  • third quartile = 1,447.5 FTES
  • n = 29 years

A sample of 11 years is taken. About how many are expected to have a FTES of 1014 or above? Explain how you determined your answer.

The median value is the middle value in the ordered list of data values. The median value of a set of 11 will be the 6th number in order. Six years will have totals at or below the median.

75% of all years have an FTES:

The population standard deviation = _____

What percentage of the FTES was from 528.5 to 1447.5? How do you know?

What is the IQR ? What does the IQR represent?

How many standard deviations away from the mean is the median?

Additional Information: The population FTES for 2005–2006 through 2010–2011 was given in an updated report. The data are reported here.

Calculate the mean, median, standard deviation, the first quartile, the third quartile and the IQR . Round to one decimal place.

  • mean = 1,809.3
  • median = 1,812.5
  • standard deviation = 151.2
  • first quartile = 1,690
  • third quartile = 1,935

Construct a box plot for the FTES for 2005–2006 through 2010–2011 and a box plot for the FTES for 1976–1977 through 2004–2005.

Compare the IQR for the FTES for 1976–77 through 2004–2005 with the IQR for the FTES for 2005-2006 through 2010–2011. Why do you suppose the IQR s are so different?

Hint: Think about the number of years covered by each time period and what happened to higher education during those periods.

Three students were applying to the same graduate school. They came from schools with different grading systems. Which student had the best GPA when compared to other students at his school? Explain how you determined your answer.

A music school has budgeted to purchase three musical instruments. They plan to purchase a piano costing 💲3,000, a guitar costing 💲550, and a drum set costing 💲600. The mean cost for a piano is 💲4,000 with a standard deviation of 💲2,500. The mean cost for a guitar is 💲500 with a standard deviation of 💲200. The mean cost for drums is 💲700 with a standard deviation of 💲100. Which cost is the lowest, when compared to other instruments of the same type? Which cost is the highest when compared to other instruments of the same type. Justify your answer.

For pianos, the cost of the piano is 0.4 standard deviations BELOW the mean. For guitars, the cost of the guitar is 0.25 standard deviations ABOVE the mean. For drums, the cost of the drum set is 1.0 standard deviations BELOW the mean. Of the three, the drums cost the lowest in comparison to the cost of other instruments of the same type. The guitar costs the most in comparison to the cost of other instruments of the same type.

An elementary school class ran one mile with a mean of 11 minutes and a standard deviation of three minutes. Rachel, a student in the class, ran one mile in eight minutes. A junior high school class ran one mile with a mean of nine minutes and a standard deviation of two minutes. Kenji, a student in the class, ran 1 mile in 8.5 minutes. A high school class ran one mile with a mean of seven minutes and a standard deviation of four minutes. Nedda, a student in the class, ran one mile in eight minutes.

  • Why is Kenji considered a better runner than Nedda, even though Nedda ran faster than him?
  • Who is the fastest runner with respect to his or her class? Explain why.

The most obese countries in the world have obesity rates that range from 11.4% to 74.6%. This data is summarized in Table 14 .

What is the best estimate of the average obesity percentage for these countries? What is the standard deviation for the listed obesity rates? The United States has an average obesity rate of 33.9%. Is this rate above average or below? How “unusual” is the United States’ obesity rate compared to the average rate? Explain.

  • [latex]\overline{x}=23.32[/latex]
  • Using the TI 83/84, we obtain a standard deviation of: [latex]{s}_{x}=12.95.[/latex]
  • The obesity rate of the United States is 10.58% higher than the average obesity rate.
  • Since the standard deviation is 12.95, we see that 23.32 + 12.95 = 36.27 is the obesity percentage that is one standard deviation from the mean. The United States obesity rate is slightly less than one standard deviation from the mean. Therefore, we can assume that the United States, while 34% obese, does not have an unusually high percentage of obese people.

[link] gives the percent of children under five considered to be underweight.

What is the best estimate for the mean percentage of underweight children? What is the standard deviation? Which interval(s) could be considered unusual? Explain.

used to describe data that is not symmetrical; when the right side of a graph looks “chopped off” compared to the left side, we say it is “skewed to the left.” When the left side of the graph looks “chopped off” compared to the right side, we say the data is “skewed to the right.” Alternatively: when the lower values of the data are more spread out, we say the data are skewed to the left. When the greater values are more spread out, the data are skewed to the right.

Introductory Statistics Copyright © 2024 by LOUIS: The Louisiana Library Network is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License , except where otherwise noted.

Share This Book

1.1 Definitions of Statistics, Probability, and Key Terms

For each of the following eight exercises, identify: a. the population, b. the sample, c. the parameter, d. the statistic, e. the variable, and f. the data. Give examples where appropriate.

A fitness center is interested in the mean amount of time a client exercises in the center each week.

Ski resorts are interested in the mean age that children take their first ski and snowboard lessons. They need this information to plan their ski classes optimally.

A cardiologist is interested in the mean recovery period of her patients who have had heart attacks.

Insurance companies are interested in the mean health costs each year of their clients, so that they can determine the costs of health insurance.

A politician is interested in the proportion of voters in his district who think he is doing a good job.

A marriage counselor is interested in the proportion of clients she counsels who stay married.

Political pollsters may be interested in the proportion of people who will vote for a particular cause.

A marketing company is interested in the proportion of people who will buy a particular product.

Use the following information to answer the next three exercises: A Lake Tahoe Community College instructor is interested in the mean number of days Lake Tahoe Community College math students are absent from class during a quarter.

What is the population she is interested in?

  • all Lake Tahoe Community College students
  • all Lake Tahoe Community College English students
  • all Lake Tahoe Community College students in her classes
  • all Lake Tahoe Community College math students

Consider the following:

X X = number of days a Lake Tahoe Community College math student is absent

In this case, X is an example of a:

  • population.

The instructor’s sample produces a mean number of days absent of 3.5 days. This value is an example of a:

1.2 Data, Sampling, and Variation in Data and Sampling

For the following exercises, identify the type of data that would be used to describe a response (quantitative discrete, quantitative continuous, or qualitative), and give an example of the data.

number of tickets sold to a concert

percent of body fat

favorite baseball team

time in line to buy groceries

number of students enrolled at Evergreen Valley College

most-watched television show

brand of toothpaste

distance to the closest movie theatre

age of executives in Fortune 500 companies

number of competing computer spreadsheet software packages

Use the following information to answer the next two exercises: A study was done to determine the age, number of times per week, and the duration (amount of time) of resident use of a local park in San Jose. The first house in the neighborhood around the park was selected randomly and then every 8th house in the neighborhood around the park was interviewed.

“Number of times per week” is what type of data?

  • qualitative
  • quantitative discrete
  • quantitative continuous

“Duration (amount of time)” is what type of data?

Airline companies are interested in the consistency of the number of babies on each flight, so that they have adequate safety equipment. Suppose an airline conducts a survey. Over Thanksgiving weekend, it surveys six flights from Boston to Salt Lake City to determine the number of babies on the flights. It determines the amount of safety equipment needed by the result of that study.

  • Using complete sentences, list three things wrong with the way the survey was conducted.
  • Using complete sentences, list three ways that you would improve the survey if it were to be repeated.

Suppose you want to determine the mean number of students per statistics class in your state. Describe a possible sampling method in three to five complete sentences. Make the description detailed.

Suppose you want to determine the mean number of cans of soda drunk each month by students in their twenties at your school. Describe a possible sampling method in three to five complete sentences. Make the description detailed.

List some practical difficulties involved in getting accurate results from a telephone survey.

List some practical difficulties involved in getting accurate results from a mailed survey.

With your classmates, brainstorm some ways you could overcome these problems if you needed to conduct a phone or mail survey.

The instructor takes her sample by gathering data on five randomly selected students from each Lake Tahoe Community College math class. The type of sampling she used is

  • cluster sampling
  • stratified sampling
  • simple random sampling
  • convenience sampling

A study was done to determine the age, number of times per week, and the duration (amount of time) of residents using a local park in San Jose. The first house in the neighborhood around the park was selected randomly and then every eighth house in the neighborhood around the park was interviewed. The sampling method was:

  • simple random

Name the sampling method used in each of the following situations:

  • A woman in the airport is handing out questionnaires to travelers asking them to evaluate the airport’s service. She does not ask travelers who are hurrying through the airport with their hands full of luggage, but instead asks all travelers who are sitting near gates and not taking naps while they wait.
  • A teacher wants to know if her students are doing homework, so she randomly selects rows two and five and then calls on all students in row two and all students in row five to present the solutions to homework problems to the class.
  • The marketing manager for an electronics chain store wants information about the ages of its customers. Over the next two weeks, at each store location, 100 randomly selected customers are given questionnaires to fill out asking for information about age, as well as about other variables of interest.
  • The librarian at a public library wants to determine what proportion of the library users are children. The librarian has a tally sheet on which she marks whether books are checked out by an adult or a child. She records this data for every fourth patron who checks out books.
  • A political party wants to know the reaction of voters to a debate between the candidates. The day after the debate, the party’s polling staff calls 1,200 randomly selected phone numbers. If a registered voter answers the phone or is available to come to the phone, that registered voter is asked whom he or she intends to vote for and whether the debate changed his or her opinion of the candidates.

A “random survey” was conducted of 3,274 people of the “microprocessor generation” (people born since 1971, the year the microprocessor was invented). It was reported that 48% of those individuals surveyed stated that if they had $2,000 to spend, they would use it for computer equipment. Also, 66% of those surveyed considered themselves relatively savvy computer users.

  • Do you consider the sample size large enough for a study of this type? Why or why not?
  • Based on your “gut feeling,” do you believe the percents accurately reflect the U.S. population for those individuals born since 1971? If not, do you think the percents of the population are actually higher or lower than the sample statistics? Why? Additional information: The survey, reported by Intel Corporation, was filled out by individuals who visited the Los Angeles Convention Center to see the Smithsonian Institute's road show called “America’s Smithsonian.”
  • With this additional information, do you feel that all demographic and ethnic groups were equally represented at the event? Why or why not?
  • With the additional information, comment on how accurately you think the sample statistics reflect the population parameters.

The Well-Being Index is a survey that follows trends of U.S. residents on a regular basis. There are six areas of health and wellness covered in the survey: Life Evaluation, Emotional Health, Physical Health, Healthy Behavior, Work Environment, and Basic Access. Some of the questions used to measure the Index are listed below.

Identify the type of data obtained from each question used in this survey: qualitative, quantitative discrete, or quantitative continuous.

  • Do you have any health problems that prevent you from doing any of the things people your age can normally do?
  • During the past 30 days, for about how many days did poor health keep you from doing your usual activities?
  • In the last seven days, on how many days did you exercise for 30 minutes or more?
  • Do you have health insurance coverage?

In advance of the 1936 Presidential Election, a magazine titled Literary Digest released the results of an opinion poll predicting that the republican candidate Alf Landon would win by a large margin. The magazine sent post cards to approximately 10,000,000 prospective voters. These prospective voters were selected from the subscription list of the magazine, from automobile registration lists, from phone lists, and from club membership lists. Approximately 2,300,000 people returned the postcards.

  • Think about the state of the United States in 1936. Explain why a sample chosen from magazine subscription lists, automobile registration lists, phone books, and club membership lists was not representative of the population of the United States at that time.
  • What effect does the low response rate have on the reliability of the sample?
  • Are these problems examples of sampling error or nonsampling error?
  • During the same year, George Gallup conducted his own poll of 30,000 prospective voters. These researchers used a method they called "quota sampling" to obtain survey answers from specific subsets of the population. Quota sampling is an example of which sampling method described in this module?

Crime-related and demographic statistics for 47 US states in 1960 were collected from government agencies, including the FBI's Uniform Crime Report . One analysis of this data found a strong connection between education and crime indicating that higher levels of education in a community correspond to higher crime rates.

Which of the potential problems with samples discussed in 1.2 Data, Sampling, and Variation in Data and Sampling could explain this connection?

YouPolls is a website that allows anyone to create and respond to polls. One question posted April 15 asks:

“Do you feel happy paying your taxes when members of the Obama administration are allowed to ignore their tax liabilities?” (lastbaldeagle. 2013. On Tax Day, House to Call for Firing Federal Workers Who Owe Back Taxes. Opinion poll posted online at: http://www.youpolls.com/details.aspx?id=12328 (accessed May 1, 2013).)

As of April 25, 11 people responded to this question. Each participant answered “NO!”

Which of the potential problems with samples discussed in this module could explain this connection?

A scholarly article about response rates begins with the following quote:

“Declining contact and cooperation rates in random digit dial (RDD) national telephone surveys raise serious concerns about the validity of estimates drawn from such research.”(Scott Keeter et al., “Gauging the Impact of Growing Nonresponse on Estimates from a National RDD Telephone Survey,” Public Opinion Quarterly 70 no. 5 (2006), http://poq.oxfordjournals.org/content/70/5/759.full (accessed May 1, 2013).)

The Pew Research Center for People and the Press admits:

“The percentage of people we interview – out of all we try to interview – has been declining over the past decade or more.” (Frequently Asked Questions, Pew Research Center for the People & the Press, http://www.people-press.org/methodology/frequently-asked-questions/#dont-you-have-trouble-getting-people-to-answer-your-polls (accessed May 1, 2013).)

  • What are some reasons for the decline in response rate over the past decade?
  • Explain why researchers are concerned with the impact of the declining response rate on public opinion polls.

1.3 Frequency, Frequency Tables, and Levels of Measurement

Fifty part-time students were asked how many courses they were taking this term. The (incomplete) results are shown below:

  • Fill in the blanks in Table 1.33 .
  • What percent of students take exactly two courses?
  • What percent of students take one or two courses?

Sixty adults with gum disease were asked the number of times per week they used to floss before their diagnosis. The (incomplete) results are shown in Table 1.34 .

  • Fill in the blanks in Table 1.34 .
  • What percent of adults flossed six times per week?
  • What percent flossed at most three times per week?

Nineteen immigrants to the U.S were asked how many years, to the nearest year, they have lived in the U.S. The data are as follows: 2 ; 5 ; 7 ; 2 ; 2 ; 10 ; 20 ; 15 ; 0 ; 7 ; 0 ; 20 ; 5 ; 12 ; 15 ; 12 ; 4 ; 5 ; 10 .

Table 1.35 was produced.

  • Fix the errors in Table 1.35 . Also, explain how someone might have arrived at the incorrect number(s).
  • Explain what is wrong with this statement: “47 percent of the people surveyed have lived in the U.S. for 5 years.”
  • Fix the statement in b to make it correct.
  • What fraction of the people surveyed have lived in the U.S. five or seven years?
  • What fraction of the people surveyed have lived in the U.S. at most 12 years?
  • What fraction of the people surveyed have lived in the U.S. fewer than 12 years?
  • What fraction of the people surveyed have lived in the U.S. from five to 20 years, inclusive?

How much time does it take to travel to work? Table 1.36 shows the mean commute time by state for workers at least 16 years old who are not working at home. Find the mean travel time, and round off the answer properly.

Forbes magazine published data on the best small firms in 2012. These were firms which had been publicly traded for at least a year, have a stock price of at least $5 per share, and have reported annual revenue between $5 million and $1 billion. Table 1.37 shows the ages of the chief executive officers for the first 60 ranked firms.

  • What is the frequency for CEO ages between 54 and 65?
  • What percentage of CEOs are 65 years or older?
  • What is the relative frequency of ages under 50?
  • What is the cumulative relative frequency for CEOs younger than 55?
  • Which graph shows the relative frequency and which shows the cumulative relative frequency?

Use the following information to answer the next two exercises: Table 1.38 contains data on hurricanes that have made direct hits on the U.S. Between 1851 and 2004. A hurricane is given a strength category rating based on the minimum wind speed generated by the storm.

What is the relative frequency of direct hits that were category 4 hurricanes?

  • Not enough information to calculate

What is the relative frequency of direct hits that were AT MOST a category 3 storm?

1.4 Experimental Design and Ethics

How does sleep deprivation affect your ability to drive? A recent study measured the effects on 19 professional drivers. Each driver participated in two experimental sessions: one after normal sleep and one after 27 hours of total sleep deprivation. The treatments were assigned in random order. In each session, performance was measured on a variety of tasks including a driving simulation.

Use key terms from this module to describe the design of this experiment.

An advertisement for Acme Investments displays the two graphs in Figure 1.14 to show the value of Acme’s product in comparison with the Other Guy’s product. Describe the potentially misleading visual effect of these comparison graphs. How can this be corrected?

The graph in Figure 1.15 shows the number of complaints for six different airlines as reported to the US Department of Transportation in February 2013. Alaska, Pinnacle, and Airtran Airlines have far fewer complaints reported than American, Delta, and United. Can we conclude that American, Delta, and United are the worst airline carriers since they have the most complaints?

As an Amazon Associate we earn from qualifying purchases.

This book may not be used in the training of large language models or otherwise be ingested into large language models or generative AI offerings without OpenStax's permission.

Want to cite, share, or modify this book? This book uses the Creative Commons Attribution License and you must attribute OpenStax.

Access for free at https://openstax.org/books/introductory-statistics/pages/1-introduction
  • Authors: Barbara Illowsky, Susan Dean
  • Publisher/website: OpenStax
  • Book title: Introductory Statistics
  • Publication date: Sep 19, 2013
  • Location: Houston, Texas
  • Book URL: https://openstax.org/books/introductory-statistics/pages/1-introduction
  • Section URL: https://openstax.org/books/introductory-statistics/pages/1-homework

© Jun 23, 2022 OpenStax. Textbook content produced by OpenStax is licensed under a Creative Commons Attribution License . The OpenStax name, OpenStax logo, OpenStax book covers, OpenStax CNX name, and OpenStax CNX logo are not subject to the Creative Commons license and may not be reproduced without the prior and express written consent of Rice University.

IMAGES

  1. Statistics 101

    homework section 7 2 stats

  2. Homework Section 7.2

    homework section 7 2 stats

  3. Statistics 7th Edition Section 2 2 Homework Exercises #1 Through #7

    homework section 7 2 stats

  4. Solved Save Homework: Section 7.2 Question 1.7.2.5-T HW

    homework section 7 2 stats

  5. Solved Save Homework: Section 7.2 Suggested Homework Score:

    homework section 7 2 stats

  6. Solved Sa Homework: Section 7.2: Sampling Distributions

    homework section 7 2 stats

VIDEO

  1. Section 7 2 Volume The Disk method M102 T232 YouTube Part VI

  2. Section 2.4 Answer to a homework question

  3. Edexcel S2 June 2022 WST02 IAL Paper Q7

  4. Section 7 2 Volume The Disk method M102 T232 YouTube Part III

  5. Problem 7-13 (Hibbeler, Statics)

  6. Exercise 2.5, chapter 2, algebra, Class 7 Maths, gs academy, mathematics,7th math, new course

COMMENTS

  1. Section 7.2 Homework Flashcards

    Section 7.2 Homework. Refer to the accompanying data display that results from a sample of airport data speeds in Mbps. (13.046,22.15) a. Express the confidence interval in the format that uses the "less than" symbol. Given that the original listed data use one decimal place, round the confidence interval limits accordingly. b.

  2. Chapter 7: Sampling Distributions

    AP Statistics Link to Chapter 7 Video Notes. YOUTUBE PLAYLIST CHAPTER 7. Power Points & Notes Outline ... Lesson 7.2 Powerpoint; Lesson 7.3 Powerpoint AP Notes Chapter 7 Homework. Key HW 7.1 Part A problems 1-8 ... Examples for 7-2 are below. Normal Calculations Involving P-Hat.

  3. Introductory Statistics

    Section 1-1: Statistics Basics. Section 1-2: Simple Random Sampling. Section 1-3: Other Sampling Designs. ... Section 7-2: The Mean and Standard Deviation of the Sample Mean. Section 7-3: ... you'll learn how to solve your toughest homework problems. Our resource for Introductory Statistics includes answers to chapter exercises, as well as ...

  4. Math 1530 Section 7.2 HW #14

    Help video for Statistics section 7.2 problem 14, spring 2020. Finding and interpreting probabilities using the normal distribution.

  5. Chapter 7.2 Solutions

    CH7.2. Problem. 1CYU1. Step-by-step solution. Step 1 of 3. The following information provided in the problem. Suppose 75% of young adult internet users watch online videos. An SRS of 1000 young adult internet users taken and calculates the proportion in this sample who watch online videos. Step 2 of 3.

  6. Ch. 7 Solutions

    To calculate the probability, we use normalcdf (lower, upper, μ, σ n√. ( E - 99,16 .7,17, 0. 8 30) = 0.0200. If the process is working properly, then the probability that a sample of 30 batteries would have at most 16.7 life span hours is only 2%. Therefore, the class was justified to question the claim.

  7. AP Stats: Chapter 7

    Notice the organization of this Chapter. Section 7.1 is an introduction to sampling distributions, which includes sampling distributions for proportions and sampling distributions for means. Actually it includes sampling distributions for any statistic. Section 7.2, we investigate the shape, center, and variability of the sampling distribution ...

  8. Chapter 7.2 Solutions

    Access Essentials of Statistics Plus MyStatLab with Pearson eText -- Access Card Package 6th Edition Chapter 7.2 solutions now. Our solutions are written by Chegg experts so you can be assured of the highest quality!

  9. Ch. 7 Chapter Review

    2.2 Histograms, Frequency Polygons, and Time Series Graphs; 2.3 Measures of the Location of the Data; 2.4 Box Plots; 2.5 Measures of the Center of the Data; 2.6 Skewness and the Mean, Median, and Mode; 2.7 Measures of the Spread of the Data; 2.8 Descriptive Statistics; Key Terms; Chapter Review; Formula Review; Practice; Homework; Bringing It ...

  10. Chapter 7.2 Solutions

    Chapter. CH7.2. Problem. 1P. Step-by-step solution. Step 1 of 1. A standard score, also called a z-score, measures the number of standard deviations you are away from the mean of the normal distribution. For example, a z score of 1.25 means that the underlying data value is 1.25 standard deviations above its mean. Back to top.

  11. Chapter 7 Homework

    7.1 Homework. Among various ethnic groups, the standard deviation of heights is known to be approximately three inches. We wish to construct a 95% confidence interval for the mean height of male Swedes. Forty-eight male Swedes are surveyed. The sample mean is 71 inches. The sample standard deviation is 2.8 inches. ¯¯¯x x ¯ =________.

  12. Ch. 7 Formula Review

    2.2 Histograms, Frequency Polygons, and Time Series Graphs; 2.3 Measures of the Location of the Data; 2.4 Box Plots; 2.5 Measures of the Center of the Data; 2.6 Skewness and the Mean, Median, and Mode; 2.7 Measures of the Spread of the Data; 2.8 Descriptive Statistics; Key Terms; Chapter Review; Formula Review; Practice; Homework; Bringing It ...

  13. Section 7.2 Homework

    Section 7.2 Homework. Ping Sa. Course. Elementary Statistics For Health And Social Sciences (STA 2014) 64 Documents. Students shared 64 documents in this course. University University of North Florida. Academic year: 2018/2019. Uploaded by: Anonymous Student.

  14. Hw7Solution

    STAT22000 Homework 7 Solutions. All page, section, and exercise numbers below refer to the course text (OpenIntro Statistics, 4th edition, by Diez, Barr, and Cetinkaya-Rundel.). Reading: Section 7. Problems for Self-Study : (Do Not Turn In) Exercise 7, 7, 7, 7, 711 on p-261. The answers can be found at the end of the book. Problems to Turn In:

  15. Chapter 2 Homework

    2. 8-9. 1. Find the relative frequencies for each survey. Write them in the charts. Using either a graphing calculator, computer, or by hand, use the frequency column to construct a histogram for each publisher's survey. For Publishers A and B, make bar widths of one. For Publisher C, make bar widths of two.

  16. Stats 7.3 Flashcards

    Stats 7.3. Mean and Standard Deviation of the Sampling Distribution of x̅. Click the card to flip 👆. Suppose that x̅ is the mean of an SRS of size n drawn from a large population with mean µ and standard deviation σ, then: The mean of the sampling distribution of x̅ is µ of x̅ = µ. The standard deviation of the sampling distribution ...

  17. Ch. 2 Bringing It Together: Homework

    Let X = the length (in days) of an engineering conference. Organize the data in a chart. Find the median, the first quartile, and the third quartile. Find the 65 th percentile. Find the 10 th percentile. Construct a box plot of the data. The middle 50% of the conferences last from _______ days to _______ days.

  18. Section 7.2 Homework Problem Statcrunch

    Using Statcrunch to solve problems when the "View an Example" is using XLSTAT.

  19. Chapter 7.2 Solutions

    50BB. 51BB. 52BB. 53BB. 54BB. Access Elementary Statistics Technology Update 11th Edition Chapter 7.2 solutions now. Our solutions are written by Chegg experts so you can be assured of the highest quality!

  20. Ch. 1 Homework

    2.1 Stem-and-Leaf Graphs (Stemplots), Line Graphs, and Bar Graphs; 2.2 Histograms, Frequency Polygons, and Time Series Graphs; 2.3 Measures of the Location of the Data; 2.4 Box Plots; 2.5 Measures of the Center of the Data; 2.6 Skewness and the Mean, Median, and Mode; 2.7 Measures of the Spread of the Data; 2.8 Descriptive Statistics; Key Terms ...

  21. Solved myopenmath.com Math 219: Statistics and Probability

    Question: myopenmath.com Math 219: Statistics and Probability (Fa20) - Online > Assessment Homework: Section 8.2 Score: 3.5/13 3/13 answered Save progress Done 6 VO Question 11 B0/1 pt 2 298 Details SAT scores are normally distributed with a mean of 1,500 and a standard deviation of 300. An administrator at a college is interested in estimating ...