Testing 1,2,3 Unit

UNIT 5—Testing 123

Teacher Materials


CLICK THE  SYMBOL OF EACH SECTION HEADER TO RETURN HERE.
TEKS Support
Teacher Notes
 Context Overview
 Mathematical Development
 Preparation Reading—"Anabolic Steroids: Use and Effect"
 Discussion—Modeling Process
 Activity 1—"It’s Only a Test"
 Supplemental Activity 1—"Middle of a Model" (Manipulative Version)
 Homework 1—"Let’s Rumble"
 Activity 2—"Middle of a Model"
 Homework 2—"Expect to Be Worth Something"
 Activity 3—Finally, A Model
 Homework 3—It’s a Good Fit
 Activity 4—Which Model Fits the Best?
 Activity 5—Verifying the Model as Quadratic
 Supplemental Activity 5—Introduction to the Parabola
 Homework 5—Quadratic Nature of Pairing Samples
 Activity 6—Solving the Problem
 Homework 6—Practice With Quadratics
 HANDOUT 1—Activity 1 Part 2
 HANDOUT 2—Activity 1 Part 4
 HANDOUT 3—Dracula Program
 HANDOUT 4—Linear Regression for Activity 3
 HANDOUT 5—Exponential Regression for Activity 3
 HANDOUT 6—Quadratic Regression for Activity 3
 HANDOUT 7—Graph Grid for Supplemental Activity 5
Annotated Student Materials
 Preparation Reading—Anabolic Steroids: Use and Effect
 Activity 1—It’s Only a Test
 Supplemental Activity 1—Middle of a Model (Manipulative Version)
 Homework 1—Let’s Rumble
 Activity 2—Middle of a Model
 Homework 2—Expect to Be Worth Something
 Activity 3—Finally, A Model
 HANDOUT 4—Answers
 HANDOUT 5—Answers
 HANDOUT 6—Answers
 Homework 3—It’s a Good Fit
 Activity 4—Which Model Fits Best?
 Activity 5—Verifying the Model as Quadratic
 Supplemental Activity 5—Introduction to the Parabola
 Homework 5—Quadratic Nature of Pairing Samples
 Activity 6—Solving the Problem
 Assessment—Working With Quadratics
 Unit Summary
 Mathematical Summary
 Key Concepts
Solution to Short Modeling Practice
 Mini Modeling Problem: Linear and Quadratic Equations
Solutions to Practice and Review Problems



TEKS Support


This unit contains activities that support the following knowledge and skills elements of the TEKS.



(1) (A)

X

(4) (A)

X

(1) (B)

X

(4) (B)

X

(1) (C)

X

   
       

(2) (A)

X

(8) (A)

 

(2) (B)

X

(8) (B)

 

(2) (C)

 

(8) (C)

 

(2) (D)

X

   
       

(3) (A)

X

(9) (A)

 

(3) (B)

X

(9) (B)

 

(3) (C)

X

   

The mathematical prerequisites for this unit are

The mathematical topics included or taught in this unit are

The equipment list for this unit is




Teacher Notes






"Simple Model"—Context Overview

The unit examines testing strategies for sampling pairs of individuals when cost is a consideration. Pooling of samples may require only one test and therefore save money, or may require three tests, losing money in the process. Whether pairs of samples should be initially pooled for testing depends on the probability that a single result is positive.

Students gather data by performing Monte Carlo experiments assuming different probabilities, and try to determine the mathematical model for this situation. They verify empirical results, deriving a theoretical relationship between the incidence of occurrence and the expected number of tests. Finally, they determine the "break-even" point, where it’s more cost-effective to test the samples individually than to pool the samples and test them in pairs.





Mathematical Development

Deciding when it is appropriate to pool samples is explored through discussion of the modeling process, concrete activity, and calculator simulation. The concept of expected value plays a major role in establishing the condition for the problem. Data analysis, using least squares regression, determines the constants that produce the best fit for linear, quadratic, and exponential functions. Examination of the residual patterns provides a basis for choosing which of these functions best describes the data. Probability area models are introduced to verify that the relationship between the probability of testing positive and the expected number of tests required is quadratic, and to determine the exact quadratic that models the situation. Finally, methods for solving the resulting quadratic equation are explored, and the problem of determining when it is cost-effective is solved.





Preparation Reading—"Anabolic Steroids: Use and Effect"

The reading provides some background into the problem of detecting steroid use and the physiological effects of taking anabolic steroids. The intent of the reading assignment is to generate a discussion of the use of steroids in the general population, and to build some understanding for the context of testing for steroid use. Prior to class, students should read the contents of this article, and be prepared to discuss some of the following questions:

In addition, you might want to facilitate a discussion about current events that involve steroid testing or steroid use. Possibilities include: the Tour de France bicycling competition from the summer of 1998, Mark McGwire admitting that he takes Androä , Ben Johnson having his life ban from track and field upheld, or steroid testing of athletes at the local high school or college. Specific topics might include how the tests are administered, the stigma of a positive result, fairness issues when one sport bans the use of a substance and another allows it, "professional" wrestling and steroid use, or even talk-show spectacles about people who have to have a leg amputated from excessive steroid use.

Another option for you might be to have students research the topic of steroid use, using the sources cited below, the Internet, or other materials they find on public health. Have them bring in an interesting article on the subject to share with the class, or present a report on their research at various points in the unit.

The following resources were used to generate the Preparation Reading, and contain information you may find useful.

Wright, J.E., and V. Cowart, Anabolic Steroids: Altered States, Carmel, Indiana: Benchmark Press, 1990.

Donohue, T., and N. Johnson, Foul Play: Drug Abuse in Sports, New York: Basil Blackwell Ltd, 1986.

Meer, J., Drugs and Sports, New York: Chelsea House Publishers, 1987.





Discussion—Modeling Process

Having brought the issue of steroid testing to the students’ awareness, you need to shift their thinking to the mathematical nature of testing samples in pairs. The discussion of the modeling process has probably taken place in previous units, and begins with the students developing a well-defined question. While students may be considering many questions, cost is a critical factor in trying to determine when to pool samples.


The second step of the modeling process is to identify key features or assumptions that will be part of the development of the model, and to introduce variables into the problem.


The third step of the modeling process is to begin to examine the mathematical nature of the problem and establish a relationship between the two variables.


At this point, you can do one of two things:
1) announce to students that they will begin their exploration by trying to find out if the relationship between p and E is linear, and begin Activity 1, or
2) you can ask them how they might verify their conjecture to the number of tests needed when the probability of testing positive in the population is 0.5, and then begin Activity 1.





Activity 1—"It’s Only a Test"

   

= 1, 2, 3, 4, 5, 7, 8



Materials needed (per group):

Handout 1: Data Table for Part 2 (one per student)

Opaque bag containing tiles, chips or beads—two colors, representing a positive or negative test, and containing the same number of each color object (to model the situation in which p = 0.50)

Graphing calculators with DRACULA program (Handout 3) loaded into memory or

Handout 2: Data Table for Part 4 (one per student)



Part 1 of the activity is a review of the previous discussion. There are several ways in which you can proceed. You might let the students struggle with the questions from Part 1, and then have the discussion of the modeling process, with students recording answers for future reference. You might have the discussion first, and give students several minutes to answer these questions to make sure they understand what was discussed. Depending on the time in class, you could have the discussion in class and send the students home to formally answer the questions as part of a homework assignment.

Part 2 is where the students actually do the experiment. Model for the students how to perform the simulation; have two students playing the roles of the athletes, and you be the tester. One student should draw an object and put it back; then the other student should draw an object and put it back. Upon consulting with each other, they should announce whether the pooled test would be positive or negative. If positive, you select one of the students for an individual test, and he or she tells you the result. Finally, you record the results of the test on one row of the data table on Handout 1—the actual status determined by the color of the object drawn, the results of the test determined by the pooled result and which individual is tested first. Check to see if they understand what they are doing before allowing the students to begin the experiment.

In Part 3, students have the opportunity to answer the question, "How many tests should we expect when the probability of testing positive is 0.50?" Groups must communicate their results so the class data can be averaged together. The easiest way to do this is to have each group post its average number of tests on an overhead transparency or the board. Discuss the answers to Part 3 after the students have had a chance to think about them and formulate their own responses. Make sure that they understand that the results of the simulation indicate that it is possible to have the average number of tests be 2, but it’s not likely that we should expect the number of tests to average out to be 2. We need more data to minimize the fluctuations caused by individual trial results.

Part 4 has two versions to provide a little flexibility for individual teachers’ situations. The DRACULA simulation uses a calculator program to simulate the testing game with a larger number of trials; the commands that make up the program are included in the unit material as Handout 3. The other version, which uses coin tosses to simulate the testing game, is provided in case the calculators aren’t available, or if time is too limited to fully explore the activity. Handout 2 data table is for use with this version. In either case, students should discover two things about this situation: 1) increasing the number of trials to the simulation gives less individual fluctuation to the results, and 2) whatever the expected number of tests actually is, it’s slightly higher than 2 and not equal to 2.





Supplemental Activity 1—"Middle of a Model" (Manipulative Version)

   

In case the graphing calculators are not available for Activity 2, an optional development is provided in which students repeat the testing game from Activity 1. Each group models one of the probabilities mentioned in Activity 1 by adjusting the relative numbers of objects from which they are drawing. They need to repeat the experiment 100 times, so that the data will be fairly patterned. Then they share their data as a class, and the rest of the activity is the same as Activity 1. Refer to those teacher notes for more details.





Homework 1—"Let’s Rumble"

   

This is a collection of questions related to the testing game introduced in Activity 1. In discussing it with students afterward, ask students to connect Question 5 to the simulation done in class. Had they done 1000 trials, instead of 10 or 100, there would be even less fluctuation in the results and they would be even closer to the actual expected number of tests.





Activity 2—"Middle of a Model"

   

This activity should begin with a continuation of the discussion of the modeling process. We’ve determined that a probability of 0.5 for testing positive doesn’t yield two tests as the expected number. But does that mean that the behavior is linear, and that particular point isn’t exactly on the line? Or is the behavior nonlinear? And, if it isn’t linear, the natural question that begs to be asked is: What is it?

At this point, you can distribute Activity 2 and the graphing calculators. Each group should be assigned a probability; 8 groups of students can run simulations with p = 0.1, 0.2, 0.3, 0.4, 0.6, 0.7, 0.8, and 0.9. Remember that we already have data for p = 0.5, and the expected number of tests is determined when p = 0 and p = 1. The directions instruct the students to enter ‘1000’ as the number of trials; this will take several minutes. You could build in some short "sponge" activity while the students are waiting, or you could tell them to enter in a smaller number, like ‘100.’ The smaller number of trials will get an answer back much faster, but the data set won’t be as smoothly patterned. For the purposes of this activity and the rest of the unit, it doesn’t matter which way you go.

After the groups have run the simulation, a member from each group should record the group’s result on an overhead transparency or the board, and all students should record the class data on their activity pages in the data table. In analyzing the pattern, students are asked to "fit" the data with a line that starts at (0,1) and ends at (1,3). You may need to review how to find the equation of a line from two points. Make sure students understand the difference between observed value (numbers gotten from the calculator simulation) and expected value (numbers gotten by evaluating the line equation for the various values of p).





Homework 2—"Expect to Be Worth Something"

   

The first part of the homework activity will require students to have a pair of dice. The activity is meant to provide them an opportunity to explore the problem’s various outcomes and payoffs before tackling the problem by mathematical analysis. An option for you is to do that part of the homework activity in class and let students do the analysis when they get home. Be sure that students understand the definition of probabilistic worth and expected value, or at least call to their attention the fact that the boldface words are actually definitions, and tell you how to proceed with the calculations.





Activity 3—Finally, A Model

   

Materials needed

Graphing calculators

Handouts 4, 5, and 6
(1 each per student)

The purpose of this activity is to use regression analysis to determine if a linear, exponential, or quadratic function is the "best" type to use as a model. We’re continuing to use the data that have been collected on the expected number of tests and the probability of steroid use. Remind students that the process used in the previous activity (find an equation, calculate predicted values, calculate residuals, and examine the residual plot) is going to stay the same, but this time, the equations will be determined from regression analysis.

If students have never done regression, or were unsure of what they were doing, take the opportunity to show them how to do the linear regression (Part 1). Even if they are pretty clear of the idea, they may have forgotten how to use the graphing calculators to do that task, and reviewing with them the steps on the calculator isn’t a bad thing. Having done that, the students should proceed to do parts 2 and 3 on their own, so that they can discover the nature of the quadratic relationship themselves. Encourage them to explore other types of mathematical functions as well, even though that’s labeled an optional part.

Students should continue to Part 4 and answer the questions based upon the work that they’ve done. Have a brief discussion at the end, asking students to:





Homework 3—It’s a Good Fit

   

There are several options for the teacher in proceeding with this assignment. If students have access to graphing calculators, this assignment is a nice opportunity for them to practice developing linear models and determining if they truly are linear. If the students don’t have access to graphing calculators outside class, several options are still available.

  1. Have each student make a hand drawing of the data set, draw a line on the graph that is a reasonable "best" fit line, then determine the equation of that line from two points on the line. They can use the equation to calculate the predicted values and the residuals, and finally make a residual plot to determine if the data set is linear. Depending on your students, doing both data sets in this fashion might be tedious; however, students need to see the contrasting results to reinforce whether data is linear. You could assign one problem to half the class and the other problem to the other half of the class.


  2. Have students do all the data analysis in class with the calculators and record their table values; they would then go home to make the graphs and write their conclusions.


  3. Have it be an optional reteaching activity to be assigned as in-class work if they don’t understand the process well enough from Activity 3.




Activity 4—Which Model Fits the Best?

   

Students work in groups to analyze several data sets out of the context of steroid testing, and must determine the nature of the mathematical relationship among the data sets. There are a couple of ways to proceed with this activity:





Activity 5—Verifying the Model as Quadratic

   

 = 16, 17, 18



Materials needed

Transparency 1

This activity illustrates the idea of probability as an area. The specific example, which is the focus for Part 1, is the original testing problem, in which the assumption is that 50% of the population is taking steroids. Depending on your students, you can either lead the class through a discussion of the questions that form Activity 1 (with them recording the answers as they go) or let them work on that part for about 10-15 minutes and have a discussion of the material at that time. It’s important that the students identify each area and its meaning with respect to the probability conditions. Be sure that they verbalize the condition and understand why the values are being multiplied. Then proceed to Part 2, in which the problem is generalized to a condition that the probability of steroid use in the population is p, instead of 0.5. Students may struggle with the algebraic simplifications needed for Questions 5 and 7, but that’s the "punch line" to verifying the model as being quadratic, and well worth the effort. If needed, use Transparency 1 to review the questions from Part 2 in the follow-up discussion.

Remind your class that the probability that each of two events happens is calculated as follows:

P(A and B) = P(A) · P(B½A).

That is, "the probability of event A and event B both happening is equal to the product of the probability of event A times the probability of event B knowing that event A has already happened. If the probabilities of event A and event B are not related (independent events), the probability of "B given that A has occurred" is simply the probability of event B. It is only because we have assumed that the two individual tests are independent that we can state:

P(A and B) = P(A) · P(B).

Note: As you saw in the unit Imperfect Testing, the area model of probability can also be represented using probability trees. Though they are not discussed on the student pages of this unit, you may want to encourage students to use both methods on some problems.





Supplemental Activity 5—Introduction to the Parabola

   


Materials needed

Handout 7

Transparency 2

This activity is designated as a supplemental activity only because it doesn’t specifically address the problem of steroid testing and the cost-effectiveness of pooling blood samples. However, it is strongly recommended that a day be spent working on the various parts of this activity if the students have never seen a quadratic function, or if they haven’t mastered the concepts of translations and stretches of quadratics and the distinction between standard-form and vertex-form equations for parabolas.

In part 1, students concentrate on the graphs of the parent function y = x2 and translations on that function. Introduce them to the parent function by having them fill in the table of values and plot the points on the graph grid on Handout 7. Recall the concepts of domain and range, and reinforce the range limitation by asking the class to find the x-value that yields y = –1. Encourage discussion about "pairs" of points (x, y) and (–x, y), introducing the concept and terminology of line symmetry. Include several fractions and decimals in your table. Graph the function and label the vertex. (Use Transparency 2 to help you with the instruction, if needed). Use the graphing calculator to verify the drawing, and use the TRACE feature to verify the values recorded in the table.

Allow students to work in their groups to explore the transformations. There are four different transformations, so each student in the group can explore one of the shifts. Students should come together as a group to discuss their findings, and then answer Question 3. End the work for Part 1 by reviewing:
1) the shape of the parent parabola graph (y = x2),
2) the rules governing lateral translations of functions and
3) the domains and ranges of parabolas in the form yk = (xh)2.

In Part 2, students concentrate on how the graph of the parent function y = x2 is affected by stretch transformation and combinations of stretches and shifts. Once again, the work is divided into four parts, and each student in the group should explore two examples on his or her own. Then, the groups should collaborate to compare their results and draft responses to Questions 2 and 3. In the class discussion that follows the group explorations, ask if such stretch transformations change the domain or range of the parabola. (A horizontal stretch transformation would never change the domain or range, but a vertical stretch would change the range for parabolas whose vertex points are not on the x-axis.) Ask students if it matters whether one does the shift first and then the stretch, or the stretch first and then the shift. Close the discussion by reviewing how stretches in in both vertical and horizontal directions can be done.

In Part 3, students are asked to examine the two forms for the equation of a parabola: standard form and vertex form. They explore the parallel manipulation of the equation and its graph to discover how to change the equation from one form to another. Allow students to work through Questions 1-3, then discuss their findings before proceeding to Question 4, which gives students an additional opportunity to practice.





Homework 5—Quadratic Nature of Pairing Samples

   

This assignment asks students to examine the expected value calculation again, first using a specific value for the probability of testing positive (0.3) and then examining the model developed. Students are led to examine how changes in the probability affect the model geometrically (how do the various shapes adjust?) and analytically (how do the various pieces of the equation behave?).

A really effective way of exploring Question 1 would be to construct an area model on the computer using a dynamic drawing program like Geometer’s Sketchpad. Display the areas of the three regions representing the three different tests, and the probabilities associated with those regions. Then drag the intersection point from the middle of the square (Point E) toward the upper left or lower right corner and watch what happens to the areas and the calculations!





Activity 6—Solving the Problem

   

Closure on the problem of the cost-effectiveness of pooling blood samples is reached by examining the solution to the questions: "What is the break-even point? When is it cheaper to pool blood samples? When is it cheaper to test samples individually?" Various methods are explored for solving the quadratic equation describing the condition that our model should have a value equal to 2. Part 1 is pretty straightforward, using the calculator’s capability to determine intersection of two graphs. You might want to show your students the CALC menu feature that locates the intersection of two curves at this same time.

Part 2 examines the method of solving a quadratic equation called "completing the square." Students are led through this process and then asked to solve the modeling problem by applying their method. Let students work through Part 2; discuss individual aspects of the process as needed. After students solve the problem in Question 10, have them take a minute to compare their answers to the ones obtained in Part 1; they should verify the solution.

Finally, in Part 3, students are asked to generalize the work they did in Part 2 and to derive the quadratic formula. Let them work through Question 1, and have them tell you the steps involved. Make sure that there is no ambiguity or confusion in their process or thinking. Then let them write their answers to Question 2 before proceeding to the general equation. Discuss their work on Question 3 as needed, before letting the students go on to Question 4.





Homework 6—Practice With Quadratics

   

This assignment should be an opportunity for students to practice what they have discovered in the previous work, and a chance for you to see which students really "got it" and which are still struggling with the algebra developed in this last activity.





HANDOUT 1—Activity 1 Part 2

Data Table

Trial No.

 

1st Person’s Actual Status

2nd Person’s
Actual Status

 

Result of Pooled Test

Result of 1st Individual Test
(if needed)

Result of 2nd Individual Test
(if needed)

 

Number of Tests Needed

1

                 

2

                 

3

                 

4

                 

5

                 

6

                 

7

                 

8

                 

9

                 

10

                 


 

Average Number
of Tests:

 




HANDOUT 2—Activity 1 Part 4

Data Table

Trial
No.

Number
of Tests

 

Trial
No.

Number
of Tests

 

Trial
No.

Number
of Tests

 

Trial
No.

Number of Tests

1

   

26

   

51

   

76

 

2

   

27

   

53

   

77

 

3

   

28

   

53

   

78

 

4

   

29

   

54

   

79

 

5

   

30

   

55

   

80

 

6

   

31

   

56

   

81

 

7

   

32

   

57

   

82

 

8

   

33

   

58

   

83

 

9

   

34

   

59

   

84

 

10

   

35

   

60

   

85

 

11

   

36

   

61

   

86

 

12

   

37

   

62

   

87

 

13

   

38

   

63

   

88

 

14

   

39

   

64

   

89

 

15

   

40

   

65

   

90

 

16

   

41

   

66

   

91

 

17

   

42

   

67

   

92

 

18

   

43

   

68

   

93

 

19

   

44

   

69

   

94

 

20

   

45

   

70

   

95

 

21

   

46

   

71

   

96

 

22

   

47

   

72

   

97

 

23

   

48

   

73

   

98

 

24

   

49

   

74

   

99

 

25

   

50

   

75

   

100

 


Average Number of Tests:

 




HANDOUT 3—Dracula Program

(written for a TI-82 calculator)

PROGRAM:DRACULA

:0® C

:Disp "WHAT IS P?"

:Prompt P

:Disp "HOW MANY TRIALS?"

:Prompt N

:For(I,1,N,1)

:rand® A

:rand® B

:If A> P and B> P

:C+1® C

:If A< P

:C+3® C

:If A> P and B< P

:C+2® C

:End

:Disp "AVERAGE NUMBER"

:Disp "WAS"

:Disp C/N

:Stop





HANDOUT 4—Linear Regression for Activity 3

Equation: E =


Prob.
(p)

Observed
Value (E)

Expected
Value (E)


Errors
(Residuals)

0.0

     

0.1

     

0.2

     

0.3

     

0.4

     

0.5

     

0.6

     

0.7

     

0.8

     

0.9

     

1.0

     

 

Exponential Regression





HANDOUT 5—Exponential Regression for Activity 3

Equation: E =


Prob.
(p)

Observed
Value (E)

Expected
Value (E)


Errors
(Residuals)

0.0

     

0.1

     

0.2

     

0.3

     

0.4

     

0.5

     

0.6

     

0.7

     

0.8

     

0.9

     

1.0

     

 

Exponential Regression




HANDOUT 6—Quadratic Regression for Activity 3

Equation: E =


Prob.
(p)

Observed
Value (E)

Expected
Value (E)


Errors
(Residuals)

0.0

     

0.1

     

0.2

     

0.3

     

0.4

     

0.5

     

0.6

     

0.7

     

0.8

     

0.9

     

1.0

     

 

Exponential Regression





HANDOUT 7—Graph Grid for Supplemental Activity 5




Annotated Student Materials






Preparation Reading—Anabolic Steroids: Use and Effect

Anabolic steroids, synthetic compounds created to act like the male hormone testosterone, remain in the forefront of discussions regarding athletes and sports. Many athletes such as bodybuilders, weight lifters, runners, swimmers, and football players believe that steroids will give them strength advantages.

Actually, the development of anabolic steroids was an important scientific breakthrough. There are, in fact, four legal uses for the drugs. They are effective in treating certain forms of anemia, some kinds of cancer, pituitary dwarfism, and serious hormone disturbances.

By virtue of their specific chemical structure (which is totally different from any vitamin or amino acid), anabolic steroids can stimulate the genetic apparatus (DNA) within the cell of the nucleus. The DNA reacts to this stimulus by directing the production of specific new proteins. It is very important to understand that the biological response that occurs is dependent on the location and number of receptors. If the receptor is muscle, there will be a tissue-building effect. If the receptor is in the brain, there may be noticeable psychological and even behavioral effects. There are, incidentally, more receptors in "sexual" tissue than in muscle, which raises concern about the prospect of hypertrophy or cancer of the prostate gland.

Although the development of bigger muscles is a result of the anabolic component of the steroid, there is also an androgenic associated with the steroid. Androgenics cause growth of facial hair, lengthening of the vocal cords that results in the voice "breaking", acne, and the premature closure of the space between parts of the body’s long bones. Anabolic steroids can have harmful effects on the liver and the cardiovascular system (causing a higher risk for developing high blood pressure and blood-clotting problems). There is growing concern that flooding a young adolescent with synthetic sex hormones may disrupt not only physical changes but also normal psychological and emotional maturation that occur during adolescence.

Several studies have placed the incidence of anabolic-androgenic steroid use among high-school students at 6 to 7%. Figures as high as 11 percent of 11th-grade boys using steroids have been reported, meaning that between 250,000 and 500,000 high-school students have used steroids. Even more disturbing are reports that approximately 40 percent who have used steroids have used five or more cycles, and approximately 40 percent began using steroids before the age of 16. As expected, the majority of teenage users are participants in school athletics, but as many as a third of the male users are not involved in sports and take these drugs in an attempt to enhance their appearance.

There have been only a few surveys among the male college population, all of which show an estimated 2 percent usage. However, among male college athletes, the estimated usage rises to between 5 and 17%. The problem is not restricted to males. As many as 1 to 2 percent of senior high school girls are users. Between 3 and 5 percent of Olympic and professional female athletes have admitted to using them at some time during their careers.

Source: Write, J. E., and V. Coward, Anabolic Steroids: Altered States, Carmel, Indiana: Benchmark Press, 1990.





Activity 1—It’s Only a Test

   

 = 1, 2, 3, 4, 5, 6, 7, 8

Part 1: Introduction

  1. What is the central problem that we’re trying to solve?
  2. We’re trying to determine when it’s cheaper to "pool" blood samples in testing for steroid use.

  3. What are the variables in this problem?
  4. The likelihood, p, that someone is actually using steroids in the population, and the expected number of tests, E, that would be needed.

  5. When one tests for steroid use by "pooling" two blood samples, how many tests do you expect to have to make?
  6. The number of tests will always be one, two, or three tests.

  7. In testing for anabolic steroid use by pooling two samples together:


    1. If nobody is taking steroids, how many tests do you expect to have to make?
    2. One test every time, because the pooled sample would come up negative (clean).

    3. If very few people are taking steroids, how many tests do you expect to have to make?
    4. Most of the time, only one test would be needed. Once in a while, a second (or third) test would be required. The average number would be close to one.

    5. If almost all the people are taking steroids, how many tests do you expect to have to make?
    6. Most of the time, three tests would be needed–the pooled sample test would come up positive, and the first individual test would come up positive. Some of the time, only two tests would be needed, since the first individual test might be negative. The average number would be close to three.

    7. If everybody is taking steroids (and what a crazy world that would be!), how many tests do you expect to have to make?
    8. Three tests every time, because the pooled sample would always come up positive, and the first individual test would also come up positive.

  8. If exactly half the population were taking steroids, how many tests do you expect to need for each sample pair? Explain your reasoning.
  9. I guess that two tests would be needed. Since p = 0.5 is halfway between none of the people and all of the people, I expect that halfway between one test and three tests should be my answer.

  10. What modeling assumptions did you make in answering Question 5?
  11. The nature of the relationship between the two variables is linear; the expected number of tests increases proportionally to the population occurrence.



Part 2: Simulation

In groups, you are going to explore the problem of what happens when half the population is taking steroids. Two people will play the role of Olympic athletes, who have to be tested for steroid use. The third person will be the lab worker performing the test. Go through the simulation 10 times, recording the actual status of each athlete, the result of the pooled test, and the result of any individual tests, in the data table provided for this activity. Use marks of ‘+’ for a positive result or ‘–’ for a negative result. Then, determine the number of tests needed for each trial, and finally the average number of tests needed for the 10 trials.



Part 3: Analyzing Class Results

In the table below, record each group’s results for the average number of tests needed for 10 trials. Answers will vary; sample results provided in the table.

Group

1

2

3

4

5

6

7

8

9

10

Avg. No. of Tests

2.3

1.9

1.8

2.4

2.4

1.9

2.5

2.4

2.2

2.5

  1. As the tester, when would you be saving money in pooling samples?
  2. When the average number of tests was under 2 tests, since testing the two athletes individually would always require 2 tests.

  3. When half the population is using steroids, is it possible to save money by pooling samples in testing for steroid use?
  4. Yes. There were specific trials that required only 1 test, and, in certain groups, the average number < 2 tests.

  5. In this situation, is it likely that you would save money by pooling samples?
  6. Probably not. There were more group results that were > 2 tests, and the average of those results was » 2.2 tests. However, given the range of results, it’s hard to be sure. Some of the groups may have been very unlucky in choosing which of the individual samples to test first when the pooled sample came up positive.

  7. Think back to the simulation. Are there problems or limitations in the way in which we collected our data that might affect the results that we got?
  8. Possible answers include: not replacing the first person’s chip before drawing the second, having the chips stick to each other, nonuniform weights or shapes for the objects drawn, not mixing or shaking up the objects well enough, or looking at the objects as they are being drawn.

  9. How could we modify our simulation, to better understand what’s going on in the situation where half the population is taking steroids?
  10. Do more trials. Specifically, if each group did the test 100 times, the range of results might narrow down a bit, and the average of those results would get close to the "true" results.



Part 4: DRACULA Simulation of Situation

Each group will run the DRACULA program. When asked the first question, "What is P? P = ?", enter in the value ‘0.5’ (remember, we’re studying what happens when half the population is taking steroids). When asked the second question, "How many trials? N = ?", enter in the value ‘100’. After a few seconds, the calculator will provide the average number of tests needed.

In the table below, record each group’s results for the average number of tests needed for 100 trials.

Answers will vary; sample results provided in the table.

Group

1

2

3

4

5

6

7

8

9

10

Avg. No. of Tests

2.16

2.21

2.31

2.04

2.24

2.37

2.33

2.16

2.26

2.18

  1. Do you think it’s cheaper to pool samples in testing for steroid use when half the population is taking steroids? Explain.
  2. No. The range of possible results when only 10 trials were taken was [1.8, 2.5]. When the simulation was repeated with 100 trials, the range became [2.04, 2.37]. In every situation, the average number of tests was greater than 2, and, when you consider the total number of tests (n = 1000), the average » 2.23. Incidentally, the average of the total number of tests before was » 2.2 as well. I thought that the results were supposed to be an average of 2 tests. I wasn’t sure if the way the simulation was being run, or a streak of bad luck, might be causing me to get a different answer. But now, I’m convinced that it’s not supposed to average to 2 tests, and, if I needed even more convincing, I would simply repeat the simulation again, doing an even larger number of trials.

  3. In modeling the situation, two questions come up as a result of this activity. Review the problem and the original assumptions, and see if you can come up with those questions.


    1. The problem of "when is it cheaper to pool samples?" is now narrowed to a range of possible probabilities between [0, 0.5]. So a natural question to ask would be: "What is the probability of steroid use in the general population where the break-even point occurs?" or "At what probability would the number of tests be expected to be exactly two tests?"


    2. The assumption is that the relationship between p and E is not linear, or else we would have found that the number of tests would average to be 2 tests. So a natural question to ask at this point would be: "What is the exact nature of the relationship between p (the incidence of steroid use in the population) and E (the expected number of tests required in pooling two samples)?"


Part 5: Simulation by Tossing Pennies

You will repeat the simulation from today’s class, only this time using pennies. We’ll agree that a penny that comes up ‘tails’ will be a negative test result, and a penny that comes up ‘heads’ will be a positive test result. You will need someone else to play the role of the tester, but you might have figured out there is a shortcut to doing this:

Record the results of the coin-toss simulation in the data table provided. Calculate the average number of tests needed for your simulation, and come to class tomorrow ready to share your results.

In the table below, record 10 students’ results for the average number of tests needed for 100 trials.

Answers will vary; sample results provided in the table.

Group

1

2

3

4

5

6

7

8

9

10

Avg. No. of Tests

2.16

2.21

2.31

2.04

2.24

2.37

2.33

2.16

2.26

2.18

  1. Do you think it’s cheaper to pool samples in testing for steroid use when half the population is taking steroids? Explain.
  2. No, the range of possible results when only 10 trials were taken was [1.8, 2.5]. When the simulation was repeated with 100 trials, the range became [2.04, 2.37]. In every situation, the average number of tests was greater than 2, and, when you consider the total number of tests (n = 1000), the average » 2.23. Incidentally, the average of the total number of tests before was » 2.2 as well. I thought that the results were supposed to be an average of 2 tests. I wasn’t sure if the way the simulation was being run, or a streak of bad luck, might be causing me to get a different answer. But now, I’m convinced that it’s not supposed to average to 2 tests, and, if I needed even more convincing, I would simply repeat the simulation again, doing an even larger number of trials.

  3. In modeling the situation, two questions come up as a result of this activity. Review the problem and the original assumptions, and see if you can come up with those questions.


    1. The problem to solve becomes "What is the probability of steroid use in the general population where the break-even point occurs?" or "At what probability would the number of tests be expected to be exactly two tests?"
    2. The modeling question becomes "What is the exact nature of the relationship between p (the incidence of steroid use in the population) and E (the expected number of tests required in pooling two samples)?"




Supplemental Activity 1—Middle of a Model (Manipulative Version)

   

In Activity 1, we assumed that the probability was 0.50 for a person selected at random to be taking steroids. This activity will explore how different probability rates affect the number of tests required.

Your group will be assigned a particular probability to use, and then you will do the Testing Game again. Modify the number of each kind of marker used, so that it models the probability your group is working with. Then do the simulation 100 times and determine the average number of tests. When done, record your results in a table and share them with other groups according to directions given.

Record the class results for the average number of tests needed for 100 trials in the space below the appropriate probability of occurrence. Use a table similar to the data table in Handout 2, Activity 1.

Answers will vary; sample data provided in the table.

Probability (p)

0.0

0.1

0.2

0.3

0.4

0.5

Avg. No. of Tests (E)

1.00

1.25

1.56

1.85

2.06

2.24

Probability (p)

0.6

0.7

0.8

0.9

1.0

Avg. No. of Tests (E)

2.41

2.69

2.79

2.86

3.00



  1. Using the data collected, make a graph of Expected Value (E) vs. probability (p).
  2. In creating a model, the first clue is the pattern showing up in the scatterplot. Describe the graph; what kind of pattern seems to relate these quantities?
  3. It appears to be fairly linear, starting at (0,1) and going to (1,3).

  4. You may have answered the previous question by saying that the pattern looks linear. Let’s assume that it is. The next step is to describe the pattern in mathematical terms.


    1. Draw a line that starts at the first point (0,1), and goes to the last point (1,3). Find the slope of that line.
    2. Slope = (3 – 1) / (1 – 0) = 2

    3. Write the equation for the line in slope-intercept form; this will be our first model for these data.
    4. E = 2p + 1 or y = 2x + 1

  5. Now, we have to see how good a job our model does in describing the observed data. Fill in the table below; the actual values are the ones found from the simulation and recorded on the previous page. The equation above tries to "predict" those data, and so predicted values are found by taking each value for the probability and using it in the equation you just derived. Finally, errors (residuals) are how far the predictions are from the observed data, so they are found by subtraction:
  6. Error = Observed – Expected

    Probability
    (p)

    Observed
    Value (E)

    Expected
    Value

    Error
    (residual)

    0.0

    1.00

    1.00

    0.00

    0.1

    1.25

    1.20

    + 0.05

    0.2

    1.56

    1.40

    + 0.16

    0.3

    1.85

    1.60

    + 0.25

    0.4

    2.06

    1.80

    + 0.26

    0.5

    2.24

    2.00

    + 0.24

    0.6

    2.41

    2.20

    + 0.21

    0.7

    2.69

    2.40

    + 0.29

    0.8

    2.79

    2.60

    + 0.19

    0.9

    2.86

    2.80

    + 0.06

    1.0

    3.00

    3.00

    0.00

  7. Finally, in the grid provided above, make a graph of errors vs. probability (p).


  8. A good model will show a random pattern of residuals, with a balance of points above and below the zero line. Does the model you’ve developed have those properties?
  9. No. The residual plot has no points below the zero line and is not showing a random pattern.





Homework 1—Let’s Rumble

   
  1. Dante Takum is trying to arrange tag-team matches for his wrestlers, half of whom are currently taking anabolic steroids. The event organizers insist on having the wrestlers tested for steroid use. Each test costs $160.


    1. If you wanted to test each wrestler individually, what would the total cost be?
    2. (2)($160) = $320

    3. Based on the work done in Activity 1, how much would it cost (on average) to test the pooled sample from the two wrestlers on the tag team first, and then test individuals if needed?
    4. Based on an average of 2.23 tests per pair, it would cost (2.23)($160) = $356.80

  2. Carefully consider the following table that represents results from repeating the testing simulation from Activity 1. Was the game properly played? How do you know?
  3. Trial Number

    1

    2

    3

    4

    5

    6

    7

    8

    9

    10

    Number of Tests

    1

    3

    3

    1

    1

    1

    3

    1

    1

    3

    Probably not. The situation in which two tests were required never came up.

  4. In Activity 1, we modeled a situation in which half the population was taking steroids; we used two types of markers and had the same number of each color. If green means "steroid user," determine the smallest number of blue and green markers needed to model a population in which the probability of steroid use is:


    1. 75%
    2. 3 green, 1 blue

    3. 15%
    4. 3 green, 17 blue

    5. 23%
    6. 23 green, 77 blue

  5. Suppose you want to model a situation in which the probability of an individual taking steroids is 25%. Use the same color scheme as in the previous question.


    1. What is the smallest number of chips that you would need?
    2. 1 green and 3 blue

    3. If you drew a blue marker on the first draw, and did not replace the chip, what would be the probability of drawing a blue on the second draw? A green on the second draw?
    4. 2/3 or 0.67; 1/3 or 0.33

    5. If you drew a green on the first draw and did not replace the chip, what is the probability of drawing a green on the second draw? A blue on the second draw?
    6. 0%; 100%

  6. Suppose that you use a coin to simulate the situation in which the probability is 0.5 that a person in the general population is taking anabolic steroids, and that you use a "fair coin."


    1. Would that mean that, if you flipped the coin twice, you would always get one head and one tail?
    2. No, but that would be the most likely outcome.

    3. If you flipped the coin a million times, would you always get 500,000 heads and 500,000 tails?
    4. No, but that (very unlikely) event would still be the most likely outcome.

    5. Which do you think is more likely: The number of heads in 10 tosses would be between 4 and 6, or the number of heads in 100 tosses would be between 40 and 60?
    6. The second option is more likely.

    7. Suppose a fair coin was flipped ten times and happened to land heads eight of those times. If you flipped the coin another 100 times, how many heads would you expect to get?
    8. Approximately 50. This would naturally draw the overall percentage closer to 50% (i.e. 58 out of 110, if exactly 50 of the next 100 tosses are heads). Be careful not to assume that the data will automatically compensate for the initial results.

  7. In your simulation in Activity 1, it was important to replace the drawn chip before drawing another. Can you explain why? Can you think of a different situation where it would be important not to replace the chip?
  8. Not replacing the drawn chip changes the probability of drawing a green chip on the second draw. We want to model the situation in which both athletes have the same probability of testing positive.

    Many states have lotteries in which a collection of six numbers is chosen from 44 to 51 different possible numbers. In that situation, the numbers shouldn’t be replaced.





Activity 2—Middle of a Model

   

 = 6, 9, 10, 11

In Activity 1, we assumed that the probability was 0.50 for a person selected at random to be taking steroids. This activity will explore how different probability rates affect the expected number of tests required.

The teacher will assign a particular probability to each group, which will then run the DRACULA program. When prompted by the calculator program, give it the probability (decimal, please!), and set the number of trials to be 1000. Settle in for a couple of minutes; after the calculator has done all those trials, record your results in the table below and share them with other groups according to directions given.

In the tables below, record the class results for the average number of tests needed for 1000 trials in the space below the appropriate probability of occurrence:

Answers will vary; sample data provided in the table.

Probability (p)

0.0

0.1

0.2

0.3

0.4

0.5

Avg. No. of Tests (E)

1.000

1.300

1.554

1.879

2.042

2.237

Probability (p)

0.6

0.7

0.8

0.9

1.0

Avg. No. of Tests (E)

2.418

2.632

2.727

2.873

3.000

  1. Using the data collected, make a graph of Expected Value (E) vs. probability (p).
  2. In creating a model, the first clue is the pattern showing up in the scatterplot. Describe the graph; what kind of pattern seems to relate these quantities?
  3. It appears to be fairly linear, starting at (0,1) and going to (1,3).

  4. You may have answered the previous question by saying that the pattern looks linear. Let’s assume that it is. The next step is to describe the pattern in mathematical terms.


    1. Draw a line that starts at the first point (0,1), and goes to the last point (1,3). Find the slope of that line.


    2. Slope = (3 – 1) / (1 – 0) = 2

    3. Write the equation for the line in slope-intercept form; this will be our first model for these data.
    4. E = 2p + 1 or y = 2x + 1

  5. Now, we have to see how good a job our model does in describing the observed data. Fill in the table below; the actual values are the ones found from the simulation and recorded on the previous page. The equation above tries to "predict" those data, and so predicted values are found by taking each value for the probability and using it in the equation you just derived. Finally, errors (residuals) are how far the predictions are from the observed data, so they are found by subtraction:
  6. Errors = Observed – Expected

    Probability
    (p)

    Observed
    Value (E)

    Expected Value

    Error
    (residual)

    0.0

    1.000

    1.000

    0.000

    0.1

    1.300

    1.200

    + 0.100

    0.2

    1.554

    1.400

    + 0.154

    0.3

    1.879

    1.600

    + 0.279

    0.4

    2.042

    1.800

    + 0.242

    0.5

    2.237

    2.000

    + 0.237

    0.6

    2.418

    2.200

    + 0.218

    0.7

    2.632

    2.400

    + 0.232

    0.8

    2.727

    2.600

    + 0.127

    0.9

    2.873

    2.800

    + 0.073

    1.0

    3.000

    3.000

    0.000

  7. Finally, in the grid provided above, make a graph of errors vs. probability (p).


  8. A good model will show a random pattern of residuals, with a balance of points above and below the zero line. Does the model you’ve developed have those properties?
  9. No. The residual plot has no points below the zero line and is not showing a random pattern.





Homework 2—Expect to Be Worth Something

   
  1. A dice game is defined by two rules:


  2. Find the expected value.

    1. Play the game several times with a friend or a family member. Describe the strategy that you think would be best to apply to this game.
    2. Answers will vary. Possibilities include "12," since it comes up in a lot of different ways; "36," since it is the biggest outcome.

    3. Now, let’s apply some mathematics. First, we need to determine what the possible outcomes are when you multiply the two dice together, and how often they come up. Fill in the chart provided below with the winning answers:


    4. Outcome for Dice 2

       

      1

      2

      3

      4

      5

      6

      1

      1

      2

      3

      4

      5

      6

      2

      2

      4

      6

      8

      10

      12

      3

      3

      6

      9

      12

      15

      18

      4

      4

      8

      12

      16

      20

      24

      5

      5

      10

      15

      20

      25

      30

      6

      6

      12

      18

      24

      30

      36


    5. Some outcomes come up more than once, and some outcomes pay off more than others. Here we consider that.


    6. Value
      (n)

      Prob.
      (p)

      Worth
      (W)

      1

      1/36

      1/36 » 0.028

      2

      2/36

      4/36 » 0.111

      3

      2/36

      6/36 » 0.167

      4

      3/36

      12/36 » 0.333

      5

      2/36

      10/36 » 0.278

      6

      4/36

      24/36 » 0.667

      8

      2/36

      16/36 » 0.444

      9

      1/36

      9/36 » 0.250

      10

      2/36

      20/36 » 0.556

      12

      4/36

      48/36 » 1.33

      15

      2/36

      30/36 » 0.833

      16

      1/36

      16/36 » 0.444

      18

      2/36

      36/36 » 1.00

      20

      2/36

      40/36 » 1.11

      24

      2/36

      48/36 » 1.33

      25

      1/36

      25/36 » 0.833

      30

      2/36

      60/36 » 1.67

      36

      1/36

      36/36 » 1.00



      • In the first column, list all possible outcomes. Those are also the values for the outcomes.


      • In the second column, record the probability the outcome will happen. There are 36 possible dice rolls, so the probability is the (number of times the outcome takes place) ¸ 36.


      • The third column is for what’s called the probabilistic worth. It’s found by multiplying the value of an outcome by the probability that it happens.


  3. Now what should your strategy be for playing the dice game?
  4. I would predict "30" every time, since it provides the greatest worth, and there is no penalty for incorrect guesses.

  5. The expected value of an event is defined as the sum of the probabilistic worths of all the possible outcomes. Suppose you buy a $1 ticket for a raffle. One grand prize valued at $1,000 will be awarded. Two second prizes valued at $300 each will be awarded, and fifty consolation prizes valued at $20 each will be awarded. There are 10,000 tickets sold in the raffle. What is the expected value of one ticket? What is the meaning of the answer?
  6. $1000 (0.0001) + $300 (0.0002) + $20 (0.005) – $1 (1.000) = – $0.74

    Each time you "play" this raffle, you can expect to lose about 74 cents.

  7. What is the expected value of the dice game played on the previous page? What is the meaning of that answer?
  8. (1/36) + (4/36) + (6/36) + (12/36) + (10/36) + (24/36) + (16/36) + (9/36) + (20/36) + (48/36) + (30/36) + (16/36) + (36/36) + (40/36) + (48/36) + (25/36) + (60/36) + (36/36) = 49/4

    As a long-term behavior, the average payoff for the dice game is around 12.25 points.

  9. The table below shows data from a high school that was testing for steroid use among its athletes. Did the school save money by pooling blood samples?


  10. Number of Tests

    1

    2

    3

    Proportion of samples

    0.28

    0.31

    0.41

    The average number of blood tests needed would be calculated in this way:

    (1) (0.28) + (2) (0.31) + (3) (0.41) = 2.13.

    So, the school would have been better off testing the athletes individually.

  11. What does the mathematics of expected value have to do with the unit problem of steroid testing?
  12. The number of tests in each logical category is the "value," and the fraction of times those categories come up is the "probability." The expected value for this situation is the average number of tests required, which indicates whether it is cost-effective (below 2.00) or not.





Activity 3—Finally, A Model

   

 = 12, 14, 15

In modeling whether to pool samples when testing for drug use, we need to determine how the variables are related. We speculated that increases in the probability of steroid use (p) produce proportional increases in the expected number of tests required (E). Even though the data look linear, we found that the line connecting the first and last data points was a poor model.

In this activity, you are going to use the calculator’s statistical regression ability to fit the data in the "best" way possible. The menu that you will use allows you to "fit" data with functions that are linear, exponential, quadratic, or any other kind you might want to try. Regression equations will balance the number of data points on either side of the line or curve, and will minimize the sum of the squares of the residuals. Modelers must also pay attention to which kind of equation to use. To determine that, you must look for a random pattern in the residual plot (errors vs. probability).



Part 1: Linear Regression (uses Activity 3 Handout 4)

  1. In the 2nd column, record the data for the expected values (E) obtained from the work done in Activity 2.
  2. Enter the data from the table into your calculator. On the grid provided, draw a copy of the scatterplot of your data.
  3. Do a linear regression on the data. Record the equation obtained in the appropriate place on the handout. Sketch the graph of the line into your scatterplot.
  4. Calculate the predicted values, using the regression equation. Record the values in the 3rd column of the data table.
  5. Calculate the errors in using that equation to "model" the data. Record those values in the 4th column of the data table.
  6. On the other grid provided, draw the residual plot for this regression analysis.


Part 2: Exponential Regression (uses Activity 3 Handout 5)

Repeat the steps from the previous part of the activity, using exponential regression.





HANDOUT 4—Answers

Linear Regression for Activity 3

Equation:  E = 1.972p + 1.165



Prob.
(p)

Observed
Value (E)

Expected
Value (E)

Errors
(Residuals)

0.0

1.000

1.165

–0.165

0.1

1.300

1.362

–0.062

0.2

1.554

1.5601

–0.006

0.3

1.879

1.757

0.122

0.4

2.042

1.954

0.088

0.5

2.237

2.151

0.086

0.6

2.418

2.348

0.070

0.7

2.632

2.546

0.086

0.8

2.727

2.743

–0.016

0.9

2.873

2.940

–0.067

1.0

3.000

3.137

–0.137





HANDOUT 5—Answers

Exponential Regression for Activity 3

Equation:   E = 1.228 (2.767)p




Prob.
(p)

Observed
Value (E)

Expected
Value (E)

Errors
(Residuals)

0.0

1.000

1.228

–0.228

0.1

1.300

1.360

–0.060

0.2

1.554

1.506

0.048

0.3

1.879

1.667

0.212

0.4

2.042

1.846

0.196

0.5

2.237

2.043

0.194

0.6

2.418

2.262

0.156

0.7

2.632

2.505

0.127

0.8

2.727

2.773

–0.046

0.9

2.873

3.070

–0.197

1.0

3.000

3.399

–0.399




Part 3: Quadratic Regression (uses Activity 3 Handout 6)

Repeat the steps from the previous parts of the activity, using quadratic regression.







HANDOUT 6—Answers

Quadratic Regression for Activity 3

Equation:   E = –1.028p2 + 3.0p + 1.01



Prob.
(p)

Observed
Value (E)

Expected
Value (E)

Errors
(Residuals)

0.0

1.000

1.011

–0.011

0.1

1.300

1.301

–0.001

0.2

1.554

1.570

–0.016

0.3

1.879

1.818

0.061

0.4

2.042

2.046

–0.004

0.5

2.237

2.254

–0.017

0.6

2.418

2.441

–0.023

0.7

2.632

2.607

0.025

0.8

2.727

2.753

–0.026

0.9

2.873

2.878

–0.005

1.0

3.000

2.983

0.017

 


Optional Part: There are more types of regression built into the calculator you are using. Try some of the other methods, but be sure to follow the steps as you did earlier. And remember to record everything as part of this activity work.



Part 4: Finally, A Model!

  1. What do you think is the connection between the probability (p) and the expected value (E) in testing for steroid use?
  2. It’s quadratic. The model for these data is described by the equation:

    E = –1.028p2 + 3p + 1.01

  3. How did you arrive at that conclusion?
  4. When the data were "fit" with a quadratic equation, the residual pattern was random. The actual equation was found by doing quadratic regression on the calculator, so it is the type of regression that minimizes the sum of the squares of the errors.

  5. Think back over the process used to arrive at this model. In simulating the drug testing, you either used a calculator program with 1000 trials or drew markers 100 times. If you had analyzed data that represented only 10 trials instead:


    1. Would it have changed the kind of model you chose (linear, exponential, quadratic, or so on.)?
    2. No, the nature of pooling samples is quadratic; the number of trials won’t change that.

    3. Would it have changed the actual equation you recorded?
    4. Yes, the regression equation was based on the data that were generated by the calculator program having done 1000 trials. Unless we got the exact same numbers when doing only 10 trials, we would get a slightly different equation.

    5. Would it be a "better" model? Explain what that means.
    6. It is likely that the experimental results will more closely approach the expected value as the number of trials increases.





Homework 3—It’s a Good Fit

   

Below are two data sets. Graph each separately; be sure to indicate the scales used. For each graph, find the equation for the line that you think models the data most closely. Plot the line, then plot the residuals. Does there seem to be a pattern to the residuals? Do you think a line is the best kind of equation to fit these data?


  1. x

    –4

    10

    22

    32

    36

    51

    62

    68

    78

    89

    96

    y

    39

    66

    102

    145

    192

    233

    285

    355

    414

    481

    533

    The line of best fit is roughly y = 5.1x + 8.2 The residuals seem to "curve upward" (line is not the best kind of equation).


  2. x

    4

    6

    20

    30

    40

    47

    60

    70

    79

    92

    99

    y

    65

    58

    52

    46

    44

    32

    31

    22

    18

    14

    13

    The line of best fit is approximately y = –0.54x + 63. The residuals seem to be scattered around the zero line, so a line is the best kind of equation to use.





Activity 4—Which Model Fits Best?

   

We’ve seen how the kinds of models that you use to describe the patterns in data are varied—linear, exponential, quadratic, and so on. Each can be used to fit the data, but one will work best by producing a random residual plot.

Work on the data set(s) assigned to your group. Prepare an analysis of your data, make the graphs of your data and the residuals, and state the model that best fits your data. Write a summary of your results and a logical argument defending your choice of model for your group’s presentation to the class. If the data are nonlinear, explain what might cause the rate of change to not be constant.



Data Set 1

Volume of a Drop of Water From a BeralTM Pipet

(A Beral pipet is similar to a medicine dropper and is used in chemistry experiments.)

Number of Drops

20

30

40

20

7

14

21

5

35

55

Volume (ml)

0.92

1.38

1.85

0.88

0.30

0.60

0.90

0.20

1.52

2.48

This is strongly linear: y = 0.029 + 0.046x



Data Set 2

Oil Leaking from a Container

Time (hr)

0.5

1.5

2.5

3.5

Rate (liters/hr)

32

27

24

22

The line of best fit is y = –3.3x + 32.85. From the graph, you can see a definite pattern. A quadratic function will probably fit it (since a quadratic can be found to fit most data sets with only 4 points). It is not linear because less pressure pushes the oil out of the hole as the amount of oil decreases.

The best quadratic fit is y = 0.75x2 – 6.3x + 35



Data Set 3

U.S. Electricity Consumption

E is billions of kilowatt-hr/year.

(Years are set up so that 1900 is year 0)

Years

12

17

20

29

36

45

55

60

65

70

80

87

E

12

25

39

92

109

222

547

755

1055

1531

2286

2455




This graph is obviously not linear. The increase in the use of electricity has not been constant since 1990. The best fit may be exponential, although the curve seems to follow a logistic pattern toward the end. (If you drop the last two data points, the exponential fit is much better.)



Data Set 4

A Falling Stone

Time (sec)

0

1

2

3

4

5

Distance (meters)

0

4.7

19.8

45.1

80.3

124.9


This is quadratic: y = 4.99x2 + 0.05x – 0.15



Data Set 5

One Swing of a Pendulum

Length (cm)

10

20

30

40

50

60

70

80

Time (sec)

0.6

0.9

1.1

1.4

1.4

1.5

1.7

1.8



If students use the power function, they will find that the function y = 0.19x0.52 approximates the data nicely. (The actual relationship is very nearly a square-root function.)





Activity 5—Verifying the Model as Quadratic

   

 = 16, 17, 18

We’ve seen how the data collected on the expected number of tests are best described by a quadratic function. To be completely satisfied with this result, it would be nice to understand why it is quadratic in nature. Think back to the original simulation for this problem (Testing Game). We assumed that the probability of a positive test occurring would be 0.50. Let’s see how the problem relates to the square below, which is divided into regions. The sides of the square all have a length of 1. The total area represents a total probability of 1. The rows represent the first person’s probabilities on the test, and the columns represent the second person’s probabilities.


Part 1: Specific example

    1. What outcome does the region EFIH represent?
    2. Both people testing negative

    3. What outcome does the region EFIH represent?
    4. (0.5)(0.5) = 0.25

    1. Under what conditions are exactly two tests needed?
    2. When the combined sample was positive and first of the two athletes tested negative, the second athlete had to be positive. Then exactly two tests were needed. Thus, since the first person gets the first individual test, two tests are needed when only the second person tests positive.

    3. What region(s) of the figure represent(s) the probability of needing exactly two tests?
    4. DEHG

    5. What is the probability of needing exactly two tests?
    6. (0.5)(0.5) = 0.25

    1. Under what conditions are three tests needed?
    2. When the combined sample was positive and the first athlete was positive, then three tests are needed. Whenever the first person tests positive, three tests will be needed.

    3. What region(s) represent(s) the probability of needing exactly three tests?
    4. ABED and BCFE, or ACFD

    5. What is the probability of needing exactly three tests?
    6. (0.5)(0.5) + (0.5)(0.5) = 0.50 or (1.00)(0.5) = 0.50

  1. What is the sum of the probability answers to questions 1, 2, and 3? Explain the meaning of your result.
  2. One. Since you would always need 1, 2, or 3 tests, it would happen 100% of the time. Therefore, the probability should be 1.

  3. Write an expression for the expected number of tests, and then calculate it. Compare your answer to the value obtained in Activity 1.
  4. 1(0.25) + 2(0.25) + 3(0.50) = 2.25.

    In Activity 1, the expected value for a probability p = 0.5 was around 2.23. That was an experimental result based on the performance of the simulation; the answer of 2.25 is a theoretical result based on probability.

  5. Explain how the area model represents probabilities. How could you go about generalizing this problem?
  6. The area of the region actually corresponds to the value of the probability. Each region corresponds to two conditions (test result for first and second persons), and, in finding the overall probability of both events taking place, one must multiply the individual probabilities together. The dimensions of the regions correspond to the actual probabilities for those events, and, in calculating area, one must multiply the dimensions together.

    To generalize the problem, introduce variables to represent the probability that one of the people tests positive for steroid use.

  7. If p is the probability of testing positive, how can you represent the probability of not testing positive?
  8. Use the expression 1 – p



Part 2: Generalization

Let’s go through the same analysis as before, but this time we will use the variable p to represent the incidence of steroid use in the population rather than a specific number like 0.5. The diagram provided at right will help you answer the following questions.

  1. What probability is represented by the area of EFIH, the region requiring only 1 test?
  2. (1 – p)2

  3. What region(s) represent(s) the probability of needing exactly two tests? What probability is represented by the area of that region?
  4. DEHG, because this area represents a positive pooled sample and a negative first individual test. The probability for that case is represented by p (1 – p).

  5. What region(s) represent(s) the probability of needing exactly three tests? What probability is represented by the area of that region?
  6. ACFD; students may also say there are two—BCFE and ABED. The probability for that situation is represented by p.

  7. Write an algebraic expression that represents the total area for the model as a sum of its parts.
  8. (1 – p)2 + (1 – p)p + p(1 – p) + p2

  9. Simplify the algebraic expression you wrote for Question 4 and show that the total has a value of 1.
  10. (1 – p)2 + (1 – p)p + p(1 – p) + p2 = 1 – 2p + p2 + pp2 + pp2 + p2 = 1

  11. Write an algebraic expression for the expected number of tests needed when pooling samples.
  12. 1(1 – p)2 + 2(1 – p)p + 3(p(1 – p) + p2)

  13. Use the distributive property to simplify the algebraic expression you wrote for Question 6.
  14. 1(1 – p)2 + 2(1 – p)p + 3(p(1 – p) + p2) = 1(1 – 2p + p2) + 2(pp2) + 3(pp2 + p2) = 1 – 2p + p2 + 2p – 2p2 + 3p = 1 + 3pp2

  15. Compare the equation you just got in Question 7 with the model developed from doing quadratic regression from Activity 3.
  16. The model developed by using quadratic regression was given by the equation E = –1.028p2 + 3p + 1.01. You can see that they are both quadratic, they have exactly the same linear coefficient, and the other two coefficients are almost exactly the same.





Supplemental Activity 5—Introduction to the Parabola

   

Part 1

  1. To begin, let’s look at the simplest parabola, y = x2. Complete the table below, and then, using your graphing calculator, plot some points and sketch the curve.


  2. x

    0

    1

    3

    –1

    2

    –2

    can’t

    –1.414

    2/3

    1/2

    –2.24

    2.24

    y

    0

    1

    9

    1

    4

    4

    –4

    2

    4/9

    1/4

    5

    5



  3. The following four tables represent parabolas that have been changed. Use the information that you just found to complete the table, plot the points, and sketch the graph. Each person in the group should tackle one of the problems, then share their work with the group before going on to answer Question 3.
  4. Person 1: y = (x + 3)2

    x

    –3

    –2

    0

    –4

    –1

    –5

    can’t

    »–1.59

    –2 1/3

    –2 1/2

    »–0.76

    »–5.3

    y

    0

    1

    9

    1

    4

    4

    –4

    2

    4/9

    1/4

    5

    5


    Person 2: y = x2 – 3

    x

    0

    1

    –1

    2

    –2

    3

    –3

    1/2

    3/2

    –3/2

    y

    –3

    –2

    –2

    1

    1

    6

    6

    2

    2

    –1 1/4

    –3/4

    –3/4


    Person 3: y = (x 2)2

    x

    2

    1

    –1

    3

    0

    4

    can’t

    2+

    8/3

    5/2

    2+

    6

    y

    0

    1

    9

    1

    4

    4

    –4

    2

    4/9

    1/4

    5

    16


    Person 4: y = x2 + 2

    x

    0

    1

    –1

    2

    –2

    3

    –3

    1/2

    3/2

    –3/2

    y

    2

    3

    3

    6

    6

    11

    11

    7

    7

    9/4

    17/4

    17/4



    1. How would you describe the shift for the new graph you drew, from where you drew the graph of y = x2?

      Person 1: Shifts 3 units to the left

      Person 3: Shifts 2 units to the right

      Person 2: Shifts down 3 units

      Person 4: Shifts up 2 units


    2. The lowest point (vertex) for the graph of y = x2 is (0,0). Where is this point located for your graph?
    3. Person 1: (–3, 0)

      Person 3: (2, 0)

      Person 2: (0, –3)

      Person 4: (0, 2)


  5. What would be a rule that would explain how to draw the graph of a parabola that has been changed by adding or subtracting a value in the equation for the parabola?
  6. Adding/subtracting before the squaring operation shifts to the left/right.

    Adding/subtracting after the squaring operation shifts up/down.


Part 1 Graphs

Part 2

You can stretch functions by multiplying or dividing by a number in particular places in equations. Recall what the basic parabola, y = x2, looks like; a table of values has been listed again to help you get started.

  1. Work in groups; use your graphing calculators to complete the tables, plot some points, and sketch the curves.
  2. y = x2

    x

    0

    1

    –1

    2

    –2

    3

    –3

    y

    0

    1

    1

    4

    4

    9

    9



    Person 1:
    y/2 = x2
    Person 2:
    y = (x – 2)2 – 1

    x

    0

    1

    –1

    2

    –2

    3

    –3

     

    x

    2

    1

    3

    0

    4

    5

    –1

    y

    0

    2

    2

    8

    8

    18

    18

     

    y

    –1

    0

    0

    3

    3

    8

    8



    y = (x/2)2 y = (x/2 – 2)2 – 1

    x

    0

    2

    –2

    4

    –4

    6

    –6

     

    x

    4

    2

    6

    0

    8

    10

    –2

    y

    0

    1

    1

    4

    4

    9

    9

     

    y

    –1

    0

    0

    3

    3

    8

    8



    Person 3:
    y = 3x2
    Person 4:
    y = (x/4)2

    x

    0

    1

    –1

    2

    –2

    3

    –3

     

    x

    0

    4

    –4

    8

    –8

    12

    –12

    y

    0

    3

    3

    12

    12

    27

    27

     

    y

    0

    1

    1

    4

    4

    9

    9



    y = (x/2 – 1)2 – 2 y = (x – 1)2 – 2

    x

    4

    2

    6

    0

    8

    10

    –2

     

    x

    2

    1

    4

    0

    –1

    3

    –2

    y

    –1

    –2

    2

    –1

    7

    14

    2

     

    y

    –1

    –2

    7

    –1

    2

    2

    7



  3. How would you describe the way the new graph looks, compared to y = x2?


  4. Person 1:

    First one is thinner;
    second one is wider.

     

    Person 2:

    First one is shifted to the right by 2 and down 1; second one is shifted the same, but opens wider.

    Person 2:

    First one is narrower; second one is shifted to the right 1 and down 2, and also wider.

     

    Person 4:

    First one opens wider; second one is shifted to right 1 and down 2.



  5. What rule would explain how to draw the graph of a parabola that has been changed by multiplying or dividing a value in the equation for a parabola?
  6. Multiplying the x-variable by a number greater than 1 (or dividing the y-variable) will "stretch" the graph, making it steeper and the U-shape narrower. Dividing the x-variable by a number greater than 1 (or multiplying the y-variable) will "shrink" the graph, making the U-shape wider.


Part 2 Graphs



Part 3

The form y = a(xh)2 + k is called the vertex form for a parabola because the graph has its vertex at (h, k). The form y = ax2 + bx + c, where a, b, and c are numbers, is called the standard form for a parabola.

  1. Let’s start with an equation in vertex form: y = 2(x – 3)2 – 1. Describe the three transformations that have been done to the standard (parent) graph y = x2 to produce this new equation.
  2. The standard graph was first shifted to the right by 3 units, then stretched by a factor of 2. Finally, it was shifted down 1 unit.

  3. The steps to changing the graph into standard form are shown below. Describe the algebra that is being applied in each step.
  4. y = 2(x – 3)2 – 1

    Given problem

       = 2(x2 – 6x + 9) – 1

    Exponents first; square the binomial using FOIL

       = 2x2 – 12x + 18 – 1

    Distribute multiplication over the parentheses

       = 2x2 – 12x + 17

    Combine like terms (two constant terms)

  5. Now, we want to be able to go in the reverse direction, and change an equation from standard form to vertex form. Only three steps are needed to make the change. Start with the equation:
  6. y = 2x2 – 12x + 17

    1. Recognize that the constant "a" is the same in both forms. What is the value of "a" if we put our equation in the vertex form?
    2. "a" is the same in both so a = 2

      Teacher Note: Proof of the above statement follows.

      y = a(xhk)2 + k

      y = ax2 – 2ahx + (ah2 + k)


    3. Graph the equation, y = 2x2 – 12x + 17, on your calculator and estimate the coordinates of the vertex. These coordinates are the values to h and k. What are the values of h and k for our equation.
    4. The coordinates of the vertex are (3, –1), do h = 3, and k = –1

    5. Use the known values for a, h, and k to write the equation in the vertex form.
    6. y = 2(x – 3)2 – 1

  7. Change the following from standard form into vertex form or vice versa.


    1. y = 3x2 + 12x – 2
    2. y = 3(x + 2)2 – 14

    3. y = 1.5(x – 3)2 + 4
    4. y = 1.5x2 – 9x + 17.5






Homework 5—Quadratic Nature of Pairing Samples

   
  1. Let’s repeat our analysis from Activity 5 on the expected number of tests required for a particular probability. This time, we will assume the probability of testing positive in the general population is 0.3.

    1. What is the expected number of tests in this situation?
    2. 3(0.3)(1) + 2(0.7)(0.3) + 1(0.7)(0.7) = 1.81

    3. Compare your answer to part a) with the value obtained in Part 1 of Activity 5, where you assumed the probability for testing positive was 0.5. Did the expected number of tests go up or down?
    4. The expected value went down, which agrees with the intuition of the problem (less people taking steroids will mean more single tests, because the pooled sample will test negative) and the nature of the model developed (function is strictly increasing on the interval [0..1], so, if p decreases, E should go down as well).

    5. Compare the diagram above with the diagram used in Part 1 of Activity 5. What is different about it? How does that help you understand how changing the probability affects the expected number of tests?
    6. The rectangle ACFD, which represents the region in which three tests are needed, is thinner. The rectangle DEGH, which represents the region in which two tests are needed, is longer and thinner. The square EFIH, which represents the region in which only one test is needed, is larger. Shifting a number of "three tests" trials to "needing one test" means that the average number of tests should go down, which is what we’ve been calling the expected value. While the area of DEGH didn’t change much (0.25 to 0.21), eventually it would be much thinner and would also decrease appreciably.

    7. What would the diagram look like if the probability for testing positive was 0.0? What would it look like if the probability was 1.0? What does the area model predict for the expected number of tests under those conditions?
    8. If p = 0, square EFIH would cover the entire region ACIG and the expected number of tests would be 1(1)(1) = 1. If p = 1, square ABED would cover the entire region ACIG and the expected number of tests would be 3(1)(1) = 3. Both of those results agree with the actual testing problem.

  2. Now let’s consider the generalized problem shown by the area diagram below. The general equation developed in Activity 5 was given by the equation: E = –p2 + 3p + 1.

    1. What is E(0.5)? (What does the equation predict the expected number of tests to be, when the probability is 0.5?)
    2. –(0.5)2 + 3(0.5) + 1 = 2.25

    3. What is E(0.3)?
    4. –(0.3)2 + 3(0.3) + 1 = 1.81

    5. Compare those answers to the ones obtained in Part 1 of Activity 5 and this Homework.
    6. They are the same.

    7. What is E(0.0)?
    8. –(0.0)2 + 3(0.0) + 1 = 1.00

    9. What is E(1.0)?
    10. –(1.0)2 + 3(1.0) + 1 = 3.00

    11. Fill in the following chart; several of the lines have already been done. How does this help you understand what happens as the probability of testing positive changes in the population?
    12. As p gets closer to 0, the term that gives the expected value its greatest influence (+ 3p) goes toward 0. The term that adjusts the expected value downward (–p2) also goes to 0. Therefore, the expected value heads toward the constant +1.

      As p gets closer to 1, the first and third terms will cancel each other out and the expected value will head toward the value of the middle term.


      p

      p2

      +3p

      +1

      0.0

      0.00

      0.00

      +1.00

      0.1

      –0.01

      +0.30

      +1.00

      0.2

      –0.04

      0.60

      +1.00

      0.3

      –0.09

      +0.90

      +1.00

      0.4

      0.16

      +1.20

      +1.00

      0.5

      –0.25

      +1.50

      +1.00

      0.6

      –0.36

      +1.80

      +1.00

      0.7

      –0.49

      +2.10

      +1.00

      0.8

      –0.64

      +2.40

      +1.00

      0.9

      –0.81

      +2.70

      +1.00

      1.0

      –1.00

      3.00

      +1.00



    13. Consider the expected value as a "weighted" area: one EFIH, plus two DEHG, plus three ABED, plus three BCFE. That should give you p2 + 3p + 1. Explain how that is possible.
    14. 1 EFIH + 1 DEHG + 1 ABDE + 1 BCFE = 1 (sum of probabilities)

      1 DEHG + 1 ABDE = 1 p

      2 ABDE + 1 BCFE = 2 p

      –1 ABDE = –p2

      Therefore, 1 EFIH + 2 DEHG + 3 ABDE + 3 BCFE = –p2 + 3p + 1





Activity 6—Solving the Problem

   

 = 13, 19, 20

The final task in modeling the pooling of blood samples is to answer the question, "When is it cost-effective to pool samples? When is it going to save us money?"

Part 1: Calculator Method

  1. Write the equation that models the testing situation.
  2. E = –p2 + 3p + 1

  3. How many tests would we expect per pair, if we were testing everybody individually? If we want to save money, what should we be looking for?
  4. 2 tests would be required per pair. To save money, we should keep the average number of tests below 2.

  5. One way to solve the problem would be to consider the equation that models the situation and the condition on the problem at the same time. Using your graphing calculator, make y1 be the expression you wrote as the answer to Question 1 and y2 be the answer you wrote for Question 2.
    1. What WINDOW settings are appropriate for the problem?
    2. Answers will vary. One possibility is: xmin = –0.1, xmax = 1.1, xscl = 0.1, ymin = –0.1, ymax = 3.1, yscl = 0.5

    3. Describe the graph in words.
    4. The expected value graph curved slightly downward, starting at 1 and going up to 3. The condition graph is a horizontal line at 2.

    5. Use TRACE to find the point(s) of intersection of the two graphs, and record the x-value(s). What is the meaning of the x-value, in the context of comparing costs between testing athletes by combining samples and by testing them individually?
    6. p » 0.38; if the population’s proportion of positive tests is at least 38%, then pooled testing costs more.


Part 2: "Completing the Square" Method

In previous work in this unit, you learned how to use area models to obtain the quadratic model –p2 + 3p + 1. We used the formula to predict the expected number of tests when p = 0.5. Setting up the equation –p2 + 3p + 1 = 2 allows us to work backward, and determine what the probability would have to be if we want the expected number of tests to be two (break-even point). This is an important enough area of mathematics that we will actually develop two methods for solving this quadratic equation. The first method is called completing the square. It is based on the same idea as the area models.

  1. Here is a quadratic expression:
  2. 2(x + 3)2 + 2

    List the order in which you evaluate this expression, and then evaluate the expression using x = 4.

    Inside parentheses, do the addition

    4 + 3 = 7

    Exponents

    72 = 49

    Multiplication

    49 * 2 = 98

    Final addition

    98 + 2 = 100

  3. Knowing the order of operations can help in solving equations. Use the results from the previous question to solve the equation: 2(x + 3)2 + 2 = 20
  4. Cancel final addition operation by subtracting 2

    2(x + 3)2 = 18

    Cancel the multiplication by dividing by 2

    (x + 3)2 = 9

    Cancel the squaring by finding thesquare root

    x + 3 = 3 or –3

    Cancel the addition operation by subtracting 3

    x = 0 or –6

    The second method for solving quadratic equations uses the vertex form.

  5. In the general equation y = a(xh)2 + k, if you knew the values for y, a, h and k, describe how you would solve the equation for x.
  6. The importance of the vertex form for a quadratic equation is this: if you can express a quadratic equation in the form y = a(x – h)2 + k, you can solve it by undoing the order of operations. Now, let’s concentrate on writing quadratics in vertex form.

  7. Graph each of the following "perfect square" expressions.


    1. y = (x + 4)2
    2. y = (x – 5)2
  8. What do the graphs of these perfect square expressions have in common?
  9. All graphs are tangent to the x-axis.

  10. For the equation y = x2 – 4x
    1. Sketch the graph.


    2. Determine what must be added to the expression to make it a perfect square.


    3. Graph the new equation as well.


    4. Write the new expression as a perfect square.


    5. +4; y = (x – 2)2

  11. For each of the following, follow the instructions for Question 9 and rewrite each as a perfect square.


    1. x2 + 8x
    2. x2 + 8x + 16 = (x + 4)2

    3. x2 – 10x
    4. x2 – 10x + 25 = (x – 5)2

  12. How does the middle number (linear coefficient) in the expanded form compare to the terms in the "square" form?
  13. Half the linear coefficient is the constant in the square form.

  14. Now, let’s put it together and solve the quadratic equation x2 – 3x = 10 by "completing the square."


    1. Graph the equation.
    2. What do we need to add to make the left side a perfect square (or, what number do we need to add to the left side to make it touch the x-axis exactly once)?
    3. 2.25 or 9/4

    4. Add that number to both sides and graph again.
      x2 – 3x + ____ = 10 + ____
    5. x2 – 3x + 9/4 = 10 + 2.25

    6. Now write the squared form of the left side and simplify the right side.
    7. (x – 3/2)2 = 12.25

    8. Write the solutions.
    9. + 1.5 = 5 or – + 1.5 = –2

  15. The equation that represented the "break-even" point in testing pooled samples is given by the quadratic equation –p2 + 3p + 1 = 2. Solve that quadratic equation by "completing the square."
  16. First, it will be easier to work out the steps if the leading term is made positive and the constants are combined on one side of the equation.

    Multiply through by –1 to get:

    p2 – 3p – 1 = –2

    Add one to each side to make it:

    p2 – 3p = –1

    You need to increase the expression by (half the value of the linear term)2 to complete the square.

    Add 2.25 to both sides to get:

    p2 – 3p + 2.25 = 1.25

    Rewrite it as:

    (p – 1.5) = 1.25

    Find the square root on both sides:

    p – 1.5 = +

    Add 1.5 to both sides

    p = + + 1.5

    » 0.62 or 0.38


    Note: Only one of the answers makes sense in the context of the probability of testing positive for steroid use.


Part 3: Quadratic Formula Method

The process of completing the square applies as well to the general form of the quadratic equation, ax2 + bx + c = 0. By solving the quadratic equation in the general case, the quadratic formula is obtained.

  1. In Part 2, you developed a method for solving quadratic equations. Review the method of completing the square by using it to solve the equation: 2x2 + 20x + 18 = 0.


  2. x2 + 10x + 9 = 0

    x2 + 10x = 0 – 9

    x2 + 10x + 25 = –9 + 25 = 16

    (x + 5)2 = 16

    x + 5 = + 4

    x = –5 + 4 = –9 or –1



  3. Write a set of instructions summarizing what you did in Question 1 so that someone else could use them to solve that equation.
  4. Steps should include (i) completing the square by isolating the constant term, making the leading coefficient 1 (by dividing by a), and adding to both sides the square of half the new linear coefficient; and (ii) solving by taking the square roots and adding or subtracting, as indicated by arrow diagrams.

  5. Apply your instructions to solve the equation ax2 + bx + c = 0.


  6. The quadratic formula: x =

    Note to Teacher: Quadratic Formula Derivation



  7. Use the quadratic formula to solve the equation: 3x2 + 10x + 7 = 0.
  8. x = = = or





Assessment—Working With Quadratics

   
  1. Identify all values of x that make the equations true:


    1. (x – 3)2 = 4
    2. 1 or 5

    3. (3x – 1)2 = 9
    4. 4/3 or –2/3

  2. Write the expanded form for:


    1. (x – 3)2 =
    2. x2 – 6x + 9

    3. (3x – 1)2 =
    4. 9x2 – 6x + 1

  3. Complete each square, then write the squared form:


    1. x2 – 16x + _____
    2. 64; (x – 8)2

    3. x2 + 7x + _____
    4. 49/4; (x + 7/2)2

  4. How do you know that (x – 5)2 is not equal to x2 + 25? Explain. Diagrams may be used.
  5. Sample answers: Squaring (x – 5) will give a middle term of –10x. The graph of (x – 5)2 is tangent to the x-axis, while the graph of x2 + 25 is not. Evaluate each expression for a particular number, and you will not get them to equal each other (except at x = 0).

For each of the following, solve by completing the square.

  1. x2 + 4x = 12
  2. x2 + 4x + (2)2 = 12 + 4

    (x + 2)2 = 16

    = +

    (x + 2) = +4

    x = –2 + 4 = 2 or x = –2 – 4 = –6

  3. x2 + 8x – 11 = 0.
  4. x2 + 8x = 11

    x2 + 8x + (4)2 = 11 + 16

    (x + 4)2 = 27

    =

    x + 4 = +

    x = –4 +

  5. 5x2 – 10x – 30 = 0.
  6. x2 – 2x – 6 = 0

    x2 – 2x = 6

    x2 – 2x + 1 = 6 + 1

    (x – 1)2 = 7

    =

    x – 1 = +

    x = 1 +

  7. Complete the square for the following area model.
    1. As the model is currently drawn, what algebraic expression does it represent?
    2. x2 + 8x

    3. How many little squares (area of each = 1) would you need to add in order to complete the big square?
    4. 16

    5. What algebraic expression would the model then represent?
    6. (x + 4)2

    7. What algebraic expression would represent the sum of the various pieces being used to build the big square?
    8. x2 + 8x + 16

    9. Use the quadratic formula to solve the equation:

      2x2 + 5x – 9 = 0.

    10. x = = 1.21 or –3.71





Unit Summary

In this unit, the problem of combining blood samples in testing for steroid use was examined. The central question was to determine when it would be cheaper to test the pooled sample and when it would be cheaper to test samples individually. Assumptions were made in developing the model; they included assuming the cost of testing to be fixed and assuming that positive tests always correctly detected the presence of steroids. The logic of the problem dictated that the outcome would always be one, two, or three tests, which is a discrete range, but average results of long-term behaviors yielded data that seemed to be continuous over the same range. Simulations were performed and data collected. The data were then analyzed to determine the mathematical relationship that connected the probability of testing positive to the expected number of tests (on average). The model was verified by examining the situation in probabilistic terms, and then a solution was obtained for the "break-even" point.





Mathematical Summary

In simulating the testing for steroid use, first with manipulatives and then using a calculator program, the result of any one trial was always 1, 2, or 3 tests, but the average was any number between 1 and 3, inclusive. This distinction points to the fact that the event is discrete, but the long-term behavior isn’t. The law of large numbers came into play in the simulation, as a group of students did 10 trials, with lots of fluctuation on the average number of tests. The calculator programs were run with many more trials, and still provided fluctuations on the average number of tests. The interval containing all of the answers was much smaller, provided a better estimate of the expected value.

The ideas of probabilistic worth and expected value came into consideration because the trials had varying "payoffs." Sometimes a single trial needed a single test, while, at other times, a single trial needed three tests. The idea to "weigh" the trials with the number of tests needed is where probabilistic worth was introduced. Averaging all of those weighted results gave us an idea of what to expect as long-term behavior, or a gauge of what to expect on an individual trial, and that was the expected value for this situation.

Regression analysis was applied to the data collected to disprove the conjecture that the behavior in the data was linear. Further analysis showed that the relationship between the variables was actually quadratic. In all cases, it was the pattern in the residual plot that either said "Hey, we found the right model!" (random dots above and below the zero line) or "Nice try, but no cigar!" (pattern to the plot).

To verify the model, probability area models were used to explore the problem analytically. Since two people were being tested and they could be either positive or negative in their individual test results, the four regions of the square represented the four possible situations. The area of a region represented the probability that the situation represented by that region would happen. Multiplying the area by the number of tests would give the probabilistic worth for that outcome. Adding all those answers together gave the expected value for the testing situation. The difference is that this was the "theoretical" or expected value, not the results obtained from "experimental" averages. An expression was derived describing E(p), the way in which the expected value depends on the probability of steroid use in the population, and it turned out that the expression was quadratic.

Having discovered a new mathematical relationship, sometimes called a quadratic and other times called a parabola, some exploration into their properties was provided. The "parent graph"

y = x2

formed a "U" shape with the vertex located at the origin. Translation transformations were investigated. Replacing the x with an expression xh moved the graph over h units to the right. If h was a negative number, it moved the graph to the left instead. Similarly, replacing the y with an expression yk moved the graph up k units. If k was a negative number, it moved the graph down instead. The quadratic

(yk) = (xh)2

has a vertex at point (h, k).

Stretching transformations were also investigated, in which the x2 and y occurrences were multiplied or divided by numbers. The effect of that act was to cause the graphs to open wider or close narrower. The vertex form of the quadratic

y = a(xh)2 + k

is translated by h and k and stretched by a.

The algebra of quadratics was also explored. Vertex form for a quadratic, which looks like

yk = a(xk)2

and standard form, which looks like

y = ax2 + bx + c

were introduced, and methods to go from one form to the other were practiced. Solving quadratic equations, which was necessary to actually get the answer for which we had built the entire model, is such an important skill that three different methods were introduced. The first method was to graph each side of the equation as a separate function and TRACE to the intersection point. The second method was a technique called "completing the square." The third method was to develop a general formula, called the quadratic formula

which allows you to find the solutions by simply plugging in the three numbers from the standard equation.





Key Concepts

Average Number of Tests: Long-term behavior of the testing game found by taking the total number of tests and dividing it by the number of trials that were done.

Domain, Range for the Testing Problem: The domain would be the allowed values for the probability of testing positive, [0...1]. The range would be the expected number of tests needed, [1...3]

Probabilistic Worth: Mathematical quantity that is defined as the product of the outcome times the probability that the outcome takes place.

Expected Value: Sum of all the probabilistic worths; describes the average payoff for an event that might happen a lot of times.

Law of Large Numbers: A concept in mathematics that, says that, as the number of trials gets large, the average (experimental) behavior will tend to be closer to the expected (theoretical) behavior.

Regression: An analysis that "fits" data to mathematical functions by minimizing the sum of the squares of errors

Residuals: How far off a prediction made by a mathematical function is from the observed data it’s trying to predict.

Stretch Transformations: When an expression or answer is multiplied or divided by a number, it has the effect of distorting a graph in the vertical or horizontal direction by stretching or shrinking it.

Shift Transformations: When a number is added to (or subtracted from) a variable in a function, it has the effect of moving the graph either to the left/right (x-variable) or up/down (y-variable).

Vertex: The place where a parabola reaches the bottom or top of its curve.

Line Symmetry: A line that goes through the vertex and divides the parabola into two equal (mirror image) sides

Domain, Range for a Quadratic: The domain is the allowed values for x; since all values are allowed, the domain is all real numbers. The range is the possible answers for y, which is determined by whether the quadratic is upright or inverted and where the vertex point is located.

Vertex-Form Equation of a Quadratic: yk = a(xk)2

Standard-Form Equation of a Quadratic: y = ax2 + bx + c

Completing the Square: Technique for solving a quadratic equation by putting the quadratic expression into vertex form. Then, steps for working backward to solve the equation become step-by-step arithmetic processes.

Quadratic Formula: Formula for solving a quadratic equation ax2 + bx + c = 0. The solution is found by evaluating the formula:

x =




Solution to Short Modeling Practice






Mini-Modeling Problem: Linear and Quadratic Equations



t (sec)

x (ft)

0.0

0.0

0.1

2.9

0.2

5.8

0.3

8.7

0.4

11.6

0.5

14.5

0.6

17.4

0.7

20.2

0.8

23.1

0.9

26.0

  1. The students’ tables of time values and x-values should appear as shown here. A graph of x vs t should clearly appear linear, as shown below.
  2. Students can deduce the two parameters for the linear graph: the slope and the intercept. The intercept is equal to 0, the distance at time t = 0 sec. The slope can be found by taking any pair of points, such as (0, 0.0) and (0.5, 14.5):

    Help students to avoid confusion with the typical notation used above, where y denotes the vertical axis and x denotes the horizontal axis. They should not be confused in this problem where the horizontal distance is the variable x and the independent variable is t. Thus, the linear equation for the horizontal distance is

    x = (29.0 ft/sec) t + 0

    Some students may perform a linear regression on the data to obtain:

    Slope = 28.9
    Intercept = 0.03
    r2 = 0.99999

    Approximately the same results.


    t (sec)

    y
    (ft)

    0.0

    7.0

    0.1

    10.3

    0.2

    13.3

    0.3

    15.9

    0.4

    18.2

    0.5

    20.2

    0.6

    21.9

    0.7

    23.3

    0.8

    24.3

    0.9

    25.1



  3. The student’s tables of time values and y-values should appear as shown here. A graph of y vs t should appear non-linear, as shown below.
  4. Students can suppose it is quadratic and determine the coefficients. In this case, it is probably most efficient to perform a quadratic regression to fit the data to an equation of the form y = ax2 + bx + c, yielding:

    a = –15.8

    b = 34.3

    c = 7.0

    r2 = 0.99998

    The strong correlation coefficient with the large number of data values indicates that the relationship probably is indeed quadratic. So the equation describing the vertical motion can be written:

    y = (–15.8) t2 + (34.3) t + (7.0)

  5. Use the horizontal distance equation to determine how many seconds it would take to reach the basket. Set x = 55 feet and solve for t.
  6. 55 ft = (29.0 ft/sec) t

    t = = 1.90 sec

    Substitute this time value into the equation for the vertical height and see whether the ball would have had the correct height when it reached the plane of the goal.

    y = (–15.8)(1.90)2 + (34.3)(1.90) + (7.0) = 15.1 ft

    Since the basket is at y = 10 ft, Kareem’s shot would have missed the goal, being over 5 feet high when reaching the basket. (As noted in the student text, we have considerably simplified the test for accuracy of Kareem’s shot here: ignoring the lateral aim, as well as the fine points about how the ball actually strikes the rim of the basket, and so forth. If the calculations had shown the ball to be within about 0.2 feet of the target, we would probably be inclined to say that the shot was "on target." You can ask your students for their opinions on this point: "How close must one be to make the basket?")


    Solution to Challenge

    The ball struck the TV equipment somewhere within it’s trajectory. So, we can at least say that the TV equipment must have been within the range of y-values predicted by the equation for the vertical distance. Students can graph the equation for y from above and find that the ball could have gone as high as 25.6 feet (or 7.8 meters) on its way to the basket. Thus, it is possible that the TV equipment was safely above the 7 meter limit and still interfere with Kareem’s shot. We cannot say with certainty that the equipment was below the 7 meter clearance.



Solutions to Practice and Review Problems


Exercise 1

50

Exercise 2

6.1

Exercise 3

60%

Exercise 4

30

Exercise 5

  1. 55 miles per hour


  2. y = 55x

Exercise 6

y = x +

Exercise 7

  1. The students need to think about this problem a bit, and then they will find the solution quite simple. The total possible ways to dial the last four numbers is the product of the number of choices for each of the four positions. In this case, this is 10 ´ 10 ´ 10 ´ 10, or 104. However, with the premise that the number is incorrectly dialed, one of the 104 possibilities is eliminated, leaving 9999 possibilities. And, of course, there is only one favorable outcome for this problem–your telephone number. Since all the possible outcomes are equally likely, the formula can now be applied.
  2. Probability =

    Probability =

    Probability = 0.0001 (rounded)

  3. This is similar to the case above: there are 1000 possible three-digit area codes (that is, 10 ´ 10 ´ 10). There is a total of 999 possible numbers that are incorrect, with only one of these equal to your area code.
  4. Probability =

    Probability =

    Probability = 0.001 (rounded)

Exercise 8

  1. There are 18 favorable ways to obtain Brand A out of a total of 70 possible ways.
  2. Probability =

    Probability =

    Probability = 0.257, or 25.7 (rounded)

  3. Yes, all the events are equally likely. This is because the calculators are distributed randomly. Thus, any one of the 70 calculators is equally likely to be distributed to you.
  4. The possible outcomes of this procedure are that you receive 1) Brand A, 2) Brand B, 3) Brand C, or 4) Brand D.

    Yes, these four outcomes are mutually exclusive. This means that you cannot be given one calculator and have it be both Brand A AND Brand C, for example. Any outcome can fall only into one and only one of the four possible outcomes listed above.

  5. Since the outcomes are mutually exclusive you can use the formula below.
  6. Prob {Brand A OR Brand B} = Prob {Brand A} + Prob {Brand B}

    First, the student must determine the probability of getting Brand B. As was done in Part a...

    Probability =

    Probability = 0.429, or 42.9% (rounded)

    Then,

    Prob {Brand A OR Brand B} = 0.257 + 0.429

    Prob {Brand A OR Brand B} = 0.686

    Or,

    Prob {Brand A OR Brand B} =

    Prob {Brand A OR Brand B} = 0.686

Exercise 9

  1. Only two possible events can occur. Either the tree IS INFESTED or it is NOT INFESTED.


  2. Yes, these two events are mutually exclusive—a tree cannot be both INFESTED and NOT INFESTED at the same time.
  3. No, they are not equally likely—at least you hope that there is more of a tendency to be not infested, rather than simply a 50-50 chance of infestation.

  4. Since the events are not equally likely, you must rely on the observed probability.
  5. Probability =

    Probability =

    Probability = 0.4, or 40%

Exercise 10

  1. The students’ charts should appear generally as shown below.


  2.  

    A

    A

    A

    AA

    AA

    B

    AB

    AB



  3. The probabilities can be computed by examining each outcome and the total number of outcomes. In this case, there are two possible outcomes: AA and AB. Each can occur in two cases, out of a total of four possible outcomes.


  4. So for AA,

    Probability =

    Probability =

    Probability = 0.5

    And, similarly for AB.

    Outcome

    Probability

    AA

    0.5

    AB

    0.5



  5. With four children you would expect the ratio of children with gene pairs of AA to all the children to be 1 : 2. Thus you would expect half of the four children, or two children, to have gene pairs of AA, or blood type A.
  6. Similarly, you would expect two children to have gene pairs of type AB, or blood type AB.

    Since none of the outcomes yielded a gene pair of BB, none of the children would be expected to have blood type B.

EXERCISE 11

  1. (8) (7) (6) (5) (4) (3) (2) (1) = 40,320 ways


  2. Probability = » 0.000025, or about 0.0025%

Exercise 12

y = 80,000(0.92)x ; about 52,727 people

Exercise 13

  1. The equation d = 9t is linear. The equation d = 0.9t2 is quadratic.



  2. (0,0) and (10,90). After ten seconds, both sprinters travel the same distance.

Exercise 14

  1. The new area of the field will be the new length (120 + x) times the new width (80 + x). Of course, it doesn’t matter which we call the length and which we call the width. So,
  2. New area = New length × New width

    New area = (120 + x)(80 + x)

    New area = 9600 + 200x + x2

  3. The new area desired is 20% more than the present area. So, with all measurements in feet, and the area result in square feet,
  4. New area = Present area + 20% of Present area

    New area = (120 × 80) + 0.20 (120 × 80)V

    New area = 11,520

    Equate this to the result of Part a, and simplify to get it into the form of a quadratic equation.

    9600 + 200x + x2 = 11,520

    x2 + 200x – 1920 = 0

  5. Use the quadratic formula, where a = 1, b = 200, and c = –1920.
  6. x =

    x =

    x = –100 ± 1/2

    x = 9.18 or –209.18

    Thus the farmer would need to plow an additional 9 meters of width and length to obtain a 20% increase in the plowed area. The negative root is not meaningful in this case.

Exercise 15

  1. Let h = 0 and solve for t in h = –16t2 + 1000.
  2. –16t2 + 1000 = 0

    t2 =

    t = ± » 7.9 or –7.9

    It will take about 7.9 seconds.

  3. Replace 1000 with 1500.
  4. T = ± » 9.68 or –9.68

    It will take about 9.68 seconds.

Exercise 16

  1. Let T be the total cost in dollars, H be the height, and the width be equal to 2H + 5. Then with a setup fee of $150 and the cost per ft2 of $10 we have:
  2. Total cost = Setup fee + Cost per ft2 (Height ´ Width)

    T = 150 + 10 (H )(2H  + 5)

    T = 150 + 50H + 20H2

  3. Substitute T = $4000 and put in form for quadratic formula.
  4. 4000 = 150 + 50H + 200H2

    20H2 + 50H – 3850 = 0

    One way to solve this equation is to use the quadratic formula, where a = 20, b = 50, and c = - 3850.

    H =

    H =

    H = –1.25 ± 1/40

    H = 12.68 ft or – 15.18 ft

    So, since the negative result is meaningless here, the result tells us that a sign 12.68 ft high is the maximum that can be afforded from this billboard company.

Exercise 17

  1. The correct choice is equation 1. The students will likely take a trial-and-error approach (trying various values of d to see what values of c develop) to answer this question, which is quite all right. They could also notice that the equation must give c = 1000 at d = 0, so equation 3 is eliminated. The shape of the parabola is upward, which eliminates equation 2. (They should know that the negative coefficient on the d 2 term will make the parabola open downward.) That leaves either equation 1 or equation 4. Equation 4 does not decrease since there are no negative terms, so it must be equation 1. In fact, trying a few values for d verifies that equation 1 is the correct choice.
  2. For = 0, c = 1000.

    For = 5, c = 600.

    For = 10, c = 300.

    For = 15, c = 100.

    Note that the domain of this application requires that only values for d between 0 meters and 20 (thousand) feet are valid.

  3. Set c = 10 and solve for d.
  4. 10 = 2d2 – 90d + 1000

    2d2 – 90d + 990 = 0

    The values for d that satisfy the above equation can be found using the quadratic formula, where a = 2, b = –90, and c = 990.

    d =

    d =

    d = 22.5 ±  1/4

    d = 19.1 thousand feet or 25.9 thousand feet

    Since the second value is not in the domain of our function, the first value is the correct answer: At d = 19.1 thousand feet the concentration is 10 ppm.

Exercise 18

Since this is basically a task in substitution and solving a quadratic, little guidance is offered the students. They should substitute P = 60 watts, V = 110 volts, and R = 30 ohms into the equation and solve the quadratic equation for I.

P = VII 2R

60 = 110 I – (30) I 2

30 I 2 – 110 I + 60 = 0

We’ll solve this quadratic using the quadratic formula, where a = 30, b = –110, and c = 60.

I =

I =

I = 1.833 ±  1/60

I = 3.00 amperes or 0.67 amperes

So, a current of either 0.67 ampere or 3.00 amperes will produce 60 watts of power at the load.

Exercise 19

  1. The total width is 20 inches. If two sections, each of length x, are bent up, the remaining width is 20 – 2 x (in inches).
  2. The height of the rectangular area is x and the width is 20 – 2x. Since the area of a rectangle is length times width, and this area must equal 38 in2,
  3. Area = Length × Width

    245 = (20 – 2x)(x)

    Multiply out the terms and put the result in standard form.

    38 = 20x – 2x2

    2x2 – 20x + 38 = 0

  4. Find the roots with the quadratic formula, where a = 2, b = –20, and c = 38.
  5. x =

    x =

    x = 5 ±  1/4

    x = 5 ±  2.45

    x = 7.45 in. and 2.55 in.

    Each root "fits" the problem here!

  6. The sketches should appear generally as shown above, showing a deep gutter for the root x = 7.45 in., and a shallow gutter for the root x = 2.55 in. The solution selected by the student is really arbitrary, but most would probably select the deeper gutter made using x = 7.45 in.

Exercise 20

  1. Substitute the span distance s = 40 ft and the radius of curvature
    R = 25 ft, and put in standard form.
  2. (40 ft)2 = 8(25 ft)h – 4h2

    4h2 – 200h + 1600 = 0

  3. We’ll solve this quadratic using the quadratic formula, where a = 4, b = –200, and c = 1600.
  4. h =

    h =

    h = 25 ±  1/8

    h = 40 ft or 10 ft

    The value of 40 ft probably won’t be accepted since that would describe an arch that was nearly a complete circle above the ground. You can draw a picture for your students that looks like the one shown above. Thus the second solution is the one we want: h = 10 ft. This depicts an arch like the one shown in the text, whose center height is 10 ft above the ground.