Testing 1,2,3 Unit

UNIT 5—Testing 123

Teacher Materials


CLICK THE  SYMBOL OF EACH SECTION HEADER TO RETURN HERE.
TEKS Support
Teacher Notes
 Context Overview
 Mathematical Development
 Preparation Reading—"Anabolic Steroids: Use and Effect"
 Discussion—Modeling Process
 Activity 1—"It’s Only a Test"
 Supplemental Activity 1—"Middle of a Model" (Manipulative Version)
 Homework 1—"Let’s Rumble"
 Activity 2—"Middle of a Model"
 Homework 2—"Expect to Be Worth Something"
 Activity 3—Finally, A Model
 Homework 3—It’s a Good Fit
 Activity 4—Which Model Fits the Best?
 Activity 5—Verifying the Model as Quadratic
 Supplemental Activity 5—Introduction to the Parabola
 Homework 5—Quadratic Nature of Pairing Samples
 Activity 6—Solving the Problem
 Homework 6—Practice With Quadratics
 HANDOUT 1—Activity 1 Part 2
 HANDOUT 2—Activity 1 Part 4
 HANDOUT 3—Dracula Program
 HANDOUT 4—Linear Regression for Activity 3
 HANDOUT 5—Exponential Regression for Activity 3
 HANDOUT 6—Quadratic Regression for Activity 3
 HANDOUT 7—Graph Grid for Supplemental Activity 5
Annotated Student Materials
 Preparation Reading—Anabolic Steroids: Use and Effect
 Activity 1—It’s Only a Test
 Supplemental Activity 1—Middle of a Model (Manipulative Version)
 Homework 1—Let’s Rumble
 Activity 2—Middle of a Model
 Homework 2—Expect to Be Worth Something
 Activity 3—Finally, A Model
 HANDOUT 4—Answers
 HANDOUT 5—Answers
 HANDOUT 6—Answers
 Homework 3—It’s a Good Fit
 Activity 4—Which Model Fits Best?
 Activity 5—Verifying the Model as Quadratic
 Supplemental Activity 5—Introduction to the Parabola
 Homework 5—Quadratic Nature of Pairing Samples
 Activity 6—Solving the Problem
 Assessment—Working With Quadratics
 Unit Summary
 Mathematical Summary
 Key Concepts
Solution to Short Modeling Practice
 Mini Modeling Problem: Linear and Quadratic Equations
Solutions to Practice and Review Problems



TEKS Support


This unit contains activities that support the following knowledge and skills elements of the TEKS.



(1) (A)

X

(4) (A)

X

(1) (B)

X

(4) (B)

X

(1) (C)

X

   
       

(2) (A)

X

(8) (A)

 

(2) (B)

X

(8) (B)

 

(2) (C)

 

(8) (C)

 

(2) (D)

X

   
       

(3) (A)

X

(9) (A)

 

(3) (B)

X

(9) (B)

 

(3) (C)

X

   

The mathematical prerequisites for this unit are

The mathematical topics included or taught in this unit are

The equipment list for this unit is




Teacher Notes






"Simple Model"—Context Overview

The unit examines testing strategies for sampling pairs of individuals when cost is a consideration. Pooling of samples may require only one test and therefore save money, or may require three tests, losing money in the process. Whether pairs of samples should be initially pooled for testing depends on the probability that a single result is positive.

Students gather data by performing Monte Carlo experiments assuming different probabilities, and try to determine the mathematical model for this situation. They verify empirical results, deriving a theoretical relationship between the incidence of occurrence and the expected number of tests. Finally, they determine the "break-even" point, where it’s more cost-effective to test the samples individually than to pool the samples and test them in pairs.





Mathematical Development

Deciding when it is appropriate to pool samples is explored through discussion of the modeling process, concrete activity, and calculator simulation. The concept of expected value plays a major role in establishing the condition for the problem. Data analysis, using least squares regression, determines the constants that produce the best fit for linear, quadratic, and exponential functions. Examination of the residual patterns provides a basis for choosing which of these functions best describes the data. Probability area models are introduced to verify that the relationship between the probability of testing positive and the expected number of tests required is quadratic, and to determine the exact quadratic that models the situation. Finally, methods for solving the resulting quadratic equation are explored, and the problem of determining when it is cost-effective is solved.





Preparation Reading—"Anabolic Steroids: Use and Effect"

The reading provides some background into the problem of detecting steroid use and the physiological effects of taking anabolic steroids. The intent of the reading assignment is to generate a discussion of the use of steroids in the general population, and to build some understanding for the context of testing for steroid use. Prior to class, students should read the contents of this article, and be prepared to discuss some of the following questions:

In addition, you might want to facilitate a discussion about current events that involve steroid testing or steroid use. Possibilities include: the Tour de France bicycling competition from the summer of 1998, Mark McGwire admitting that he takes Androä , Ben Johnson having his life ban from track and field upheld, or steroid testing of athletes at the local high school or college. Specific topics might include how the tests are administered, the stigma of a positive result, fairness issues when one sport bans the use of a substance and another allows it, "professional" wrestling and steroid use, or even talk-show spectacles about people who have to have a leg amputated from excessive steroid use.

Another option for you might be to have students research the topic of steroid use, using the sources cited below, the Internet, or other materials they find on public health. Have them bring in an interesting article on the subject to share with the class, or present a report on their research at various points in the unit.

The following resources were used to generate the Preparation Reading, and contain information you may find useful.

Wright, J.E., and V. Cowart, Anabolic Steroids: Altered States, Carmel, Indiana: Benchmark Press, 1990.

Donohue, T., and N. Johnson, Foul Play: Drug Abuse in Sports, New York: Basil Blackwell Ltd, 1986.

Meer, J., Drugs and Sports, New York: Chelsea House Publishers, 1987.





Discussion—Modeling Process

Having brought the issue of steroid testing to the students’ awareness, you need to shift their thinking to the mathematical nature of testing samples in pairs. The discussion of the modeling process has probably taken place in previous units, and begins with the students developing a well-defined question. While students may be considering many questions, cost is a critical factor in trying to determine when to pool samples.


The second step of the modeling process is to identify key features or assumptions that will be part of the development of the model, and to introduce variables into the problem.


The third step of the modeling process is to begin to examine the mathematical nature of the problem and establish a relationship between the two variables.


At this point, you can do one of two things:
1) announce to students that they will begin their exploration by trying to find out if the relationship between p and E is linear, and begin Activity 1, or
2) you can ask them how they might verify their conjecture to the number of tests needed when the probability of testing positive in the population is 0.5, and then begin Activity 1.





Activity 1—"It’s Only a Test"

   

= 1, 2, 3, 4, 5, 7, 8



Materials needed (per group):

Handout 1: Data Table for Part 2 (one per student)

Opaque bag containing tiles, chips or beads—two colors, representing a positive or negative test, and containing the same number of each color object (to model the situation in which p = 0.50)

Graphing calculators with DRACULA program (Handout 3) loaded into memory or

Handout 2: Data Table for Part 4 (one per student)



Part 1 of the activity is a review of the previous discussion. There are several ways in which you can proceed. You might let the students struggle with the questions from Part 1, and then have the discussion of the modeling process, with students recording answers for future reference. You might have the discussion first, and give students several minutes to answer these questions to make sure they understand what was discussed. Depending on the time in class, you could have the discussion in class and send the students home to formally answer the questions as part of a homework assignment.

Part 2 is where the students actually do the experiment. Model for the students how to perform the simulation; have two students playing the roles of the athletes, and you be the tester. One student should draw an object and put it back; then the other student should draw an object and put it back. Upon consulting with each other, they should announce whether the pooled test would be positive or negative. If positive, you select one of the students for an individual test, and he or she tells you the result. Finally, you record the results of the test on one row of the data table on Handout 1—the actual status determined by the color of the object drawn, the results of the test determined by the pooled result and which individual is tested first. Check to see if they understand what they are doing before allowing the students to begin the experiment.

In Part 3, students have the opportunity to answer the question, "How many tests should we expect when the probability of testing positive is 0.50?" Groups must communicate their results so the class data can be averaged together. The easiest way to do this is to have each group post its average number of tests on an overhead transparency or the board. Discuss the answers to Part 3 after the students have had a chance to think about them and formulate their own responses. Make sure that they understand that the results of the simulation indicate that it is possible to have the average number of tests be 2, but it’s not likely that we should expect the number of tests to average out to be 2. We need more data to minimize the fluctuations caused by individual trial results.

Part 4 has two versions to provide a little flexibility for individual teachers’ situations. The DRACULA simulation uses a calculator program to simulate the testing game with a larger number of trials; the commands that make up the program are included in the unit material as Handout 3. The other version, which uses coin tosses to simulate the testing game, is provided in case the calculators aren’t available, or if time is too limited to fully explore the activity. Handout 2 data table is for use with this version. In either case, students should discover two things about this situation: 1) increasing the number of trials to the simulation gives less individual fluctuation to the results, and 2) whatever the expected number of tests actually is, it’s slightly higher than 2 and not equal to 2.





Supplemental Activity 1—"Middle of a Model" (Manipulative Version)

   

In case the graphing calculators are not available for Activity 2, an optional development is provided in which students repeat the testing game from Activity 1. Each group models one of the probabilities mentioned in Activity 1 by adjusting the relative numbers of objects from which they are drawing. They need to repeat the experiment 100 times, so that the data will be fairly patterned. Then they share their data as a class, and the rest of the activity is the same as Activity 1. Refer to those teacher notes for more details.





Homework 1—"Let’s Rumble"

   

This is a collection of questions related to the testing game introduced in Activity 1. In discussing it with students afterward, ask students to connect Question 5 to the simulation done in class. Had they done 1000 trials, instead of 10 or 100, there would be even less fluctuation in the results and they would be even closer to the actual expected number of tests.





Activity 2—"Middle of a Model"

   

This activity should begin with a continuation of the discussion of the modeling process. We’ve determined that a probability of 0.5 for testing positive doesn’t yield two tests as the expected number. But does that mean that the behavior is linear, and that particular point isn’t exactly on the line? Or is the behavior nonlinear? And, if it isn’t linear, the natural question that begs to be asked is: What is it?

At this point, you can distribute Activity 2 and the graphing calculators. Each group should be assigned a probability; 8 groups of students can run simulations with p = 0.1, 0.2, 0.3, 0.4, 0.6, 0.7, 0.8, and 0.9. Remember that we already have data for p = 0.5, and the expected number of tests is determined when p = 0 and p = 1. The directions instruct the students to enter ‘1000’ as the number of trials; this will take several minutes. You could build in some short "sponge" activity while the students are waiting, or you could tell them to enter in a smaller number, like ‘100.’ The smaller number of trials will get an answer back much faster, but the data set won’t be as smoothly patterned. For the purposes of this activity and the rest of the unit, it doesn’t matter which way you go.

After the groups have run the simulation, a member from each group should record the group’s result on an overhead transparency or the board, and all students should record the class data on their activity pages in the data table. In analyzing the pattern, students are asked to "fit" the data with a line that starts at (0,1) and ends at (1,3). You may need to review how to find the equation of a line from two points. Make sure students understand the difference between observed value (numbers gotten from the calculator simulation) and expected value (numbers gotten by evaluating the line equation for the various values of p).





Homework 2—"Expect to Be Worth Something"

   

The first part of the homework activity will require students to have a pair of dice. The activity is meant to provide them an opportunity to explore the problem’s various outcomes and payoffs before tackling the problem by mathematical analysis. An option for you is to do that part of the homework activity in class and let students do the analysis when they get home. Be sure that students understand the definition of probabilistic worth and expected value, or at least call to their attention the fact that the boldface words are actually definitions, and tell you how to proceed with the calculations.





Activity 3—Finally, A Model

   

Materials needed

Graphing calculators

Handouts 4, 5, and 6
(1 each per student)

The purpose of this activity is to use regression analysis to determine if a linear, exponential, or quadratic function is the "best" type to use as a model. We’re continuing to use the data that have been collected on the expected number of tests and the probability of steroid use. Remind students that the process used in the previous activity (find an equation, calculate predicted values, calculate residuals, and examine the residual plot) is going to stay the same, but this time, the equations will be determined from regression analysis.

If students have never done regression, or were unsure of what they were doing, take the opportunity to show them how to do the linear regression (Part 1). Even if they are pretty clear of the idea, they may have forgotten how to use the graphing calculators to do that task, and reviewing with them the steps on the calculator isn’t a bad thing. Having done that, the students should proceed to do parts 2 and 3 on their own, so that they can discover the nature of the quadratic relationship themselves. Encourage them to explore other types of mathematical functions as well, even though that’s labeled an optional part.

Students should continue to Part 4 and answer the questions based upon the work that they’ve done. Have a brief discussion at the end, asking students to:





Homework 3—It’s a Good Fit

   

There are several options for the teacher in proceeding with this assignment. If students have access to graphing calculators, this assignment is a nice opportunity for them to practice developing linear models and determining if they truly are linear. If the students don’t have access to graphing calculators outside class, several options are still available.

  1. Have each student make a hand drawing of the data set, draw a line on the graph that is a reasonable "best" fit line, then determine the equation of that line from two points on the line. They can use the equation to calculate the predicted values and the residuals, and finally make a residual plot to determine if the data set is linear. Depending on your students, doing both data sets in this fashion might be tedious; however, students need to see the contrasting results to reinforce whether data is linear. You could assign one problem to half the class and the other problem to the other half of the class.


  2. Have students do all the data analysis in class with the calculators and record their table values; they would then go home to make the graphs and write their conclusions.


  3. Have it be an optional reteaching activity to be assigned as in-class work if they don’t understand the process well enough from Activity 3.




Activity 4—Which Model Fits the Best?

   

Students work in groups to analyze several data sets out of the context of steroid testing, and must determine the nature of the mathematical relationship among the data sets. There are a couple of ways to proceed with this activity:





Activity 5—Verifying the Model as Quadratic

   

 = 16, 17, 18



Materials needed

Transparency 1

This activity illustrates the idea of probability as an area. The specific example, which is the focus for Part 1, is the original testing problem, in which the assumption is that 50% of the population is taking steroids. Depending on your students, you can either lead the class through a discussion of the questions that form Activity 1 (with them recording the answers as they go) or let them work on that part for about 10-15 minutes and have a discussion of the material at that time. It’s important that the students identify each area and its meaning with respect to the probability conditions. Be sure that they verbalize the condition and understand why the values are being multiplied. Then proceed to Part 2, in which the problem is generalized to a condition that the probability of steroid use in the population is p, instead of 0.5. Students may struggle with the algebraic simplifications needed for Questions 5 and 7, but that’s the "punch line" to verifying the model as being quadratic, and well worth the effort. If needed, use Transparency 1 to review the questions from Part 2 in the follow-up discussion.

Remind your class that the probability that each of two events happens is calculated as follows:

P(A and B) = P(A) · P(B½A).

That is, "the probability of event A and event B both happening is equal to the product of the probability of event A times the probability of event B knowing that event A has already happened. If the probabilities of event A and event B are not related (independent events), the probability of "B given that A has occurred" is simply the probability of event B. It is only because we have assumed that the two individual tests are independent that we can state:

P(A and B) = P(A) · P(B).

Note: As you saw in the unit Imperfect Testing, the area model of probability can also be represented using probability trees. Though they are not discussed on the student pages of this unit, you may want to encourage students to use both methods on some problems.





Supplemental Activity 5—Introduction to the Parabola

   


Materials needed

Handout 7

Transparency 2

This activity is designated as a supplemental activity only because it doesn’t specifically address the problem of steroid testing and the cost-effectiveness of pooling blood samples. However, it is strongly recommended that a day be spent working on the various parts of this activity if the students have never seen a quadratic function, or if they haven’t mastered the concepts of translations and stretches of quadratics and the distinction between standard-form and vertex-form equations for parabolas.

In part 1, students concentrate on the graphs of the parent function y = x2 and translations on that function. Introduce them to the parent function by having them fill in the table of values and plot the points on the graph grid on Handout 7. Recall the concepts of domain and range, and reinforce the range limitation by asking the class to find the x-value that yields y = –1. Encourage discussion about "pairs" of points (x, y) and (–x, y), introducing the concept and terminology of line symmetry. Include several fractions and decimals in your table. Graph the function and label the vertex. (Use Transparency 2 to help you with the instruction, if needed). Use the graphing calculator to verify the drawing, and use the TRACE feature to verify the values recorded in the table.

Allow students to work in their groups to explore the transformations. There are four different transformations, so each student in the group can explore one of the shifts. Students should come together as a group to discuss their findings, and then answer Question 3. End the work for Part 1 by reviewing:
1) the shape of the parent parabola graph (y = x2),
2) the rules governing lateral translations of functions and
3) the domains and ranges of parabolas in the form yk = (xh)2.

In Part 2, students concentrate on how the graph of the parent function y = x2 is affected by stretch transformation and combinations of stretches and shifts. Once again, the work is divided into four parts, and each student in the group should explore two examples on his or her own. Then, the groups should collaborate to compare their results and draft responses to Questions 2 and 3. In the class discussion that follows the group explorations, ask if such stretch transformations change the domain or range of the parabola. (A horizontal stretch transformation would never change the domain or range, but a vertical stretch would change the range for parabolas whose vertex points are not on the x-axis.) Ask students if it matters whether one does the shift first and then the stretch, or the stretch first and then the shift. Close the discussion by reviewing how stretches in in both vertical and horizontal directions can be done.

In Part 3, students are asked to examine the two forms for the equation of a parabola: standard form and vertex form. They explore the parallel manipulation of the equation and its graph to discover how to change the equation from one form to another. Allow students to work through Questions 1-3, then discuss their findings before proceeding to Question 4, which gives students an additional opportunity to practice.





Homework 5—Quadratic Nature of Pairing Samples

   

This assignment asks students to examine the expected value calculation again, first using a specific value for the probability of testing positive (0.3) and then examining the model developed. Students are led to examine how changes in the probability affect the model geometrically (how do the various shapes adjust?) and analytically (how do the various pieces of the equation behave?).

A really effective way of exploring Question 1 would be to construct an area model on the computer using a dynamic drawing program like Geometer’s Sketchpad. Display the areas of the three regions representing the three different tests, and the probabilities associated with those regions. Then drag the intersection point from the middle of the square (Point E) toward the upper left or lower right corner and watch what happens to the areas and the calculations!





Activity 6—Solving the Problem

   

Closure on the problem of the cost-effectiveness of pooling blood samples is reached by examining the solution to the questions: "What is the break-even point? When is it cheaper to pool blood samples? When is it cheaper to test samples individually?" Various methods are explored for solving the quadratic equation describing the condition that our model should have a value equal to 2. Part 1 is pretty straightforward, using the calculator’s capability to determine intersection of two graphs. You might want to show your students the CALC menu feature that locates the intersection of two curves at this same time.

Part 2 examines the method of solving a quadratic equation called "completing the square." Students are led through this process and then asked to solve the modeling problem by applying their method. Let students work through Part 2; discuss individual aspects of the process as needed. After students solve the problem in Question 10, have them take a minute to compare their answers to the ones obtained in Part 1; they should verify the solution.

Finally, in Part 3, students are asked to generalize the work they did in Part 2 and to derive the quadratic formula. Let them work through Question 1, and have them tell you the steps involved. Make sure that there is no ambiguity or confusion in their process or thinking. Then let them write their answers to Question 2 before proceeding to the general equation. Discuss their work on Question 3 as needed, before letting the students go on to Question 4.





Homework 6—Practice With Quadratics

   

This assignment should be an opportunity for students to practice what they have discovered in the previous work, and a chance for you to see which students really "got it" and which are still struggling with the algebra developed in this last activity.





HANDOUT 1—Activity 1 Part 2

Data Table

Trial No.

 

1st Person’s Actual Status

2nd Person’s
Actual Status

 

Result of Pooled Test

Result of 1st Individual Test
(if needed)

Result of 2nd Individual Test
(if needed)

 

Number of Tests Needed

1

                 

2

                 

3

                 

4

                 

5

                 

6

                 

7

                 

8

                 

9

                 

10

                 


 

Average Number
of Tests:

 




HANDOUT 2—Activity 1 Part 4

Data Table

Trial
No.

Number
of Tests

 

Trial
No.

Number
of Tests

 

Trial
No.

Number
of Tests

 

Trial
No.

Number of Tests

1

   

26

   

51

   

76

 

2

   

27

   

53

   

77

 

3

   

28

   

53

   

78

 

4

   

29

   

54

   

79

 

5

   

30

   

55

   

80

 

6

   

31

   

56

   

81

 

7

   

32

   

57

   

82

 

8

   

33

   

58

   

83

 

9

   

34

   

59

   

84

 

10

   

35

   

60

   

85

 

11

   

36

   

61

   

86

 

12

   

37

   

62

   

87

 

13

   

38

   

63

   

88

 

14

   

39

   

64

   

89

 

15

   

40

   

65

   

90

 

16

   

41

   

66

   

91

 

17

   

42

   

67

   

92

 

18

   

43

   

68

   

93

 

19

   

44

   

69

   

94

 

20

   

45

   

70

   

95

 

21

   

46

   

71

   

96

 

22

   

47

   

72

   

97

 

23

   

48

   

73

   

98

 

24

   

49

   

74

   

99

 

25

   

50

   

75

   

100

 


Average Number of Tests:

 




HANDOUT 3—Dracula Program

(written for a TI-82 calculator)

PROGRAM:DRACULA

:0® C

:Disp "WHAT IS P?"

:Prompt P

:Disp "HOW MANY TRIALS?"

:Prompt N

:For(I,1,N,1)

:rand® A

:rand® B

:If A> P and B> P

:C+1® C

:If A< P

:C+3® C

:If A> P and B< P

:C+2® C

:End

:Disp "AVERAGE NUMBER"

:Disp "WAS"

:Disp C/N

:Stop





HANDOUT 4—Linear Regression for Activity 3

Equation: E =


Prob.
(p)

Observed
Value (E)

Expected
Value (E)


Errors
(Residuals)

0.0

     

0.1

     

0.2

     

0.3

     

0.4

     

0.5

     

0.6

     

0.7

     

0.8

     

0.9

     

1.0

     

 

Exponential Regression





HANDOUT 5—Exponential Regression for Activity 3

Equation: E =


Prob.
(p)

Observed
Value (E)

Expected
Value (E)


Errors
(Residuals)

0.0

     

0.1

     

0.2

     

0.3

     

0.4

     

0.5

     

0.6

     

0.7

     

0.8

     

0.9

     

1.0

     

 

Exponential Regression




HANDOUT 6—Quadratic Regression for Activity 3

Equation: E =


Prob.
(p)

Observed
Value (E)

Expected
Value (E)


Errors
(Residuals)

0.0

     

0.1

     

0.2

     

0.3

     

0.4

     

0.5

     

0.6

     

0.7

     

0.8

     

0.9

     

1.0

     

 

Exponential Regression





HANDOUT 7—Graph Grid for Supplemental Activity 5




Annotated Student Materials






Preparation Reading—Anabolic Steroids: Use and Effect

Anabolic steroids, synthetic compounds created to act like the male hormone testosterone, remain in the forefront of discussions regarding athletes and sports. Many athletes such as bodybuilders, weight lifters, runners, swimmers, and football players believe that steroids will give them strength advantages.

Actually, the development of anabolic steroids was an important scientific breakthrough. There are, in fact, four legal uses for the drugs. They are effective in treating certain forms of anemia, some kinds of cancer, pituitary dwarfism, and serious hormone disturbances.

By virtue of their specific chemical structure (which is totally different from any vitamin or amino acid), anabolic steroids can stimulate the genetic apparatus (DNA) within the cell of the nucleus. The DNA reacts to this stimulus by directing the production of specific new proteins. It is very important to understand that the biological response that occurs is dependent on the location and number of receptors. If the receptor is muscle, there will be a tissue-building effect. If the receptor is in the brain, there may be noticeable psychological and even behavioral effects. There are, incidentally, more receptors in "sexual" tissue than in muscle, which raises concern about the prospect of hypertrophy or cancer of the prostate gland.

Although the development of bigger muscles is a result of the anabolic component of the steroid, there is also an androgenic associated with the steroid. Androgenics cause growth of facial hair, lengthening of the vocal cords that results in the voice "breaking", acne, and the premature closure of the space between parts of the body’s long bones. Anabolic steroids can have harmful effects on the liver and the cardiovascular system (causing a higher risk for developing high blood pressure and blood-clotting problems). There is growing concern that flooding a young adolescent with synthetic sex hormones may disrupt not only physical changes but also normal psychological and emotional maturation that occur during adolescence.

Several studies have placed the incidence of anabolic-androgenic steroid use among high-school students at 6 to 7%. Figures as high as 11 percent of 11th-grade boys using steroids have been reported, meaning that between 250,000 and 500,000 high-school students have used steroids. Even more disturbing are reports that approximately 40 percent who have used steroids have used five or more cycles, and approximately 40 percent began using steroids before the age of 16. As expected, the majority of teenage users are participants in school athletics, but as many as a third of the male users are not involved in sports and take these drugs in an attempt to enhance their appearance.

There have been only a few surveys among the male college population, all of which show an estimated 2 percent usage. However, among male college athletes, the estimated usage rises to between 5 and 17%. The problem is not restricted to males. As many as 1 to 2 percent of senior high school girls are users. Between 3 and 5 percent of Olympic and professional female athletes have admitted to using them at some time during their careers.

Source: Write, J. E., and V. Coward, Anabolic Steroids: Altered States, Carmel, Indiana: Benchmark Press, 1990.





Activity 1—It’s Only a Test

   

 = 1, 2, 3, 4, 5, 6, 7, 8

Part 1: Introduction

  1. What is the central problem that we’re trying to solve?
  2. We’re trying to determine when it’s cheaper to "pool" blood samples in testing for steroid use.

  3. What are the variables in this problem?
  4. The likelihood, p, that someone is actually using steroids in the population, and the expected number of tests, E, that would be needed.

  5. When one tests for steroid use by "pooling" two blood samples, how many tests do you expect to have to make?
  6. The number of tests will always be one, two, or three tests.

  7. In testing for anabolic steroid use by pooling two samples together:


    1. If nobody is taking steroids, how many tests do you expect to have to make?
    2. One test every time, because the pooled sample would come up negative (clean).

    3. If very few people are taking steroids, how many tests do you expect to have to make?
    4. Most of the time, only one test would be needed. Once in a while, a second (or third) test would be required. The average number would be close to one.

    5. If almost all the people are taking steroids, how many tests do you expect to have to make?
    6. Most of the time, three tests would be needed–the pooled sample test would come up positive, and the first individual test would come up positive. Some of the time, only two tests would be needed, since the first individual test might be negative. The average number would be close to three.

    7. If everybody is taking steroids (and what a crazy world that would be!), how many tests do you expect to have to make?
    8. Three tests every time, because the pooled sample would always come up positive, and the first individual test would also come up positive.

  8. If exactly half the population were taking steroids, how many tests do you expect to need for each sample pair? Explain your reasoning.
  9. I guess that two tests would be needed. Since p = 0.5 is halfway between none of the people and all of the people, I expect that halfway between one test and three tests should be my answer.

  10. What modeling assumptions did you make in answering Question 5?
  11. The nature of the relationship between the two variables is linear; the expected number of tests increases proportionally to the population occurrence.



Part 2: Simulation

In groups, you are going to explore the problem of what happens when half the population is taking steroids. Two people will play the role of Olympic athletes, who have to be tested for steroid use. The third person will be the lab worker performing the test. Go through the simulation 10 times, recording the actual status of each athlete, the result of the pooled test, and the result of any individual tests, in the data table provided for this activity. Use marks of ‘+’ for a positive result or ‘–’ for a negative result. Then, determine the number of tests needed for each trial, and finally the average number of tests needed for the 10 trials.



Part 3: Analyzing Class Results

In the table below, record each group’s results for the average number of tests needed for 10 trials. Answers will vary; sample results provided in the table.

Group

1

2

3

4

5

6

7

8

9

10

Avg. No. of Tests

2.3

1.9

1.8

2.4

2.4

1.9

2.5

2.4

2.2

2.5

  1. As the tester, when would you be saving money in pooling samples?
  2. When the average number of tests was under 2 tests, since testing the two athletes individually would always require 2 tests.

  3. When half the population is using steroids, is it possible to save money by pooling samples in testing for steroid use?
  4. Yes. There were specific trials that required only 1 test, and, in certain groups, the average number < 2 tests.

  5. In this situation, is it likely that you would save money by pooling samples?
  6. Probably not. There were more group results that were > 2 tests, and the average of those results was » 2.2 tests. However, given the range of results, it’s hard to be sure. Some of the groups may have been very unlucky in choosing which of the individual samples to test first when the pooled sample came up positive.

  7. Think back to the simulation. Are there problems or limitations in the way in which we collected our data that might affect the results that we got?
  8. Possible answers include: not replacing the first person’s chip before drawing the second, having the chips stick to each other, nonuniform weights or shapes for the objects drawn, not mixing or shaking up the objects well enough, or looking at the objects as they are being drawn.

  9. How could we modify our simulation, to better understand what’s going on in the situation where half the population is taking steroids?
  10. Do more trials. Specifically, if each group did the test 100 times, the range of results might narrow down a bit, and the average of those results would get close to the "true" results.



Part 4: DRACULA Simulation of Situation

Each group will run the DRACULA program. When asked the first question, "What is P? P = ?", enter in the value ‘0.5’ (remember, we’re studying what happens when half the population is taking steroids). When asked the second question, "How many trials? N = ?", enter in the value ‘100’. After a few seconds, the calculator will provide the average number of tests needed.

In the table below, record each group’s results for the average number of tests needed for 100 trials.

Answers will vary; sample results provided in the table.

Group

1

2

3

4

5

6

7

8

9

10

Avg. No. of Tests

2.16

2.21

2.31

2.04

2.24

2.37

2.33

2.16

2.26

2.18

  1. Do you think it’s cheaper to pool samples in testing for steroid use when half the population is taking steroids? Explain.
  2. No. The range of possible results when only 10 trials were taken was [1.8, 2.5]. When the simulation was repeated with 100 trials, the range became [2.04, 2.37]. In every situation, the average number of tests was greater than 2, and, when you consider the total number of tests (n = 1000), the average » 2.23. Incidentally, the average of the total number of tests before was » 2.2 as well. I thought that the results were supposed to be an average of 2 tests. I wasn’t sure if the way the simulation was being run, or a streak of bad luck, might be causing me to get a different answer. But now, I’m convinced that it’s not supposed to average to 2 tests, and, if I needed even more convincing, I would simply repeat the simulation again, doing an even larger number of trials.

  3. In modeling the situation, two questions come up as a result of this activity. Review the problem and the original assumptions, and see if you can come up with those questions.


    1. The problem of "when is it cheaper to pool samples?" is now narrowed to a range of possible probabilities between [0, 0.5]. So a natural question to ask would be: "What is the probability of steroid use in the general population where the break-even point occurs?" or "At what probability would the number of tests be expected to be exactly two tests?"


    2. The assumption is that the relationship between p and E is not linear, or else we would have found that the number of tests would average to be 2 tests. So a natural question to ask at this point would be: "What is the exact nature of the relationship between p (the incidence of steroid use in the population) and E (the expected number of tests required in pooling two samples)?"


Part 5: Simulation by Tossing Pennies

You will repeat the simulation from today’s class, only this time using pennies. We’ll agree that a penny that comes up ‘tails’ will be a negative test result, and a penny that comes up ‘heads’ will be a positive test result. You will need someone else to play the role of the tester, but you might have figured out there is a shortcut to doing this:

Record the results of the coin-toss simulation in the data table provided. Calculate the average number of tests needed for your simulation, and come to class tomorrow ready to share your results.

In the table below, record 10 students’ results for the average number of tests needed for 100 trials.

Answers will vary; sample results provided in the table.

Group

1

2

3

4

5

6

7

8

9

10

Avg. No. of Tests

2.16

2.21

2.31

2.04

2.24

2.37

2.33

2.16

2.26

2.18

  1. Do you think it’s cheaper to pool samples in testing for steroid use when half the population is taking steroids? Explain.
  2. No, the range of possible results when only 10 trials were taken was [1.8, 2.5]. When the simulation was repeated with 100 trials, the range became [2.04, 2.37]. In every situation, the average number of tests was greater than 2, and, when you consider the total number of tests (n = 1000), the average » 2.23. Incidentally, the average of the total number of tests before was » 2.2 as well. I thought that the results were supposed to be an average of 2 tests. I wasn’t sure if the way the simulation was being run, or a streak of bad luck, might be causing me to get a different answer. But now, I’m convinced that it’s not supposed to average to 2 tests, and, if I needed even more convincing, I would simply repeat the simulation again, doing an even larger number of trials.

  3. In modeling the situation, two questions come up as a result of this activity. Review the problem and the original assumptions, and see if you can come up with those questions.


    1. The problem to solve becomes "What is the probability of steroid use in the general population where the break-even point occurs?" or "At what probability would the number of tests be expected to be exactly two tests?"
    2. The modeling question becomes "What is the exact nature of the relationship between p (the incidence of steroid use in the population) and E (the expected number of tests required in pooling two samples)?"




Supplemental Activity 1—Middle of a Model (Manipulative Version)

   

In Activity 1, we assumed that the probability was 0.50 for a person selected at random to be taking steroids. This activity will explore how different probability rates affect the number of tests required.

Your group will be assigned a particular probability to use, and then you will do the Testing Game again. Modify the number of each kind of marker used, so that it models the probability your group is working with. Then do the simulation 100 times and determine the average number of tests. When done, record your results in a table and share them with other groups according to directions given.

Record the class results for the average number of tests needed for 100 trials in the space below the appropriate probability of occurrence. Use a table similar to the data table in Handout 2, Activity 1.

Answers will vary; sample data provided in the table.

Probability (p)

0.0

0.1

0.2

0.3

0.4

0.5

Avg. No. of Tests (E)

1.00

1.25

1.56

1.85

2.06

2.24

Probability (p)

0.6

0.7

0.8

0.9

1.0

Avg. No. of Tests (E)

2.41

2.69

2.79

2.86

3.00



  1. Using the data collected, make a graph of Expected Value (E) vs. probability (p).
  2. In creating a model, the first clue is the pattern showing up in the scatterplot. Describe the graph; what kind of pattern seems to relate these quantities?
  3. It appears to be fairly linear, starting at (0,1) and going to (1,3).

  4. You may have answered the previous question by saying that the pattern looks linear. Let’s assume that it is. The next step is to describe the pattern in mathematical terms.


    1. Draw a line that starts at the first point (0,1), and goes to the last point (1,3). Find the slope of that line.
    2. Slope = (3 – 1) / (1 – 0) = 2

    3. Write the equation for the line in slope-intercept form; this will be our first model for these data.
    4. E = 2p + 1 or y = 2x + 1

  5. Now, we have to see how good a job our model does in describing the observed data. Fill in the table below; the actual values are the ones found from the simulation and recorded on the previous page. The equation above tries to "predict" those data, and so predicted values are found by taking each value for the probability and using it in the equation you just derived. Finally, errors (residuals) are how far the predictions are from the observed data, so they are found by subtraction:
  6. Error = Observed – Expected

    Probability
    (p)

    Observed
    Value (E)

    Expected
    Value

    Error
    (residual)

    0.0

    1.00

    1.00

    0.00

    0.1

    1.25

    1.20

    + 0.05

    0.2

    1.56

    1.40

    + 0.16

    0.3

    1.85

    1.60

    + 0.25

    0.4

    2.06

    1.80

    + 0.26

    0.5

    2.24

    2.00

    + 0.24

    0.6

    2.41

    2.20

    + 0.21

    0.7

    2.69

    2.40

    + 0.29

    0.8

    2.79

    2.60

    + 0.19

    0.9

    2.86

    2.80

    + 0.06

    1.0

    3.00

    3.00

    0.00

  7. Finally, in the grid provided above, make a graph of errors vs. probability (p).


  8. A good model will show a random pattern of residuals, with a balance of points above and below the zero line. Does the model you’ve developed have those properties?
  9. No. The residual plot has no points below the zero line and is not showing a random pattern.





Homework 1—Let’s Rumble

   
  1. Dante Takum is trying to arrange tag-team matches for his wrestlers, half of whom are currently taking anabolic steroids. The event organizers insist on having the wrestlers tested for steroid use. Each test costs $160.


    1. If you wanted to test each wrestler individually, what would the total cost be?
    2. (2)($160) = $320

    3. Based on the work done in Activity 1, how much would it cost (on average) to test the pooled sample from the two wrestlers on the tag team first, and then test individuals if needed?
    4. Based on an average of 2.23 tests per pair, it would cost (2.23)($160) = $356.80

  2. Carefully consider the following table that represents results from repeating the testing simulation from Activity 1. Was the game properly played? How do you know?
  3. Trial Number

    1

    2

    3

    4

    5

    6

    7

    8

    9

    10

    Number of Tests

    1

    3

    3

    1

    1

    1

    3

    1

    1

    3

    Probably not. The situation in which two tests were required never came up.

  4. In Activity 1, we modeled a situation in which half the population was taking steroids; we used two types of markers and had the same number of each color. If green means "steroid user," determine the smallest number of blue and green markers needed to model a population in which the probability of steroid use is:


    1. 75%
    2. 3 green, 1 blue

    3. 15%
    4. 3 green, 17 blue

    5. 23%
    6. 23 green, 77 blue

  5. Suppose you want to model a situation in which the probability of an individual taking steroids is 25%. Use the same color scheme as in the previous question.


    1. What is the smallest number of chips that you would need?
    2. 1 green and 3 blue

    3. If you drew a blue marker on the first draw, and did not replace the chip, what would be the probability of drawing a blue on the second draw? A green on the second draw?
    4. 2/3 or 0.67; 1/3 or 0.33

    5. If you drew a green on the first draw and did not replace the chip, what is the probability of drawing a green on the second draw? A blue on the second draw?
    6. 0%; 100%

  6. Suppose that you use a coin to simulate the situation in which the probability is 0.5 that a person in the general population is taking anabolic steroids, and that you use a "fair coin."


    1. Would that mean that, if you flipped the coin twice, you would always get one head and one tail?
    2. No, but that would be the most likely outcome.

    3. If you flipped the coin a million times, would you always get 500,000 heads and 500,000 tails?
    4. No, but that (very unlikely) event would still be the most likely outcome.

    5. Which do you think is more likely: The number of heads in 10 tosses would be between 4 and 6, or the number of heads in 100 tosses would be between 40 and 60?
    6. The second option is more likely.

    7. Suppose a fair coin was flipped ten times and happened to land heads eight of those times. If you flipped the coin another 100 times, how many heads would you expect to get?
    8. Approximately 50. This would naturally draw the overall percentage closer to 50% (i.e. 58 out of 110, if exactly 50 of the next 100 tosses are heads). Be careful not to assume that the data will automatically compensate for the initial results.

  7. In your simulation in Activity 1, it was important to replace the drawn chip before drawing another. Can you explain why? Can you think of a different situation where it would be important not to replace the chip?
  8. Not replacing the drawn chip changes the probability of drawing a green chip on the second draw. We want to model the situation in which both athletes have the same probability of testing positive.

    Many states have lotteries in which a collection of six numbers is chosen from 44 to 51 different possible numbers. In that situation, the numbers shouldn’t be replaced.





Activity 2—Middle of a Model

   

 = 6, 9, 10, 11

In Activity 1, we assumed that the probability was 0.50 for a person selected at random to be taking steroids. This activity will explore how different probability rates affect the expected number of tests required.

The teacher will assign a particular probability to each group, which will then run the DRACULA program. When prompted by the calculator program, give it the probability (decimal, please!), and set the number of trials to be 1000. Settle in for a couple of minutes; after the calculator has done all those trials, record your results in the table below and share them with other groups according to directions given.

In the tables below, record the class results for the average number of tests needed for 1000 trials in the space below the appropriate probability of occurrence:

Answers will vary; sample data provided in the table.

Probability (p)

0.0

0.1

0.2

0.3

0.4

0.5

Avg. No. of Tests (E)

1.000

1.300

1.554

1.879

2.042

2.237

Probability (p)

0.6

0.7

0.8

0.9

1.0

Avg. No. of Tests (E)

2.418

2.632

2.727

2.873

3.000

  1. Using the data collected, make a graph of Expected Value (E) vs. probability (p).
  2. In creating a model, the first clue is the pattern showing up in the scatterplot. Describe the graph; what kind of pattern seems to relate these quantities?
  3. It appears to be fairly linear, starting at (0,1) and going to (1,3).

  4. You may have answered the previous question by saying that the pattern looks linear. Let’s assume that it is. The next step is to describe the pattern in mathematical terms.


    1. Draw a line that starts at the first point (0,1), and goes to the last point (1,3). Find the slope of that line.


    2. Slope = (3 – 1) / (1 – 0) = 2

    3. Write the equation for the line in slope-intercept form; this will be our first model for these data.
    4. E = 2p + 1 or y = 2x + 1

  5. Now, we have to see how good a job our model does in describing the observed data. Fill in the table below; the actual values are the ones found from the simulation and recorded on the previous page. The equation above tries to "predict" those data, and so predicted values are found by taking each value for the probability and using it in the equation you just derived. Finally, errors (residuals) are how far the predictions are from the observed data, so they are found by subtraction:
  6. Errors = Observed – Expected

    Probability
    (p)

    Observed
    Value (E)

    Expected Value

    Error
    (residual)

    0.0

    1.000

    1.000

    0.000

    0.1

    1.300

    1.200

    + 0.100

    0.2

    1.554

    1.400

    + 0.154

    0.3

    1.879

    1.600

    + 0.279

    0.4

    2.042

    1.800

    + 0.242

    0.5

    2.237

    2.000

    + 0.237

    0.6

    2.418

    2.200

    + 0.218

    0.7

    2.632

    2.400

    + 0.232

    0.8

    2.727

    2.600

    + 0.127

    0.9

    2.873

    2.800

    + 0.073

    1.0

    3.000

    3.000

    0.000

  7. Finally, in the grid provided above, make a graph of errors vs. probability (p).


  8. A good model will show a random pattern of residuals, with a balance of points above and below the zero line. Does the model you’ve developed have those properties?
  9. No. The residual plot has no points below the zero line and is not showing a random pattern.





Homework 2—Expect to Be Worth Something

   
  1. A dice game is defined by two rules:


  2. Find the expected value.

    1. Play the game several times with a friend or a family member. Describe the strategy that you think would be best to apply to this game.
    2. Answers will vary. Possibilities include "12," since it comes up in a lot of different ways; "36," since it is the biggest outcome.

    3. Now, let’s apply some mathematics. First, we need to determine what the possible outcomes are when you multiply the two dice together, and how often they come up. Fill in the chart provided below with the winning answers:


    4. Outcome for Dice 2

       

      1

      2

      3

      4

      5

      6

      1

      1

      2

      3