Page 90 - Genetics_From_Genes_to_Genomes_6th_FULL_Part3
P. 90

384    Chapter 11    Analyzing Genomic Variation




                 TOOLS OF GENETICS                              Blue DNA: © MedicalRF.com


                 The Lod Score Statistic
                 The Lod score is a mathematical answer to the question: How   1/8. The probability of seven parentals and one recombi-
                 much more likely is it that the allele transmission pattern seen in   nant in the particular birth order PPPPPPPR is:
                 a pedigree will occur if the loci are linked at a given recombina-         7  7  1  1  1

                 tion frequency (RF) less than 50%, than if they are not linked?   P(RF 12.5% ) =  ( 8 ) ( 8 )  ≈  20
                 The Lod score, as its name implies (log of the odds) is the loga-       A generalized formula for calculating the Lod score
                 rithm of the ratio between these two probabilities:
                                                                         numerator is:
                                                                                          #P
                         P(obtaining observed results if loci are linked at a given RF)  (1 − RF obs )  × (RF obs ) #R
                  Lod = log
                        [   P(obtaining observed results if loci are unlinked)  ]
                                                                        where RF obs  is the RF indicated by the data, #P is the num-
                    Here, we illustrate the Lod score calculation for the pedi-  ber of parentals, and #R is the number of recombinants.
                 gree in Fig. 11.20a. The pedigree suggests that the NF gene is     4.  Calculate the likelihood ratio. This is simply the ratio of
                 linked to a particular SNP on chromosome 17. The calculation   the values you found in steps 2 and 3. For this example,
                 will allow us to determine our degree of confidence in this pre-           1     1
                 liminary conclusion.                                      P(RF 12.5% ) P(RF 50% ) =  ∕  = 12.8
                                                                                          ( 20 ) ( 256 )
                   1.  Tabulate which progeny are parental and which are
                      recombinant. In Fig. 11.20a, you can see that the first        This likelihood ratio means that it is 12.8 times more likely
                    seven children in generation III have the parental (P) con-  that the NF gene and the SNP are linked with RF = 12.5%
                    figuration of alleles and that only child III-8 has the recom-  than that they are not linked (RF = 50%).
                    binant (R) configuration. We’ll abbreviate these data as     5.  Calculate the Lod score. The Lod score is simply the
                    PPPPPPPR.                                          base 10 logarithm of the likelihood ratio. For the example
                   2.  Calculate the Lod score denominator. If two loci are un-  in Fig. 11.20a:
                    linked, it is equally likely that any one child will be P or R   Lod score = log(12.8) = 1.1
                    (that is, the RF = 50%). The probability of P is thus ½, and     6.  Interpret the Lod score. The convention among human
                    the probability of R is also ½. The probability of obtaining   geneticists is that a Lod score ≥3 (that is, a likelihood ratio
                    children in the particular birth order PPPPPPPR if the NF   ≥1000) is required to be confident of linkage. The Lod
                    gene and the SNP locus are unlinked is:            score of 1.1 indicates that the data in Fig. 11.20a are
                                                                         insufficient to conclude that NF and SNP1 are linked.
                                           1  8  1
                                 P(RF 50% ) =  =
                                         ( 2 )  256                Important points about Lod scores:
                                                                    •  The Lod score determined by assuming the precise RF
                      You can see that a generalized formula for this part of the
                    calculation is simply:                             implied by the data will always be the maximum Lod score
                                                                       obtainable for the data set. 
                       n
                     (½) , where n is the total number of tabulated individuals.  •  For a single pedigree, Lod scores can be calculated for
                                                                       any RF value less than 50%. A Lod score ≥3 indicates that
                   3.  Calculate the Lod score numerator. Loci could be linked   the data obtained are compatible statistically with the
                    if the RF is any value less than 50%, but the calculation     particular distance (less than 50 m.u.) being tested. 
                      requires us to assume an RF value. The pedigree in   •  Likelihood ratios are converted into Lod scores because
                    Fig. 11.20a indicates an RF of 1/8 = 12.5%, so we will   Lod scores calculated for the same RF value in different
                    use this as our best current estimate. With RF = 1/8, the   pedigrees may simply be added to see whether a Lod of
                      expected frequency of P progeny is 7/8, and R progeny is   3 can be obtained.



              DNA marker are genetically linked. The Lod score is used   convention adopted by human geneticists is that a Lod
                                             2
              in human genetics instead of the χ  statistic because the   score greater than or equal to 3 indicates two loci are linked.
              Lod score better handles a small number of data points   A Lod score of 3 means that it is 1000 times more likely
              while allowing the data obtained from many different ped-  that the two loci are linked than that they are not (because
              igrees to be combined.                               3 = log1000). The beauty of the Lod score statistic is that
                  The Lod score statistic is calculated from the ratio of   because it is a log function, the Lod scores from different
              two probabilities: the probability of obtaining a particular   pedigrees may simply be added, so researchers will know
              set of results in a pedigree if two loci are linked (assuming   when  they have enough  data  to  conclude  that  a  disease
              a particular RF value), and the chance of observing the     allele is linked to a specific marker.
              same results if the loci are unlinked. The Lod score statistic   The  Tools of  Genetics  Box  entitled  The  Lod  Score
              is the base 10 logarithm (log) of this likelihood ratio. The     Statistic illustrates how to calculate a Lod score, using as
   85   86   87   88   89   90   91   92   93   94   95