diff --git a/dev/.markdownlint.json b/dev/.markdownlint.json new file mode 100644 index 0000000..6a8046b --- /dev/null +++ b/dev/.markdownlint.json @@ -0,0 +1,3 @@ +{ + "MD041": false +} diff --git a/dev/beefblup-cli/index.html b/dev/beefblup-cli/index.html new file mode 100644 index 0000000..8222df6 --- /dev/null +++ b/dev/beefblup-cli/index.html @@ -0,0 +1,4 @@ + +- · beefblup

beefblup Command Line Interface (CLI) documentation

A work in progress

Notice: This document is a draft for what the command-line interface for beefblup would look like as of version 1.0, if beefblup was even a command-line application to begin with (it's not). It is modeled (loosely) after the man page format. It is not intended to be taken seriously, but instead to serve as a useful thought experiment and brainstorming ground on the future of beefblup. Please use it if it clarifies things for you. If it doesn't, ignore it.

Input file

beefblup requires a very specific format of input file. The format may be in comma-separated values (CSV) or Excel 2007+ (XLSX) format. CSV files should not be quoted (and therefore cannot have commas within cell values). Other formats may be forthcoming.

A beefblup data file must have at least six columns appearing in this order:

  • ID
  • Sire ID
  • Dam ID
  • Birthdate
  • Fixed effect(s)
  • Response variable(s)

The first row always contains column names. The values of column names are unimportant for the first four columns, as they will always be treated the same regardless of the name. The generated report will use the column names of fixed effects and response variables as given.

Each fixed effect should have its own column, to as many as are needed. There is no limit to the number of fixed effects as defined by beefblup, however its dependencies might have some. The same rules apply to response variables.

Unknown values should be left blank (,,). Do not substitute null placeholders (e.g. NULL, NA, 0, nothing, undefined, etc.) for unknown values.

An example spreadsheet might have the following format

IDSire IDDam IDBirthdateSexWeaning Weight
11/1/1990Male354
211/1/1990Female251
311/1/1991Male327
4121/1/1991Female328
521/1/1991Male301
61/1/1991Female270
71/1/1992Male330

Synopsis

beefblup [-G SNPs_file] [-M num_response_vars] [-o report_spreadsheet]
+[--no-aod] [--no-year] [--no-season] [--no-autob] [--maternal] input_file
+[report_file]

Command line basic syntax

The most basic input is to simply pass the input file name to the program.

beefblup filename.csv

In this case beefblup will insert fixed-effects for age-of-dam, year, and season, and will calculate the EBVs for the response variable in the final column. The report will then be saved as filename_report.txt.

Suppressing automatically-calculated fixed effects

If you don't wish to include one of the automatically calculated fixed-effects from your model, you can pass arguments to suppress them.

Suppress Age-of-dam

beefblup --no-aod filename.csv

Suppress year

beefblup --no-year filename.csv

Suppress season

beefblup --no-season filename.csv

Suppress all calculated fixed effects

beefblup --no-autob filename.csv

The argument --no-autob comes from the nomenclature of assigning fixed-effects to the matrix b in Henderson's mixed-model equations.

diff --git a/dev/how-to-calculate-epds/index.html b/dev/how-to-calculate-epds/index.html new file mode 100644 index 0000000..915b873 --- /dev/null +++ b/dev/how-to-calculate-epds/index.html @@ -0,0 +1,193 @@ + +How to Calculate EPDs · beefblup

How to Calculate EPDs

Not to exclude our Australian comrades or our dairy friends, this guide could alternately be called

  • How to Calculate Expected Breeding Values (EBVs)
  • How to Calculate Predicted Transmitting Abilities (PTAs)
  • How to Calculate Expected Progeny Differences (EPDs)

Since I'm mostly talking to American beef producers, though, we'll stick with EPDs for most of this discussion.

Expected Breeding Values (EBVs) (which are more often halved and published as Expected Progeny Differences [EPDs] or Predicted Transmitting Abilities [PTAs] in the United States) are generally found using Charles Henderson's linear mixed-model equations. Great, you say, what is that? I'm glad you asked...

The mathematical model

Every genetics textbook starts with the following equation

\[P = G + E\]

Where:

  • P = phenotype
  • G = genotype (think: breeding value)
  • E = environmental factors

Now, we can't identify every environmental factor that affects phenotype, but we can identify some of them, so let's substitute E with some absolutes. A good place to start is the "contemporary group" listings for the trait of interest in the [BIF Guidelines], though for the purposes of this example, I'm only going to consider sex, and birth year.

\[P = G + E_{year} + E_{sex}\]

Where:

  • E<sub>n</sub> is the effect of n on the phenotype

Now let's say I want to find the weaning weight breeding value (G) of my favorite herd bull. I compile his stats, and then plug them into the equation and solve for G, right? Let's try that.

Calf Records

ID | Birth Year | Sex | YW (kg) – | - | - | - 1 | 1990 | Male | 354

\[354 \textup{kg} &= G_1 + E_{1990} + E_{male}\]

Hmm. I just realized I don't know any of those E values. Come to think of it, I remember from math class that I will need as many equations as I have unknowns, so I will add equations for other animals that I have records for.

Calf Records

ID | Birth Year | Sex | YW (kg) – | - | - | - 1 | 1990 | Male | 354 2 | 1990 | Female | 251 3 | 1991 | Male | 327 4 | 1991 | Female | 328 5 | 1991 | Male | 301 6 | 1991 | Female | 270 7 | 1992 | Male | 330

\[\begin{aligned} +251 \textup{kg} &= G_2 + E_{1990} + E_{female} \\ +327 \textup{kg} &= G_3 + E_{1991} + E_{male} \\ +328 \textup{kg} &= G_4 + E_{1991} + E_{female} \\ +301 \textup{kg} &= G_5 + E_{1991} + E_{male} \\ +270 \textup{kg} &= G_6 + E_{1991} + E_{female} \\ +330 \textup{kg} &= G_7 + E_{1992} + E_{male} +\end{aligned}\]

Drat! Every animal I added brings more variables into the system than it eliminates! In fact, since each cow brings in at least one term (G<sub>n</sub>), I will never be able to write enough equations to solve for G numerically. I will have to use a different approach.

The statistical model: the setup

Since I can never solve for G directly, I will have to find some way to estimate it. I can switch to a statistical model and solve for G that way. The caveat with a statistical model is that there will be some level of error, but so long as we know and can control the level of error, that will be better than not knowing G at all.

Since we're switching into a statistical space, we should also switch the variables we're using. I'll rewrite the first equation as

\[y = b + u + e\]

Where:

  • y = Phenotype
  • b = Environment
  • u = Genotype
  • e = Error

It's not as easy as simply substituting b for every E that we had above, however. The reason for that is that we must make the assumption that environment is a fixed effect and that genotype is a random effect. I'll go over why that is later, but for now, understand that we need to transform the environment terms and genotype terms separately.

We'll start with the environment terms.

The statistical model: environment as fixed effects

To properly transform the equations, I will have to introduce b<sub>mean</sub> terms in each animal's equation. This is part of the fixed effect statistical assumption, and it will let us obtain a solution.

Here are the transformed equations:

\[\begin{aligned} +354 \textup{kg} &= u_1 + b_{mean} + b_{1990} + b_{male} + e_1 \\ +251 \textup{kg} &= u_2 + b_{mean} + b_{1990} + b_{female} + e_2 \\ +327 \textup{kg} &= u_3 + b_{mean} + b_{1991} + b_{male} + e_3 \\ +328 \textup{kg} &= u_4 + b_{mean} + b_{1991} + b_{female} +e_4 \\ +301 \textup{kg} &= u_5 + b_{mean} + b_{1991} + b_{male} + e_5 \\ +270 \textup{kg} &= u_6 + b_{mean} + b_{1991} + b_{female} + e_6 \\ +330 \textup{kg} &= u_7 + b_{mean} + b_{1992} + b_{male} + e_7 +\end{aligned}\]

Statistical methods work best in matrix form, so I'm going to convert the set of equations above to a single matrix equation that means the exact same thing.

\[\begin{bmatrix} +354 \textup{kg} \\ +251 \textup{kg} \\ +327 \textup{kg} \\ +328 \textup{kg} \\ +301 \textup{kg} \\ +270 \textup{kg} \\ +330 \textup{kg} +\end{bmatrix} += +\begin{bmatrix} +u_1 \\ +u_2 \\ +u_3 \\ +u_4 \\ +u_5 \\ +u_6 \\ +u_7 +\end{bmatrix} ++ +b_{mean} ++ +\begin{bmatrix} +b_{1990} \\ +b_{1990} \\ +b_{1991} \\ +b_{1991} \\ +b_{1991} \\ +b_{1991} \\ +b_{1992} +\end{bmatrix} ++ +\begin{bmatrix} +b_{male} \\ +b_{female} \\ +b_{male} \\ +b_{female} \\ +b_{male} \\ +b_{female} \\ +b_{male} +\end{bmatrix} ++ +\begin{bmatrix} +e_1 \\ +e_2 \\ +e_3 \\ +e_4 \\ +e_5 \\ +e_6 \\ +e_7 +\end{bmatrix}\]

That's a nice equation, but now my hand is getting tired writing all those b terms over and over again, so I'm going to use [the dot product] to condense this down.

\[\begin{bmatrix} +354 \textup{kg} \\ +251 \textup{kg} \\ +327 \textup{kg} \\ +328 \textup{kg} \\ +301 \textup{kg} \\ +270 \textup{kg} \\ +330 \textup{kg} +\end{bmatrix} += +\begin{bmatrix} +u_1 \\ +u_2 \\ +u_3 \\ +u_4 \\ +u_5 \\ +u_6 \\ +u_7 +\end{bmatrix} ++ +\begin{bmatrix} +1 & 1 & 0 & 0 & 1 & 0 \\ +1 & 1 & 0 & 0 & 0 & 1 \\ +1 & 0 & 1 & 0 & 1 & 0 \\ +1 & 0 & 1 & 0 & 0 & 1 \\ +1 & 0 & 1 & 0 & 1 & 0 \\ +1 & 0 & 0 & 1 & 1 & 0 +\end{bmatrix} ++ +\begin{bmatrix} +b_{mean} \\ +b_{1990} \\ +b_{1991} \\ +b_{1992} \\ +b_{male} \\ +b_{female} +\end{bmatrix} ++ +\begin{bmatrix} +e_1 \\ +e_2 \\ +e_3 \\ +e_4 \\ +e_5 \\ +e_6 \\ +e_7 +\end{bmatrix}\]

That matrix in the middle with all the zeros and ones is called the incidence matrix, and essentially reads like a table with each row corresponding to an animal, and each column corresponding to a fixed effect. For brevity, we'll just call it X, though. One indicates that the animal and effect go together, and zero means they don't. For our record, we could write a table to go with X, and it would look like this:

Animal | mean | 1990 | 1991 | 1992 | male | female – | - | - | - | - | - | - 1 | yes | yes | no | no | yes | no 2 | yes | yes | no | no | no | yes 3 | yes | no | yes | no | yes | no 4 | yes | no | yes | no | no | yes 5 | yes | no | yes | no | yes | no 6 | yes | no | yes | no | no | yes 7 | yes | no | no | yes | yes | no

Now that we have X, we have the ability to start making changes to allow us to solve for u. Immediately, we see that X is singular, meaning it can't be solved directly. We kind of already knew that, but now we can quantify it. We calculate the [rank of X], and find that there is only enough information contained in it to solve for 4 variables, which means we need to eliminate two columns.

There are several ways to effectively eliminate fixed effects in this type of system, but one of the simplest and the most common methods is to declare a base population, and lump the fixed effects of animals within the base population into the mean fixed effect. Note that it is possible to declare a base population that has no animals in it, but that gives weird results. For this example, we'll follow the convention built into beefblup and pick the last occuring form of each variable.

Base population

<dl> <dt>Year</dt> <dd>1992</dd>

<dt>Sex</dt> <dd>Male</dd> </dl>

Now in order to use the base population, we simply drop the columns representing conformity with the traits in the base population from X. Our new equation looks like

\[\begin{bmatrix} +354 \textup{kg} \\ +251 \textup{kg} \\ +327 \textup{kg} \\ +328 \textup{kg} \\ +301 \textup{kg} \\ +270 \textup{kg} \\ +330 \textup{kg} +\end{bmatrix} += +\begin{bmatrix} +u_1 \\ +u_2 \\ +u_3 \\ +u_4 \\ +u_5 \\ +u_6 \\ +u_7 +\end{bmatrix} ++ +\begin{bmatrix} +1 & 1 & 0 1 \\ +1 & 1 & 0 0 \\ +1 & 0 & 1 1 \\ +1 & 0 & 1 0 \\ +1 & 0 & 1 1 \\ +1 & 0 & 0 1 +\end{bmatrix} ++ +\begin{bmatrix} +b_{mean} \\ +b_{1990} \\ +b_{1991} \\ +b_{male} \\ +\end{bmatrix} ++ +\begin{bmatrix} +e_1 \\ +e_2 \\ +e_3 \\ +e_4 \\ +e_5 \\ +e_6 \\ +e_7 +\end{bmatrix}\]

And the table for humans to understand:

Animal | mean | 1990 | 1991 | female – | - | - | - | - 1 | yes | yes | no | no 2 | yes | yes | no | yes 3 | yes | no | yes | no 4 | yes | no | yes | yes 5 | yes | no | yes | no 6 | yes | no | yes | yes 7 | yes | no | no | no

Even though each animal is said to participate in the mean, the result for the mean will now actually be the average of the base population. Math is weird sometimes.

Double-checking, the rank of X is still 4, so we can solve for the average of the base population, and the effect of being born in 1990, the effect of being born in 1991, and the effect of being female (although I think [Calvin already has an idea about that one]).

Whew! That was some transformation. We still haven't constrained this model enough to solve it, though. Now on to the genotype.

The statistical model: genotype as random effect

Remember I said above that genotype was a random effect? Statisticians say "a random effect is an effect that influences the variance and not the mean of the observation in question." I'm not sure exactly what that means or how that is applicable to genotype, but it does let us add an additional constraint to our model.

The basic gist of genetics is that organisms that are related to one another are similar to one another. Based on a pedigree, we can even say how related to one another animals are, and quantify that as the amount that the genotype terms should be allowed to vary between related animals.

We'll need a pedigree for our animals:

Calf Records

ID | Sire | Dam | Birth Year | Sex | YW (kg) – | - | - | - | - | - 1 | NA | NA | 1990 | Male | 354 2 | NA | NA | 1990 | Female | 251 3 | 1 | NA | 1991 | Male | 327 4 | 1 | NA | 1991 | Female | 328 5 | 1 | 2 | 1991 | Male | 301 6 | NA | 2 | 1991 | Female | 270 7 | NA | NA | 1992 | Male | 330

Now, because cows sexually reproduce, the genotype of one animal is halfway the same as that of either parent.<sup>a</sup> It should go without saying that each animal's genotype is identical to that of itself. From this we can then find the numerical multiplier for any relative (grandparent = 1/4, full sibling = 1, half sibling = 1/2, etc.). Let's write those values down in a table.

ID | 1 | 2 | 3 | 4 | 5 | 6 | 7 – | - | - | - | - | - | - | - 1 | 1 | 0 | 1/2 | 1/2 | 1/2 | 0 | 0 2 | 0 | 1 | 0 | 0 | 1/2 | 1/2 | 0 3 | 1/2 | 0 | 1 | 1/4 | 1/4 | 0 | 0 4 | 1/2 | 0 | 1/4 | 1 | 1/4 | 0 | 0 5 | 1/2 | 1/2 | 1/4 | 1/4 | 1 | 1/4 | 0 6 | 0 | 1/2 | 0 | 0 | 1/4 | 1 | 0 7 | 0 | 0 | 0 | 0 | 0 | 0 | 1

Hmm. All those numbers look suspiciously like a matrix. Why don't I put them into a matrix called A?

\[\begin{bmatrix} +1 & 0 & \frac{1}{2} & \frac{1}{2} & \frac{1}{2} & 0 & 0 \\ +0 & 1 & 0 & 0 & \frac{1}{2} & \frac{1}{2} & 0 \\ +\frac{1}{2} & 0 & 1 & \frac{1}{4} & \frac{1}{4} & 0 & 0 \\ +\frac{1}{2} & 0 & \frac{1}{4} & 1 & \frac{1}{4} & 0 & 0 \\ +\frac{1}{2} & \frac{1}{2} & \frac{1}{4} & \frac{1}{4} & 1 & \frac{1}{4} & 0 \\ +0 & \frac{1}{2} & 0 & 0 & \frac{1}{4} & 1 & 0 \\ +0 & 0 & 0 & 0 & 0 & 0 & 1 +\end{bmatrix}\]

Now I'm going to take the matrix with all of the u values, and call it μ. To quantify the idea of genetic relationship, I will then say that

\[\textup{var}(μ) = \mathbf{A}σ_μ^2\]

Where:

  • A = the relationship matrix defined above
  • σ<sub>μ</sub><sup>2</sup> = the standard deviation of all the genotypes

To fully constrain the system, I have to make two more assumptions: 1) that the error term in each animal's equation is independent from all other error terms, and 2) that the error term for each animal is independent from the value of the genotype. I will call the matrix holding the e values ε and then say

\[\textup{var}(ϵ) = \mathbf{I}σ_ϵ^2\]

\[\textup{cov}(μ, ϵ) = \textup{cov}(ϵ, μ) = 0\]

Substituting in the matrix names, our equation now looks like

\\Large Figure 25. Nearly complete mixed-model equation

\[\begin{bmatrix} +354 \textup{kg} \\ +251 \textup{kg} \\ +327 \textup{kg} \\ +328 \textup{kg} \\ +301 \textup{kg} \\ +270 \textup{kg} \\ +330 \textup{kg} +\end{bmatrix} += μ + X +\begin{bmatrix} +b_{mean} \\ +b_{1990} \\ +b_{1991} \\ +b_{male} \\ +\end{bmatrix} ++ ϵ\]

We are going to make three changes to this equation before we are ready to solve it, but they are cosmetic details for this example.

  1. Call the matrix on the left side of the equation Y (sometimes it's called the matrix of observations)
  2. Multiply μ by an identity matrix called Z. Multiplying by the identity matrix is the matrix form of multiplying by one, so nothing changes, but if we later want to find one animal's genetic effect on another animal's performance (e.g. a maternal effects model), we can alter Z to allow that/
  3. Call the matrix with all the b values β.

With all these changes, we now have

\[Y = Z μ + X β + ϵ\]

This is the canonical form of the mixed-model equation, and the form that Charles Henderson used to first predict breeding values of livestock.

Solving the equations

Henderson proved that the mixed-model equation can be solved by the following:

\\Large Figure 27. Solution to mixed-model equation

\[\begin{bmatrix} +\hat{β} \\ +\hat{μ} +\end{bmatrix} += +\begin{bmatrix} +X'X & X'Z \\ +Z'X & Z'Z&plus;A^{-1}\lambda +\end{bmatrix}^{-1} +\begin{bmatrix} +X'Y \\ +Z'Y +\end{bmatrix}\]

Where

  • The variables with hats are the statistical estimates of their mixed-model counterparts
    • The predicted value of μ is called the Best Linear Unbiased Predictor or BLUP
    • The estimated value of β is called the Best Linear Unbiased Estimate or BLUE
  • ' is the transpose operator
  • λ is a single real number that is a function of the heritability for the trait being predicted. It can be left out in many cases (λ = 1).
    • λ = (1-h<sup>2</sup>)/h<sup>2</sup>

What happened to

Footnotes

a

An animal can share its genome with itself by a factor of more than one: that's called inbreeding! We can account for this, and beefblup does as it calculates A. This is an area that actually merits a good deal of study: see chapter 2 of Linear Models for the Prediction of Animal Breeding Values by Raphael A. Mrode (ISBN 978 1 78064 391 5).

[BIF Guidelines]: https://beefimprovement.org/wp-content/uploads/2018/03/BIFGuidelinesFinalupdated0318.pdf [The Dot Product]: https://www.khanacademy.org/math/precalculus/x9e81a4f98389efdf:matrices/x9e81a4f98389efdf:multiplying-matrices-by-matrices/v/matrix-multiplication-intro [rank of _X]: https://math.stackexchange.com/a/2080577 [Calvin already has an idea about that one]: https://www.gocomics.com/calvinandhobbes/1992/12/02

diff --git a/dev/index.html b/dev/index.html index 537ba79..85c0bef 100644 --- a/dev/index.html +++ b/dev/index.html @@ -1,2 +1,2 @@ -Home · beefblup

beefblup

Documentation for BeefBLUP.

    Missing docstring.

    Missing docstring for beefblup. Check Documenter's build log for details.

    +Home · beefblup

    beefblup

    Documentation for BeefBLUP.

      Missing docstring.

      Missing docstring for beefblup. Check Documenter's build log for details.

      diff --git a/dev/search/index.html b/dev/search/index.html index ba7ad98..edc7631 100644 --- a/dev/search/index.html +++ b/dev/search/index.html @@ -1,2 +1,2 @@ -Search · beefblup

      Loading search...

        +Search · beefblup

        Loading search...

          diff --git a/dev/search_index.js b/dev/search_index.js index aca32c0..d9b8b68 100644 --- a/dev/search_index.js +++ b/dev/search_index.js @@ -1,3 +1,3 @@ var documenterSearchIndex = {"docs": -[{"location":"","page":"Home","title":"Home","text":"CurrentModule = BeefBLUP","category":"page"},{"location":"#beefblup","page":"Home","title":"beefblup","text":"","category":"section"},{"location":"","page":"Home","title":"Home","text":"Documentation for BeefBLUP.","category":"page"},{"location":"","page":"Home","title":"Home","text":"","category":"page"},{"location":"","page":"Home","title":"Home","text":"beefblup","category":"page"}] +[{"location":"beefblup-cli/","page":"-","title":"-","text":"CurrentModule = BeefBLUP","category":"page"},{"location":"beefblup-cli/","page":"-","title":"-","text":"beefblup Command Line Interface (CLI) documentation","category":"page"},{"location":"beefblup-cli/","page":"-","title":"-","text":"A work in progress","category":"page"},{"location":"beefblup-cli/","page":"-","title":"-","text":"Notice: This document is a draft for what the command-line interface for beefblup would look like as of version 1.0, if beefblup was even a command-line application to begin with (it's not). It is modeled (loosely) after the man page format. It is not intended to be taken seriously, but instead to serve as a useful thought experiment and brainstorming ground on the future of beefblup. Please use it if it clarifies things for you. If it doesn't, ignore it.","category":"page"},{"location":"beefblup-cli/#Input-file","page":"-","title":"Input file","text":"","category":"section"},{"location":"beefblup-cli/","page":"-","title":"-","text":"beefblup requires a very specific format of input file. The format may be in comma-separated values (CSV) or Excel 2007+ (XLSX) format. CSV files should not be quoted (and therefore cannot have commas within cell values). Other formats may be forthcoming.","category":"page"},{"location":"beefblup-cli/","page":"-","title":"-","text":"A beefblup data file must have at least six columns appearing in this order:","category":"page"},{"location":"beefblup-cli/","page":"-","title":"-","text":"ID\nSire ID\nDam ID\nBirthdate\nFixed effect(s)\nResponse variable(s)","category":"page"},{"location":"beefblup-cli/","page":"-","title":"-","text":"The first row always contains column names. The values of column names are unimportant for the first four columns, as they will always be treated the same regardless of the name. The generated report will use the column names of fixed effects and response variables as given.","category":"page"},{"location":"beefblup-cli/","page":"-","title":"-","text":"Each fixed effect should have its own column, to as many as are needed. There is no limit to the number of fixed effects as defined by beefblup, however its dependencies might have some. The same rules apply to response variables.","category":"page"},{"location":"beefblup-cli/","page":"-","title":"-","text":"Unknown values should be left blank (,,). Do not substitute null placeholders (e.g. NULL, NA, 0, nothing, undefined, etc.) for unknown values.","category":"page"},{"location":"beefblup-cli/","page":"-","title":"-","text":"An example spreadsheet might have the following format","category":"page"},{"location":"beefblup-cli/","page":"-","title":"-","text":"ID Sire ID Dam ID Birthdate Sex Weaning Weight\n1 1/1/1990 Male 354\n2 1 1/1/1990 Female 251\n3 1 1/1/1991 Male 327\n4 1 2 1/1/1991 Female 328\n5 2 1/1/1991 Male 301\n6 1/1/1991 Female 270\n7 1/1/1992 Male 330","category":"page"},{"location":"beefblup-cli/#Synopsis","page":"-","title":"Synopsis","text":"","category":"section"},{"location":"beefblup-cli/","page":"-","title":"-","text":"beefblup [-G SNPs_file] [-M num_response_vars] [-o report_spreadsheet]\n[--no-aod] [--no-year] [--no-season] [--no-autob] [--maternal] input_file\n[report_file]","category":"page"},{"location":"beefblup-cli/#Command-line-basic-syntax","page":"-","title":"Command line basic syntax","text":"","category":"section"},{"location":"beefblup-cli/","page":"-","title":"-","text":"The most basic input is to simply pass the input file name to the program.","category":"page"},{"location":"beefblup-cli/","page":"-","title":"-","text":"beefblup filename.csv","category":"page"},{"location":"beefblup-cli/","page":"-","title":"-","text":"In this case beefblup will insert fixed-effects for age-of-dam, year, and season, and will calculate the EBVs for the response variable in the final column. The report will then be saved as filename_report.txt.","category":"page"},{"location":"beefblup-cli/#Suppressing-automatically-calculated-fixed-effects","page":"-","title":"Suppressing automatically-calculated fixed effects","text":"","category":"section"},{"location":"beefblup-cli/","page":"-","title":"-","text":"If you don't wish to include one of the automatically calculated fixed-effects from your model, you can pass arguments to suppress them.","category":"page"},{"location":"beefblup-cli/#Suppress-Age-of-dam","page":"-","title":"Suppress Age-of-dam","text":"","category":"section"},{"location":"beefblup-cli/","page":"-","title":"-","text":"beefblup --no-aod filename.csv","category":"page"},{"location":"beefblup-cli/#Suppress-year","page":"-","title":"Suppress year","text":"","category":"section"},{"location":"beefblup-cli/","page":"-","title":"-","text":"beefblup --no-year filename.csv","category":"page"},{"location":"beefblup-cli/#Suppress-season","page":"-","title":"Suppress season","text":"","category":"section"},{"location":"beefblup-cli/","page":"-","title":"-","text":"beefblup --no-season filename.csv","category":"page"},{"location":"beefblup-cli/#Suppress-all-calculated-fixed-effects","page":"-","title":"Suppress all calculated fixed effects","text":"","category":"section"},{"location":"beefblup-cli/","page":"-","title":"-","text":"beefblup --no-autob filename.csv","category":"page"},{"location":"beefblup-cli/","page":"-","title":"-","text":"The argument --no-autob comes from the nomenclature of assigning fixed-effects to the matrix b in Henderson's mixed-model equations.","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"CurrentModule = BeefBLUP","category":"page"},{"location":"how-to-calculate-epds/#How-to-Calculate-EPDs","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"","category":"section"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"Not to exclude our Australian comrades or our dairy friends, this guide could alternately be called","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"How to Calculate Expected Breeding Values (EBVs)\nHow to Calculate Predicted Transmitting Abilities (PTAs)\nHow to Calculate Expected Progeny Differences (EPDs)","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"Since I'm mostly talking to American beef producers, though, we'll stick with EPDs for most of this discussion.","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"Expected Breeding Values (EBVs) (which are more often halved and published as Expected Progeny Differences [EPDs] or Predicted Transmitting Abilities [PTAs] in the United States) are generally found using Charles Henderson's linear mixed-model equations. Great, you say, what is that? I'm glad you asked...","category":"page"},{"location":"how-to-calculate-epds/#The-mathematical-model","page":"How to Calculate EPDs","title":"The mathematical model","text":"","category":"section"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"Every genetics textbook starts with the following equation","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"P = G + E","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"Where:","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"P = phenotype\nG = genotype (think: breeding value)\nE = environmental factors","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"Now, we can't identify every environmental factor that affects phenotype, but we can identify some of them, so let's substitute E with some absolutes. A good place to start is the \"contemporary group\" listings for the trait of interest in the [BIF Guidelines], though for the purposes of this example, I'm only going to consider sex, and birth year.","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"P = G + E_year + E_sex","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"Where:","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"En is the effect of n on the phenotype","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"Now let's say I want to find the weaning weight breeding value (G) of my favorite herd bull. I compile his stats, and then plug them into the equation and solve for G, right? Let's try that.","category":"page"},{"location":"how-to-calculate-epds/#Calf-Records","page":"How to Calculate EPDs","title":"Calf Records","text":"","category":"section"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"ID | Birth Year | Sex | YW (kg) – | - | - | - 1 | 1990 | Male | 354","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"354 textupkg = G_1 + E_1990 + E_male","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"Hmm. I just realized I don't know any of those E values. Come to think of it, I remember from math class that I will need as many equations as I have unknowns, so I will add equations for other animals that I have records for.","category":"page"},{"location":"how-to-calculate-epds/#Calf-Records-2","page":"How to Calculate EPDs","title":"Calf Records","text":"","category":"section"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"ID | Birth Year | Sex | YW (kg) – | - | - | - 1 | 1990 | Male | 354 2 | 1990 | Female | 251 3 | 1991 | Male | 327 4 | 1991 | Female | 328 5 | 1991 | Male | 301 6 | 1991 | Female | 270 7 | 1992 | Male | 330","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"beginaligned\n251 textupkg = G_2 + E_1990 + E_female \n327 textupkg = G_3 + E_1991 + E_male \n328 textupkg = G_4 + E_1991 + E_female \n301 textupkg = G_5 + E_1991 + E_male \n270 textupkg = G_6 + E_1991 + E_female \n330 textupkg = G_7 + E_1992 + E_male\nendaligned","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"Drat! Every animal I added brings more variables into the system than it eliminates! In fact, since each cow brings in at least one term (Gn), I will never be able to write enough equations to solve for G numerically. I will have to use a different approach.","category":"page"},{"location":"how-to-calculate-epds/#The-statistical-model:-the-setup","page":"How to Calculate EPDs","title":"The statistical model: the setup","text":"","category":"section"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"Since I can never solve for G directly, I will have to find some way to estimate it. I can switch to a statistical model and solve for G that way. The caveat with a statistical model is that there will be some level of error, but so long as we know and can control the level of error, that will be better than not knowing G at all.","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"Since we're switching into a statistical space, we should also switch the variables we're using. I'll rewrite the first equation as","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"y = b + u + e","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"Where:","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"y = Phenotype\nb = Environment\nu = Genotype\ne = Error","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"It's not as easy as simply substituting b for every E that we had above, however. The reason for that is that we must make the assumption that environment is a fixed effect and that genotype is a random effect. I'll go over why that is later, but for now, understand that we need to transform the environment terms and genotype terms separately.","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"We'll start with the environment terms.","category":"page"},{"location":"how-to-calculate-epds/#The-statistical-model:-environment-as-fixed-effects","page":"How to Calculate EPDs","title":"The statistical model: environment as fixed effects","text":"","category":"section"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"To properly transform the equations, I will have to introduce bmean terms in each animal's equation. This is part of the fixed effect statistical assumption, and it will let us obtain a solution.","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"Here are the transformed equations:","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"beginaligned\n354 textupkg = u_1 + b_mean + b_1990 + b_male + e_1 \n251 textupkg = u_2 + b_mean + b_1990 + b_female + e_2 \n327 textupkg = u_3 + b_mean + b_1991 + b_male + e_3 \n328 textupkg = u_4 + b_mean + b_1991 + b_female +e_4 \n301 textupkg = u_5 + b_mean + b_1991 + b_male + e_5 \n270 textupkg = u_6 + b_mean + b_1991 + b_female + e_6 \n330 textupkg = u_7 + b_mean + b_1992 + b_male + e_7\nendaligned","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"Statistical methods work best in matrix form, so I'm going to convert the set of equations above to a single matrix equation that means the exact same thing.","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"beginbmatrix\n354 textupkg \n251 textupkg \n327 textupkg \n328 textupkg \n301 textupkg \n270 textupkg \n330 textupkg\nendbmatrix\n=\nbeginbmatrix\nu_1 \nu_2 \nu_3 \nu_4 \nu_5 \nu_6 \nu_7\nendbmatrix\n+\nb_mean\n+\nbeginbmatrix\nb_1990 \nb_1990 \nb_1991 \nb_1991 \nb_1991 \nb_1991 \nb_1992\nendbmatrix\n+\nbeginbmatrix\nb_male \nb_female \nb_male \nb_female \nb_male \nb_female \nb_male\nendbmatrix\n+\nbeginbmatrix\ne_1 \ne_2 \ne_3 \ne_4 \ne_5 \ne_6 \ne_7\nendbmatrix","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"That's a nice equation, but now my hand is getting tired writing all those b terms over and over again, so I'm going to use [the dot product] to condense this down.","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"beginbmatrix\n354 textupkg \n251 textupkg \n327 textupkg \n328 textupkg \n301 textupkg \n270 textupkg \n330 textupkg\nendbmatrix\n=\nbeginbmatrix\nu_1 \nu_2 \nu_3 \nu_4 \nu_5 \nu_6 \nu_7\nendbmatrix\n+\nbeginbmatrix\n1 1 0 0 1 0 \n1 1 0 0 0 1 \n1 0 1 0 1 0 \n1 0 1 0 0 1 \n1 0 1 0 1 0 \n1 0 0 1 1 0\nendbmatrix\n+\nbeginbmatrix\nb_mean \nb_1990 \nb_1991 \nb_1992 \nb_male \nb_female\nendbmatrix\n+\nbeginbmatrix\ne_1 \ne_2 \ne_3 \ne_4 \ne_5 \ne_6 \ne_7\nendbmatrix","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"That matrix in the middle with all the zeros and ones is called the incidence matrix, and essentially reads like a table with each row corresponding to an animal, and each column corresponding to a fixed effect. For brevity, we'll just call it X, though. One indicates that the animal and effect go together, and zero means they don't. For our record, we could write a table to go with X, and it would look like this:","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"Animal | mean | 1990 | 1991 | 1992 | male | female – | - | - | - | - | - | - 1 | yes | yes | no | no | yes | no 2 | yes | yes | no | no | no | yes 3 | yes | no | yes | no | yes | no 4 | yes | no | yes | no | no | yes 5 | yes | no | yes | no | yes | no 6 | yes | no | yes | no | no | yes 7 | yes | no | no | yes | yes | no","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"Now that we have X, we have the ability to start making changes to allow us to solve for u. Immediately, we see that X is singular, meaning it can't be solved directly. We kind of already knew that, but now we can quantify it. We calculate the [rank of X], and find that there is only enough information contained in it to solve for 4 variables, which means we need to eliminate two columns.","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"There are several ways to effectively eliminate fixed effects in this type of system, but one of the simplest and the most common methods is to declare a base population, and lump the fixed effects of animals within the base population into the mean fixed effect. Note that it is possible to declare a base population that has no animals in it, but that gives weird results. For this example, we'll follow the convention built into beefblup and pick the last occuring form of each variable.","category":"page"},{"location":"how-to-calculate-epds/#Base-population","page":"How to Calculate EPDs","title":"Base population","text":"","category":"section"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"
          Year
          1992
          ","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"
          Sex
          Male
          ","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"Now in order to use the base population, we simply drop the columns representing conformity with the traits in the base population from X. Our new equation looks like","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"beginbmatrix\n354 textupkg \n251 textupkg \n327 textupkg \n328 textupkg \n301 textupkg \n270 textupkg \n330 textupkg\nendbmatrix\n=\nbeginbmatrix\nu_1 \nu_2 \nu_3 \nu_4 \nu_5 \nu_6 \nu_7\nendbmatrix\n+\nbeginbmatrix\n1 1 0 1 \n1 1 0 0 \n1 0 1 1 \n1 0 1 0 \n1 0 1 1 \n1 0 0 1\nendbmatrix\n+\nbeginbmatrix\nb_mean \nb_1990 \nb_1991 \nb_male \nendbmatrix\n+\nbeginbmatrix\ne_1 \ne_2 \ne_3 \ne_4 \ne_5 \ne_6 \ne_7\nendbmatrix","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"And the table for humans to understand:","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"Animal | mean | 1990 | 1991 | female – | - | - | - | - 1 | yes | yes | no | no 2 | yes | yes | no | yes 3 | yes | no | yes | no 4 | yes | no | yes | yes 5 | yes | no | yes | no 6 | yes | no | yes | yes 7 | yes | no | no | no","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"Even though each animal is said to participate in the mean, the result for the mean will now actually be the average of the base population. Math is weird sometimes.","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"Double-checking, the rank of X is still 4, so we can solve for the average of the base population, and the effect of being born in 1990, the effect of being born in 1991, and the effect of being female (although I think [Calvin already has an idea about that one]).","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"Whew! That was some transformation. We still haven't constrained this model enough to solve it, though. Now on to the genotype.","category":"page"},{"location":"how-to-calculate-epds/#The-statistical-model:-genotype-as-random-effect","page":"How to Calculate EPDs","title":"The statistical model: genotype as random effect","text":"","category":"section"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"Remember I said above that genotype was a random effect? Statisticians say \"a random effect is an effect that influences the variance and not the mean of the observation in question.\" I'm not sure exactly what that means or how that is applicable to genotype, but it does let us add an additional constraint to our model.","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"The basic gist of genetics is that organisms that are related to one another are similar to one another. Based on a pedigree, we can even say how related to one another animals are, and quantify that as the amount that the genotype terms should be allowed to vary between related animals.","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"We'll need a pedigree for our animals:","category":"page"},{"location":"how-to-calculate-epds/#Calf-Records-3","page":"How to Calculate EPDs","title":"Calf Records","text":"","category":"section"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"ID | Sire | Dam | Birth Year | Sex | YW (kg) – | - | - | - | - | - 1 | NA | NA | 1990 | Male | 354 2 | NA | NA | 1990 | Female | 251 3 | 1 | NA | 1991 | Male | 327 4 | 1 | NA | 1991 | Female | 328 5 | 1 | 2 | 1991 | Male | 301 6 | NA | 2 | 1991 | Female | 270 7 | NA | NA | 1992 | Male | 330","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"Now, because cows sexually reproduce, the genotype of one animal is halfway the same as that of either parent.a It should go without saying that each animal's genotype is identical to that of itself. From this we can then find the numerical multiplier for any relative (grandparent = 1/4, full sibling = 1, half sibling = 1/2, etc.). Let's write those values down in a table.","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"ID | 1 | 2 | 3 | 4 | 5 | 6 | 7 – | - | - | - | - | - | - | - 1 | 1 | 0 | 1/2 | 1/2 | 1/2 | 0 | 0 2 | 0 | 1 | 0 | 0 | 1/2 | 1/2 | 0 3 | 1/2 | 0 | 1 | 1/4 | 1/4 | 0 | 0 4 | 1/2 | 0 | 1/4 | 1 | 1/4 | 0 | 0 5 | 1/2 | 1/2 | 1/4 | 1/4 | 1 | 1/4 | 0 6 | 0 | 1/2 | 0 | 0 | 1/4 | 1 | 0 7 | 0 | 0 | 0 | 0 | 0 | 0 | 1","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"Hmm. All those numbers look suspiciously like a matrix. Why don't I put them into a matrix called A?","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"beginbmatrix\n1 0 frac12 frac12 frac12 0 0 \n0 1 0 0 frac12 frac12 0 \nfrac12 0 1 frac14 frac14 0 0 \nfrac12 0 frac14 1 frac14 0 0 \nfrac12 frac12 frac14 frac14 1 frac14 0 \n0 frac12 0 0 frac14 1 0 \n0 0 0 0 0 0 1\nendbmatrix","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"Now I'm going to take the matrix with all of the u values, and call it μ. To quantify the idea of genetic relationship, I will then say that","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"textupvar(μ) = mathbfAσ_μ^2","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"Where:","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"A = the relationship matrix defined above\nσμ2 = the standard deviation of all the genotypes","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"To fully constrain the system, I have to make two more assumptions: 1) that the error term in each animal's equation is independent from all other error terms, and 2) that the error term for each animal is independent from the value of the genotype. I will call the matrix holding the e values ε and then say","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"textupvar(ϵ) = mathbfIσ_ϵ^2","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"textupcov(μ ϵ) = textupcov(ϵ μ) = 0","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"Substituting in the matrix names, our equation now looks like","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"(Image: \\Large Figure 25. Nearly complete mixed-model equation)","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"beginbmatrix\n354 textupkg \n251 textupkg \n327 textupkg \n328 textupkg \n301 textupkg \n270 textupkg \n330 textupkg\nendbmatrix\n= μ + X\nbeginbmatrix\nb_mean \nb_1990 \nb_1991 \nb_male \nendbmatrix\n+ ϵ","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"We are going to make three changes to this equation before we are ready to solve it, but they are cosmetic details for this example.","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"Call the matrix on the left side of the equation Y (sometimes it's called the matrix of observations)\nMultiply μ by an identity matrix called Z. Multiplying by the identity matrix is the matrix form of multiplying by one, so nothing changes, but if we later want to find one animal's genetic effect on another animal's performance (e.g. a maternal effects model), we can alter Z to allow that/\nCall the matrix with all the b values β.","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"With all these changes, we now have","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"Y = Z μ + X β + ϵ","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"This is the canonical form of the mixed-model equation, and the form that Charles Henderson used to first predict breeding values of livestock.","category":"page"},{"location":"how-to-calculate-epds/#Solving-the-equations","page":"How to Calculate EPDs","title":"Solving the equations","text":"","category":"section"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"Henderson proved that the mixed-model equation can be solved by the following:","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"(Image: \\Large Figure 27. Solution to mixed-model equation)","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"beginbmatrix\nhatβ \nhatμ\nendbmatrix\n=\nbeginbmatrix\nXX XZ \nZX ZZplusA^-1lambda\nendbmatrix^-1\nbeginbmatrix\nXY \nZY\nendbmatrix","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"Where","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"The variables with hats are the statistical estimates of their mixed-model counterparts\nThe predicted value of μ is called the Best Linear Unbiased Predictor or BLUP\nThe estimated value of β is called the Best Linear Unbiased Estimate or BLUE\n' is the transpose operator\nλ is a single real number that is a function of the heritability for the trait being predicted. It can be left out in many cases (λ = 1).\nλ = (1-h2)/h2","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"What happened to","category":"page"},{"location":"how-to-calculate-epds/#Footnotes","page":"How to Calculate EPDs","title":"Footnotes","text":"","category":"section"},{"location":"how-to-calculate-epds/#a","page":"How to Calculate EPDs","title":"a","text":"","category":"section"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"An animal can share its genome with itself by a factor of more than one: that's called inbreeding! We can account for this, and beefblup does as it calculates A. This is an area that actually merits a good deal of study: see chapter 2 of Linear Models for the Prediction of Animal Breeding Values by Raphael A. Mrode (ISBN 978 1 78064 391 5).","category":"page"},{"location":"how-to-calculate-epds/","page":"How to Calculate EPDs","title":"How to Calculate EPDs","text":"[BIF Guidelines]: https://beefimprovement.org/wp-content/uploads/2018/03/BIFGuidelinesFinalupdated0318.pdf [The Dot Product]: https://www.khanacademy.org/math/precalculus/x9e81a4f98389efdf:matrices/x9e81a4f98389efdf:multiplying-matrices-by-matrices/v/matrix-multiplication-intro [rank of _X]: https://math.stackexchange.com/a/2080577 [Calvin already has an idea about that one]: https://www.gocomics.com/calvinandhobbes/1992/12/02","category":"page"},{"location":"","page":"Home","title":"Home","text":"CurrentModule = BeefBLUP","category":"page"},{"location":"#beefblup","page":"Home","title":"beefblup","text":"","category":"section"},{"location":"","page":"Home","title":"Home","text":"Documentation for BeefBLUP.","category":"page"},{"location":"","page":"Home","title":"Home","text":"","category":"page"},{"location":"","page":"Home","title":"Home","text":"beefblup","category":"page"}] }