Browsing by Subject "Genotype-environment interaction"
Now showing 1 - 3 of 3
- Results Per Page
- Sort Options
Publication Analyses of quantitative traits in varying environments in pigs and Brown Swiss cattle(2020) Imort-Just, Nina Annik; Bennewitz, JörnComplex breeding programmes are used worldwide to achieve genetic progress in quantitative traits. These programmes are basically similarly structured, independent of breed and species, and are characterised by successive steps. The adjustment and revision of breeding programmes is of ongoing interest due to several reasons, e.g. research findings and various political, economic, social and ethical aspects. For the long-term improvement of livestock populations, multiple traits are included in the breeding goals of such programmes. Unfavourable genetic correlations between economically important production traits and functional traits compromise the genetic progress in animal health and welfare. Novel functional and behavioural traits and the adjustment of relative economic weights for the optimisation of total merit indices are investigated in research projects. In addition, genotype by environment interactions (GxE) can influence the genetic improvement of livestock populations and their consideration is crucial due to the globalisation of breeding structures and associated varying production environments. The main objectives of this thesis were 1) to investigate novel behavioural traits in pig breeding, 2) to derive environmental-specific relative economic weights based on consumers preferences for Brown Swiss cattle, 3) to estimate GxE at a trait and an index level by applying bivariate sire models in Brown Swiss cattle and 4) to analyse GxE for different production and functional traits in Brown Swiss cattle using reaction norm models. In Chapter 1 genetic parameters for mothering ability traits were estimated. Heritabilities (h²) were estimated by applying a linear mixed- and a threshold model, ranging between 0.02 to 0.07 and 0.05 to 0.15, respectively. The slightly higher estimates for the repeatability ranged from 0.05 to 0.09 and from 0.08 to 0.17, indicating the influence of the permanent environment. Highest h² were found for the group- and nursing behaviour of the sows. Highest genetic correlations were found between group-/nursing behaviour and litter balance and piglet weaning weight with estimates between 0.71 and 0.86. Accelerating genetic gain for improved mothering ability in sows is possible. In Chapter 2, the importance of breeding traits of Brown Swiss cattle in terms of selection decisions of farm managers was evaluated by using a discrete choice experiment (DCE). Environment-specific relative economic weights (REW) and the marginal willingness to pay were estimated by using a conditional logit model. Several trait complexes, the semen price, interactions between these traits and specific characteristics of the farms were included. Farmers showed highest preferences for the milk value, conformation/udder and fitness. Interactions indicated a great importance of the fitness trait complex for organically managed farms compared to conventional farmers. The latter weighted the milk value and the trait complex conformation/udder high. The method is suitable to judge farmers’ preferences for specific traits, especially those which have not yet been monetarily evaluated. Chapter 3 investigated GxE at a trait and an index level for Brown Swiss sires. Bivariate sire models were used to estimate genetic correlations between organic and conventional production systems and two altitude level of the farms for milk production traits and several functional traits. Furthermore, to determine possible GxE and rerankings at an index level, total merit indices for the sires were calculated. The genetic correlations were high between the analysed environments, ranging from 0.79 (first insemination to conception between different altitude levels) to 0.99 (calving to first insemination, cystic ovaries, maternal stillbirth between production systems). The results indicate no severe GxE at a trait level. No putative GxE effects were found for the production system environments at an index level. In Chapter 4, GxE for various production and functional traits in Brown Swiss cattle were analysed using random regression reaction norm models. The continuous environmental descriptor milk energy yield (MEY) was calculated as a linear combination of average herd effects, obtained from the routine breeding value estimation, of milk-, fat- and protein yield. The applied reaction norm model included a random sire effect and a random sire slope effect (environmental sensitivity), i.e. the random regression coefficient of the regression of a specific sire on the environmental descriptor MEY. To investigate putative GxE effects, breeding values for the sires were estimated. Results showed no severe GxE for the functional traits but for the production trait fat yield. In addition, the slope variances as descriptors of the environmental sensitivity and the Spearman rank correlations between the estimated breeding values of the sires at different environmental levels indicate no severe GxE for the investigated traits.Publication Model selection by cross-validation in multi-environment trials(2017) Hadasch, Steffen; Piepho, Hans-PeterIn plant breeding, estimation of the performance of genotypes across a set of tested environments (genotype means), and the estimation of the environment-specific performances of the genotypes (genotype-environment means) are important tasks. For this purpose, breeders conduct multi-environment trials (MET) in which a set of genotypes is tested in a set of environments. The data from such experiments are typically analysed by mixed models as such models for example allow modelling the genotypes using random effects which may be correlated according to their genetic information. The data from MET are often high-dimensional and the covariance matrix of the data may contain many parameters that need to be estimated. To circumvent computational burdens, the data can be analysed in a stage-wise fashion. In the stage-wise analysis, the covariance matrix of the data needs to be taken into account in the estimation of the individual stages. In the analysis of MET data, there is usually a set of candidate models from which the one that fits bets to the objective of the breeder needs to be determined. Such a model selection can be done by cross validation (CV). In the application of CV schemes, different objectives of the breeder can be evaluated using an appropriate sampling strategy. In the application of a CV, both the sampling strategy and the evaluation of the model need to take the correlation of the data into account to evaluate the model performance adequately. In this work, two different types of models that are used for the analysis of MET were focused. In Chapter 2, models that use genetic marker information to estimate the genotype means were considered. In Chapters 3 and 4, the estimation of genotype-environment means using models that include multiplicative terms to describe the genotype-environment interaction, namely the additive main effects and multiplicative interaction (AMMI), and the genotype and genotype-environment interaction (GGE) model, were focused. In all the Chapters, the models were estimated in a stage-wise fashion. Furthermore, CV was used in Chapters 2 and 3 to determine the most appropriate model from a set of candidate models. In Chapter 2, two traits of a biparental lettuce (Lactuca sativa L.) population were analysed by models for (i) phenotypic selection, (ii) marker-assisted selection using QTL-linked markers, (iii) genomic prediction using all available molecular markers, and (iv) a combination of genomic prediction and QTL-linked markers. Using different sampling strategies in a CV, the predictive performances of these models were compared in terms of different objectives of a breeder, namely predicting unobserved genotypes, predicting genotypes in an unobserved environment, and predicting unobserved genotypes in an unobserved environment. Generally, the genomic prediction model outperformed marker assisted and phenotypic selection when there are only a few markers with large effects, while the marker assisted selection outperformed genomic prediction when the number of markers with large effects increases. Furthermore, the results obtained for the different objectives indicate that the predictive performance of the models in terms of predicting (unobserved) genotypes in an unobserved environment is reduced due to the presence of genotype-environment interaction. In AMMI/GGE models, the number of multiplicative terms can be determined by CV. In Chapter 3, different CV schemes were compared in a simulation study in terms of recovering the true (simulated) number of multiplicative terms, and in terms of the mean squared error of the estimated genotype-environment means. The data were simulated using the estimated variance components of a randomized complete block design and a resolvable incomplete block design. The effects of the experimental design (replicates and blocks) need to be taken into account in the application of a CV in order to evaluate the predictive performance of the model adequately. In Chapter 3, the experimental design was accounted for by an adjustment of the data for the design effects estimated from all data before applying a CV scheme. The results of the simulation study show that an adjustment of the data is required to determine the number of multiplicative terms in AMMI/GGE models. Furthermore, the results indicate that different CV schemes can be used with similar efficiencies provided that the data were adjusted adequately. AMMI/GGE models are typically estimated in a two-stage analysis in which the first stage consists of estimating the genotype-environment means while the second stage consists of estimating main effects of genotypes and environments and the multiplicative interaction. The genotype-environment means estimated in the first stage are not independent when effects of the experimental design are modelled as random effects. In such a case, estimation of the second stage should be done by a weighted (generalized least squares) estimation where a weighting matrix is used to take the covariance matrix of the estimated genotype-environment means into account. In Chapter 4, three different algorithms which can take the full covariance matrix of the genotype-environment means into account are introduced to estimate the AMMI/GGE model in a weighted fashion. To investigate the effectiveness of the weighted estimation, the algorithms were implemented using different weighting matrices, including (i) an identity matrix (unweighted estimation), (ii) a diagonal approximation of the inverse covariance matrix of the genotype-environment means, and (iii) the full inverse covariance matrix. The different weighting strategies were compared in a simulation study in terms of the mean squared error of the estimated genotype-environment means, multiplicative interaction effects, and Biplot coordinates. The results of the simulation study show that weighted estimation of the AMMI/GGE model generally outperformed unweighted estimation. Furthermore, the effectiveness of a weighted estimation increased when the heterogeneity in the covariance matrix of the estimated genotype-environment means increased. The analysis of MET in a stage-wise fashion is an efficient procedure to estimate a model for MET data, whereas the covariance structure of the data needs to be taken into account in each stage in order to estimate the model appropriately. When correlated data are used in a CV, the correlation can be taken into account by an appropriate choice of training and validation data, by an adjustment of the data before applying a CV scheme and by the success criterion used in a CV scheme.Publication Statistical methods for analysis of multienvironment trials in plant breeding : accuracy and precision(2021) Buntaran, Harimurti; Piepho, Hans-PeterMultienvironment trials (MET) are carried out every year in different environmental conditions to evaluate a vast number of cultivars, i.e., yield, because different cultivars perform differently in various environmental conditions, known as genotype×environment interactions. MET aim to provide accurate information on cultivar performance so that a recommendation of which cultivar performs the best in a growers’ field condition can be available. MET data is often analysed via mixed models, which allow the cultivar effect to be random. The random effect of cultivar enables genetic correlation to be exploited across zones and considering the trials’ heterogeneity. A zone can be viewed as a larger target of population environments. The accuracy and precision of the cultivar predictions are crucial to be evaluated. The prediction accuracy can be evaluated via a cross-validation (CV) study, and the model selection can be done based on the lowest mean squared error prediction (MSEP). Also, since the trials’ locations hardly coincide with growers’ field, the precision of predictions needs to be evaluated via standard errors of predictions of cultivar values (SEPV) and standard errors of the predictions of pairwise differences of cultivar values (SEPD). The central objective of this thesis is to assess the model performance and conduct model selection via a CV study for zone-based cultivar predictions. Chapter 2 assessed the performance between empirical best linear unbiased estimations (EBLUE) and empirical best linear unbiased predictions (EBLUP) for zone-based prediction. Different CV schemes were done for the single-year and multi-year datasets to mimic the practice. A complex covariance structure such as factor-analytic (FA) was imposed to account for the heterogeneity of cultivar×zone (CZ) effect. The MSEP showed that the EBLUP models outperformed the EBLUE models. The zonation was necessary since it improved the accuracy and was preferable to make cultivar recommendations. The FA structure did not improve the accuracy compared to the simpler covariance structure, and so the EBLUP model with a simple covariance structure is sufficient for the single and multi-year datasets. Chapter 3 assessed the single-stage and stagewise analyses. The three weighting methods were compared in the stagewise analysis: two diagonal approximation methods and the fully efficient method with the unweighted analysis. The assessment was based on the MSEP instead of Pearson’s and Spearman’s correlation coefficients since the correlation coefficients are often very close between the compared models. The MSEP showed that the single-stage EBLUP and the stagewise weighting EBLUP strategy were very similar. Thus, the loss of information due to diagonal approximation is minor. In fact, the MSEP showed a more apparent distinction between the single-stage and the stagewise weighting analyses with the unweighted EBLUE compared to the correlation coefficients. The simple compound-symmetric covariance structure was sufficient for the CZ effect than the more complex structures. The choice between the single-stage and stagewise weighting analysis, thus, depends on the computational resources and the practicality of data handling. Chapter 4 assessed the accuracy and precision of the predictions for the new locations. The environmental covariates were combined with the EBLUP in the random coefficient (RC) models since the covariates provide more information for the new locations. The MSEP showed that the RC models were not the model with the smallest MSEP, but the RC models had the lowest SEPV and SEPD. Thus, the model selection can be done by joint consideration of the MSEP, SEPV, and SEPD. The models with EBLUE and covariate interaction effects performed poorly regarding the MSEP. The EBLUP models without RC performed best, but the SEPV and SEPD were large, considered unreliable. The covariate scale and selection are essential to obtain a positive definite covariance matrix. Employing unstructured covariance int the RC is crucial to maintaining the RC models’ invariance feature. The RC framework is suitable to be implemented with GIS data to provide an accurate and precise projection of cultivar performance for the new locations or environments. To conclude, the EBLUP model for zoned-based predictions should be preferred to obtain the predictions and rankings closer to the true values and rankings. The stagewise weighting analysis can be recommended due to its practicality and its computational efficiency. Furthermore, projecting cultivar performances to the new locations should be done to provide more targeted information for growers. The available environmental covariates can be utilised to improve the predictions’ accuracy and precision in the new locations in the RC model framework. Such information is certainly more valuable for growers and breeders than just providing means across a whole target population of environments.