Institut für Pflanzenzüchtung, Saatgutforschung und Populationsgenetik
Permanent URI for this collectionhttps://hohpublica.uni-hohenheim.de/handle/123456789/13
Browse
Browsing Institut für Pflanzenzüchtung, Saatgutforschung und Populationsgenetik by Sustainable Development Goals "2"
Now showing 1 - 13 of 13
- Results Per Page
- Sort Options
Publication DeepCob: precise and high-throughput analysis of maize cob geometry using deep learning with an application in genebank phenomics(2021) Kienbaum, Lydia; Correa Abondano, Miguel; Blas, Raul; Schmid, KarlBackground: Maize cobs are an important component of crop yield that exhibit a high diversity in size, shape and color in native landraces and modern varieties. Various phenotyping approaches were developed to measure maize cob parameters in a high throughput fashion. More recently, deep learning methods like convolutional neural networks (CNNs) became available and were shown to be highly useful for high-throughput plant phenotyping. We aimed at comparing classical image segmentation with deep learning methods for maize cob image segmentation and phenotyping using a large image dataset of native maize landrace diversity from Peru. Results: Comparison of three image analysis methods showed that a Mask R-CNN trained on a diverse set of maize cob images was highly superior to classical image analysis using the Felzenszwalb-Huttenlocher algorithm and a Window-based CNN due to its robustness to image quality and object segmentation accuracy (r = 0.99). We integrated Mask R-CNN into a high-throughput pipeline to segment both maize cobs and rulers in images and perform an automated quantitative analysis of eight phenotypic traits, including diameter, length, ellipticity, asymmetry, aspect ratio and average values of red, green and blue color channels for cob color. Statistical analysis identified key training parameters for efficient iterative model updating. We also show that a small number of 10–20 images is sufficient to update the initial Mask R-CNN model to process new types of cob images. To demonstrate an application of the pipeline we analyzed phenotypic variation in 19,867 maize cobs extracted from 3449 images of 2484 accessions from the maize genebank of Peru to identify phenotypically homogeneous and heterogeneous genebank accessions using multivariate clustering. Conclusions: Single Mask R-CNN model and associated analysis pipeline are widely applicable tools for maize cob phenotyping in contexts like genebank phenomics or plant breeding.Publication Effects of using deep learning to predict the geographic origin of barley genebank accessions on genome–environment association studies(2025) Chang, Che-Wei; Schmid, KarlGenome–environment association (GEA) is an approach for identifying adaptive loci by combining genetic variation with environmental parameters, offering potential for improving crop resilience. However, its application to genebank accessions is limited by missing geographic origin data. To address this limitation, we explored the use of neural networks to predict the geographic origins of barley accessions and integrate imputed environmental data into GEA. Neural networks demonstrated high accuracy in cross-validation but occasionally produced ecologically implausible predictions as models solely considered geographical proximity. For example, some predicted origins were located within non-arable regions, such as the Mediterranean Sea. Using barley flowering time genes as benchmarks, GEA integrating imputed environmental data ( N=11,032) displayed partially concordant yet complementary detection of genomic regions near flowering time genes compared to regular GEA ( N=1,626), highlighting the potential of GEA with imputed data to complement regular GEA in uncovering novel adaptive loci. Also, contrary to our initial hypothesis anticipating a significant improvement in GEA performance by increasing sample size, our simulations yield unexpected insights. Our study suggests potential limitations in the sensitivity of GEA approaches to the considerable expansion in sample size achieved through predicting missing geographical data. Overall, our study provides insights into leveraging incomplete geographical origin data by integrating deep learning with GEA. Our findings indicate the need for further development of GEA approaches to optimize the use of imputed environmental data, such as incorporating regional GEA patterns instead of solely focusing on global associations between allele frequencies and environmental gradients across large-scale landscapes.Publication Exploiting genetic diversity in two European maize landraces for improving Gibberella ear rot resistance using genomic tools(2021) Gaikpa, David Sewordor; Kessel, Bettina; Presterl, Thomas; Ouzunova, Milena; Galiano-Carneiro, Ana L.; Mayer, Manfred; Melchinger, Albrecht E.; Schön, Chris-Carolin; Miedaner, ThomasFusarium graminearum (Fg) causes Gibberella ear rot (GER) in maize leading to yield reduction and contamination of grains with several mycotoxins. This study aimed to elucidate the molecular basis of GER resistance among 500 doubled haploid lines derived from two European maize landraces, “Kemater Landmais Gelb” (KE) and “Petkuser Ferdinand Rot” (PE). The two landraces were analyzed individually using genome-wide association studies and genomic selection (GS). The lines were genotyped with a 600-k maize array and phenotyped for GER severity, days to silking, plant height, and seed-set in four environments using artificial infection with a highly aggressive Fg isolate. High genotypic variances and broad-sense heritabilities were found for all traits. Genotype-environment interaction was important throughout. The phenotypic (r) and genotypic (rg) correlations between GER severity and three agronomic traits were low (r= − 0.27 to 0.20; rg = − 0.32 to 0.22). For GER severity, eight QTLs were detected in KE jointly explaining 34% of the genetic variance. In PE, no significant QTLs for GER severity were detected. No common QTLs were found between GER severity and the three agronomic traits. The mean prediction accuracies (p) of weighted GS (wRR-BLUP) were higher than p of marker-assisted selection (MAS) and unweighted GS (RR-BLUP) for GER severity. Using KE as the training set and PE as the validation set resulted in very low p that could be improved by using fixed marker effects in the GS model.Publication Genetic architecture underlying the expression of eight α-amylase trypsin inhibitors(2021) El Hassouni, Khaoula; Sielaff, Malte; Curella, Valentina; Neerukonda, Manjusha; Leiser, Willmar; Würschum, Tobias; Schuppan, Detlef; Tenzer, Stefan; Longin, C. Friedrich H.Amylase trypsin inhibitors (ATIs) are important allergens in baker’s asthma and suspected triggers of non-celiac wheat sensitivity (NCWS) inducing intestinal and extra-intestinal inflammation. As studies on the expression and genetic architecture of ATI proteins in wheat are lacking, we evaluated 149 European old and modern bread wheat cultivars grown at three different field locations for their content of eight ATI proteins. Large differences in the content and composition of ATIs in the different cultivars were identified ranging from 3.76 pmol for ATI CM2 to 80.4 pmol for ATI 0.19, with up to 2.5-fold variation in CM-type and up to sixfold variation in mono/dimeric ATIs. Generally, heritability estimates were low except for ATI 0.28 and ATI CM2. ATI protein content showed a low correlation with quality traits commonly analyzed in wheat breeding. Similarly, no trends were found regarding ATI content in wheat cultivars originating from numerous countries and decades of breeding history. Genome-wide association mapping revealed a complex genetic architecture built of many small, few medium and two major quantitative trait loci (QTL). The major QTL were located on chromosomes 3B for ATI 0.19-like and 6B for ATI 0.28, explaining 70.6 and 68.7% of the genotypic variance, respectively. Within close physical proximity to the medium and major QTL, we identified eight potential candidate genes on the wheat reference genome encoding structurally related lipid transfer proteins. Consequently, selection and breeding of wheat cultivars with low ATI protein amounts appear difficult requiring other strategies to reduce ATI content in wheat products.Publication Genetic dissection of drought tolerance in maize through GWAS of agronomic traits, stress tolerance indices, and phenotypic plasticity(2025) Li, Ronglan; Li, Dongdong; Guo, Yuhang; Wang, Yueli; Zhang, Yufeng; Li, Le; Yang, Xiaosong; Chen, Shaojiang; Würschum, Tobias; Liu, Wenxin; Han, De-GuoDrought severely limits crop yield every year, making it critical to clarify the genetic basis of drought tolerance for breeding of improved varieties. As drought tolerance is a complex quantitative trait, we analyzed three phenotypic groups: (1) agronomic traits under well-watered (WW) and water-deficit (WD) conditions, (2) stress tolerance indices of these traits, and (3) phenotypic plasticity, using a multi-parent doubled haploid (DH) population assessed in multi-environment trials. Genome-wide association studies (GWAS) identified 130, 171, and 71 quantitative trait loci (QTL) for the three groups of phenotypes, respectively. Only one QTL was shared among all trait groups, 25 between stress indices and agronomic traits, while the majority of QTL were specific to their group. Functional annotation of candidate genes revealed distinct pathways of the three phenotypic groups. Candidate genes under WD conditions were enriched for stress response and epigenetic regulation, while under WW conditions for protein synthesis and transport, RNA metabolism, and developmental regulation. Stress tolerance indices were enriched for transport of amino/organic acids, epigenetic regulation, and stress response, whereas plasticity showed enrichment for environmental adaptability. Transcriptome analysis of 26 potential candidate genes showed tissue-specific drought responses in leaves, ears, and tassels. Collectively, these results indicated both shared and independent genetic mechanisms underlying drought tolerance, providing novel insights into the complex phenotypes related to drought tolerance and guiding further strategies for molecular breeding in maize.Publication Genomic prediction in hybrid breeding: I. Optimizing the training set design(2023) Melchinger, Albrecht E.; Fernando, Rohan; Stricker, Christian; Schön, Chris-Carolin; Auinger, Hans-JürgenGenomic prediction holds great promise for hybrid breeding but optimum composition of the training set (TS) as determined by the number of parents (nTS) and crosses per parent (c) has received little attention. Our objective was to examine prediction accuracy (ra) of GCA for lines used as parents of the TS (I1 lines) or not (I0 lines), and H0, H1 and H2 hybrids, comprising crosses of type I0 × I0, I1 × I0 and I1 × I1, respectively, as function of nTS and c. In the theory, we developed estimates for ra of GBLUPs for hybrids: (i)r^a based on the expected prediction accuracy, and (ii) r~a based on ra of GBLUPs of GCA and SCA effects. In the simulation part, hybrid populations were generated using molecular data from two experimental maize data sets. Additive and dominance effects of QTL borrowed from literature were used to simulate six scenarios of traits differing in the proportion (τSCA = 1%, 6%, 22%) of SCA variance in σG2 and heritability (h2 = 0.4, 0.8). Values of r~a and r^a closely agreed with ra for hybrids. For given size NTS = nTS × c of TS, ra of H0 hybrids and GCA of I0 lines was highest for c = 1. Conversely, for GCA of I1 lines and H1 and H2 hybrids, c = 1 yielded lowest ra with concordant results across all scenarios for both data sets. In view of these opposite trends, the optimum choice of c for maximizing selection response across all types of hybrids depends on the size and resources of the breeding program.Publication Hybrid transcriptome sequencing approach improved assembly and gene annotation in Cynara cardunculus (L.)(2020) Puglia, Giuseppe D.; Prjibelski, Andrey D.; Vitale, Domenico; Bushmanova, Elena; Schmid, Karl J.; Raccuia, Salvatore A.Background: The investigation of transcriptome profiles using short reads in non-model organisms, which lack of well-annotated genomes, is limited by partial gene reconstruction and isoform detection. In contrast, long-reads sequencing techniques revealed their potential to generate complete transcript assemblies even when a reference genome is lacking. Cynara cardunculus var. altilis (DC) (cultivated cardoon) is a perennial hardy crop adapted to dry environments with many industrial and nutraceutical applications due to the richness of secondary metabolites mostly produced in flower heads. The investigation of this species benefited from the recent release of a draft genome, but the transcriptome profile during the capitula formation still remains unexplored. In the present study we show a transcriptome analysis of vegetative and inflorescence organs of cultivated cardoon through a novel hybrid RNA-seq assembly approach utilizing both long and short RNA-seq reads. Results: The inclusion of a single Nanopore flow-cell output in a hybrid sequencing approach determined an increase of 15% complete assembled genes and 18% transcript isoforms respect to short reads alone. Among 25,463 assembled unigenes, we identified 578 new genes and updated 13,039 gene models, 11,169 of which were alternatively spliced isoforms. During capitulum development, 3424 genes were differentially expressed and approximately two-thirds were identified as transcription factors including bHLH, MYB, NAC, C2H2 and MADS-box which were highly expressed especially after capitulum opening. We also show the expression dynamics of key genes involved in the production of valuable secondary metabolites of which capitulum is rich such as phenylpropanoids, flavonoids and sesquiterpene lactones. Most of their biosynthetic genes were strongly transcribed in the flower heads with alternative isoforms exhibiting differentially expression levels across the tissues. Conclusions: This novel hybrid sequencing approach allowed to improve the transcriptome assembly, to update more than half of annotated genes and to identify many novel genes and different alternatively spliced isoforms. This study provides new insights on the flowering cycle in an Asteraceae plant, a valuable resource for plant biology and breeding in Cynara and an effective method for improving gene annotation.Publication Optimum breeding strategies using genomic and phenotypic selection for the simultaneous improvement of two traits(2021) Marulanda, Jose J.; Mi, Xuefei; Utz, H. Friedrich; Melchinger, Albrecht E.; Würschum, Tobias; Longin, C. Friedrich H.Selection indices using genomic information have been proposed in crop-specific scenarios. Routine use of genomic selection (GS) for simultaneous improvement of multiple traits requires information about the impact of the available economic and logistic resources and genetic properties (variances, trait correlations, and prediction accuracies) of the breeding population on the expected selection gain. We extended the R package “selectiongain” from single trait to index selection to optimize and compare breeding strategies for simultaneous improvement of two traits. We focused on the expected annual selection gain (ΔGa) for traits differing in their genetic correlation, economic weights, variance components, and prediction accuracies of GS. For all scenarios considered, breeding strategy GSrapid (one-stage GS followed by one-stage phenotypic selection) achieved higher ΔGa than classical two-stage phenotypic selection, regardless of the index chosen to combine the two traits and the prediction accuracy of GS. The Smith–Hazel or base index delivered higher ΔGa for net merit and individual traits compared to selection by independent culling levels, whereas the restricted index led to lower ΔGa in net merit and divergent results for selection gain of individual traits. The differences among the indices depended strongly on the correlation of traits, their variance components, and economic weights, underpinning the importance of choosing the selection indices according to the goal of the breeding program. We demonstrate our theoretical derivations and extensions of the R package “selectiongain” with an example from hybrid wheat by designing indices to simultaneously improve grain yield and grain protein content or sedimentation volume.Publication Order from entropy: big data from FAIR data cohorts in the digital age of plant breeding(2025) Gogna, Abhishek; Arend, Daniel; Beier, Sebastian; Rezaei, Ehsan Eyshi; Würschum, Tobias; Zhao, Yusheng; Chu, Jianting; Reif, Jochen C.Lack of interoperable datasets in plant breeding research creates an innovation bottleneck, requiring additional effort to integrate diverse datasets—if access is possible at all. Handling of plant breeding data and metadata must, therefore, change toward adopting practices that promote openness, collaboration, standardization, ethical data sharing, sustainability, and transparency of provenance and methodology. FAIR Digital Objects, which build on research data infrastructures and FAIR principles, offer a path to address this interoperability crisis, yet their adoption remains in its infancy. In the present work, we identify data sharing practices in the plant breeding domain as Data Cohorts and establish their connection to FAIR Digital Objects. We further link these cohorts to broader research infrastructures and propose a Data Trustee model for federated data sharing. With this we aim to push the boundaries of data management, often viewed as the last step in plant breeding research, to an ongoing process to enable future innovations in the field.Publication Participatory research at scale: a comparative analysis of four approaches to large-scale agricultural technology testing with farmers(2024) Oberson, Nathalie; Moussa, Hannatou O; Aminou, Ali M; Kidane, Yosef Gebrehawaryat; Luo, Juliet Nangamba; Giuliani, Alessandra; Weltzien, Eva; Haussmann, Bettina IGTailoring agricultural technology options to the diverse conditions of smallholder farmers requires innovative approaches for testing these technologies with farmers across varied contexts, while incorporating their feedback into learning and decision-making processes. This study compares four such approaches: the Farmer Field School on Participatory Plant Breeding (FFS-PPB), Farmer Research Network (FRN), Crowdsourcing–Triadic comparisons of technologies (Tricot), and adapted Mother–Baby Trial (MBT) as implemented by four concrete projects. The objectives are to provide detailed descriptions of these approaches and their project-specific implementations, identify and analyze common aspects and differences, and derive insights to guide future farmer-inclusive projects aiming at contextual scaling of agricultural technologies. A literature review, key informant interviews, and a systematic content analysis were conducted for the analysis. Common features include cascade training models, simple farmer-managed experiments, and the use of digital tools for data collection. Major differences lie in the extent of farmer–researcher collaboration and decision-making, as well as how technology option-by-context interactions are addressed. The FRN, FFS-PPB, and adapted MBT approaches involve farmers in decision-making throughout most stages of research, including co-learning cycles that adapt the research design and technology options to farmers’ needs. Although these approaches require more training and expertise, they increase the likelihood of achieving relevant results that farmers can implement in practice. In contrast, more standardized approaches like the Crowdsourcing–Tricot streamline the implementation, data management and analysis of large-scale trials, but have limitations in capturing the underlying reasons for farmers’ preferences. Among the studied approaches, the FRN as implemented by the Women's Fields project in Niger is particularly effective in identifying which options best suit specific farming contexts.Publication The potential of hybrid breeding to enhance leaf rust and stripe rust resistance in wheat(2020) Beukert, Ulrike; Liu, Guozheng; Thorwarth, Patrick; Boeven, Philipp H. G.; Longin, C. Friedrich H.; Zhao, Yusheng; Ganal, Martin; Serfling, Albrecht; Ordon, Frank; Reif, Jochen C.Leaf rust and stripe rust belong to the most important fungal diseases in wheat production. Due to a dynamic development of new virulent races, epidemics appear in high frequency and causes significant losses in grain yield and quality. Therefore, research is needed to develop strategies to breed wheat varieties carrying highly efficient resistances. Stacking of dominant resistance genes through hybrid breeding is such an approach. Within this study, we investigated the genetic architecture of leaf rust and stripe rust resistance of 1750 wheat hybrids and their 230 parental lines using a genome-wide association study. We observed on average a lower rust susceptibility for hybrids in comparison to their parental inbred lines and some hybrids outperformed their better parent with up to 56%. Marker-trait associations were identified on chromosome 3D and 4A for leaf rust and on chromosome 2A, 2B, and 6A for stripe rust resistance by using a genome-wide association study with a Bonferroni-corrected threshold of P < 0.10. Detected loci on chromosomes 4A and 2A were located within previously reported genomic regions affecting leaf rust and stripe rust resistance, respectively. The degree of dominance was for most associations favorable in the direction of improved resistance. Thus, resistance can be increased in hybrid wheat breeding by fixing complementary leaf rust and stripe rust resistance genes with desired dominance effects in opposite parental pools.Publication Rapid cycling genomic selection in maize landraces(2025) Polzer, Clara; Auinger, Hans-Jürgen; Terán-Pineda, Michelle; Hölker, Armin C.; Mayer, Manfred; Presterl, Thomas; Rivera-Poulsen, Carolina; da Silva, Sofia; Ouzunova, Milena; Melchinger, Albrecht E.; Schön, Chris-CarolinKey message: A replicated experiment on genomic selection in a maize landrace provides valuable insights on the design of rapid cycling recurrent pre-breeding schemes and the factors contributing to their success. Abstract: The genetic diversity of landraces is currently underutilized for elite germplasm improvement. In this study, we investigated the potential of rapid cycling genomic selection for pre-breeding of a maize ( Zea mays L.) landrace population in replicated experiments. We trained the prediction model on a dataset (N = 899) composed of three landrace-derived doubled-haploid (DH) populations characterized for agronomic traits in 11 environments across Europe. All DH lines were genotyped with a 600 k SNP array. In two replications, three cycles of genomic selection and recombination were performed for line per se performance of early plant development, a major sustainability factor in maize production. From each cycle and replication, 100 DH lines were extracted. To evaluate selection response, the DH lines of all cycles and both replications (N = 688) were evaluated for per se performance of selected and unselected traits in seven environments. Selection was highly successful with an increase of about two standard deviations for traits under directional selection. Realized selection response was highest in the first cycle and diminished in following cycles. Selection gains predicted from genomic breeding values were only partially corroborated by realized gains estimated from adjusted means. Prediction accuracies declined sharply across cycles, but only for traits under directional selection. Retraining the prediction model with data from previous cycles improved prediction accuracies in cycles 2 and 3. Replications differed in selection response and particularly in accuracies. The experiment gives valuable insights with respect to the design of rapid cycling genomic selection schemes and the factors contributing to their success.Publication Using landscape genomics to infer genomic regions involved in environmental adaptation of soybean genebank accessions(2025) Haupt, Max; Schmid, KarlBackground: Understanding how crops adapt to specific environmental conditions is becoming increasingly important in the face of accelerating climate change, but the genetics of local adaptation remains little understood for many crops. Landscape genomics can reveal patterns of genetic variation that indicate adaptive diversification during crop evolution and dispersal. Here, we examine genetic differentiation and association signatures with environmental gradients in soybean ( Glycine max ) germplasm groups from China that were inferred from the USDA Soybean Germplasm Collection ( N = 17, 019 accessions) based on population structure and passport information. Results: We recover genes previously known to be involved in soybean environmental adaptation and report numerous new candidate genes in adaptation signatures implicated by genomic resources such as the genome annotation and gene expression datasets to function in flowering regulation, photoperiodism and stress reaction cascades. Linkage disequilibrium network analysis suggested functional relationships between genomic regions with signatures of genetic differentiation, consistent with a polygenic nature of environmental adaptation. We tested whether haplotypes associated with environmental adaptation in China were present in 843 North American and 160 European soybean cultivars and found that haplotypes in major genes for early maturity have been selected during breeding, but also that a large number of haplotypes exhibiting putative adaptive variation for cold regions at high latitudes are underrepresented in modern cultivars. Conclusions: Our results demonstrate the value of landscape genomics analysis of genebank accessions studying crop environmental adaptation and to inform future research and breeding efforts for improved adaptation of soybean and other crops to future climates.
