
Analysis of the genetic basis of fiber-related traits and flowering time in upland cotton using machine learning


Source:
THEORETICAL AND APPLIED GENETICS
Source URL:
https://link.springer.com/article/10.1007/s00122-025-04821-2
Type:
Academic literature
Language:
English
Original publication date:
2025-01-01
Abstract:
Cotton is an important crop for fiber production, but the genetic basis underlying key agronomic traits, such as fiber quality and flowering days, remains complex. While machine learning (ML) has shown great potential in uncovering the genetic architecture of complex traits in other crops, its application in cotton has been limited. Here, we applied five machine learning models (AdaBoost, Gradient Boosting Regressor, LightGBM, Random Forest, and XGBoost) to identify loci associated with fiber quality and flowering days in cotton. We compared two methods of down-sampling the SNP dataset for model training and found that selecting SNPs with an Fscale value greater than 0 gave higher model accuracy than selecting SNPs at random. We further performed machine learning quantitative trait locus (mlQTL) analysis for 13 traits related to fiber quality and flowering days. These mlQTLs were then compared to those identified through genome-wide association studies (GWAS), revealing that the machine learning approach not only confirmed known loci but also identified novel QTLs. Additionally, we evaluated the effect of population size on model accuracy and found that larger population sizes resulted in better predictive performance. Finally, we proposed candidate genes for the identified mlQTLs, including two argonaute 5 proteins, Gh_A09G104100 and Gh_A09G104400, for the FL3/FS2 locus, as well as GhFLA17 and Syntaxin-121 (Gh_D09G143700) for the FSD09_2/FED09_2 locus. Our findings demonstrate the efficacy of machine learning in enhancing the identification of genetic loci in cotton, providing valuable insights for improving cotton breeding strategies.
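The comparison of score-based and random SNP down-sampling described in the abstract can be illustrated with a small simulation. This is only a rough sketch under stated assumptions, not the authors' pipeline. The abstract does not define the Fscale value, so a univariate regression F-statistic is substituted as a stand-in score. The genotypes, phenotype, effect sizes, and all function names are hypothetical. Instead of training the five models, the sketch uses a simpler proxy: how many simulated causal SNPs each down-sampling method keeps.

```python
import random


def simulate(n_samples=200, n_snps=300, n_causal=5, seed=0):
    """Simulate 0/1/2 genotypes and a phenotype driven by the first n_causal SNPs.

    All values are synthetic; nothing here comes from the paper's data.
    """
    rng = random.Random(seed)
    geno = [[rng.choice((0, 1, 2)) for _ in range(n_snps)] for _ in range(n_samples)]
    causal = list(range(n_causal))
    # Additive effect of 1 per allele at each causal SNP, plus Gaussian noise.
    pheno = [sum(row[j] for j in causal) + rng.gauss(0.0, 1.0) for row in geno]
    return geno, pheno, causal


def f_statistic(x, y):
    """F-statistic of a one-predictor linear regression of y on x.

    Stand-in for the paper's Fscale score, which the abstract does not define.
    """
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    if sxx == 0 or syy == 0:
        return 0.0  # a constant SNP or constant phenotype carries no signal
    r2 = (sxy * sxy) / (sxx * syy)
    if r2 >= 1.0:
        return float("inf")
    return r2 * (n - 2) / (1.0 - r2)


def select_top(geno, pheno, k):
    """Keep the k SNPs with the highest stand-in score."""
    n_snps = len(geno[0])
    scores = [f_statistic([row[j] for row in geno], pheno) for j in range(n_snps)]
    return sorted(range(n_snps), key=lambda j: scores[j], reverse=True)[:k]


def select_random(n_snps, k, seed=1):
    """Keep k SNPs chosen uniformly at random."""
    return random.Random(seed).sample(range(n_snps), k)


geno, pheno, causal = simulate()
top = select_top(geno, pheno, k=20)
rand = select_random(len(geno[0]), k=20)

kept_top = len(set(top) & set(causal))
kept_rand = len(set(rand) & set(causal))
print(f"causal SNPs kept by score-based selection: {kept_top} of {len(causal)}")
print(f"causal SNPs kept by random selection: {kept_rand} of {len(causal)}")
```

In a fuller comparison, each SNP subset would be passed to the regression models and scored by cross-validated prediction accuracy. The count of retained causal SNPs is used here only because it keeps the example short and dependency-free.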