Browse Articles

Article|25 Jan 2025|OPEN
Genomic insights into the domestication and genetic basis of yield in papaya 
Min Yang1,3 , , Xiangdong Kong2,3 , Chenping Zhou1 , Ruibin Kuang1 , Xiaming Wu1 , Chuanhe Liu1 and Han He1 , Ze Xu1 , Yuerong Wei,1 ,
1Institute of Fruit Tree Research, Guangdong Academy of Agricultural Sciences; Key Laboratory of South Subtropical Fruit Biology and Genetic Resource Utilization, Ministry of Agriculture and Rural Affairs, Guangdong Provincial Key Laboratory of Science and Technology Research on Fruit Tree, No. 80, Dafeng 2nd Street, Tianhe District, Guangzhou 510640, China
2JiguangGene Biotechnology Co., Ltd., No. 9, Huida Road, Pukou District, Nanjing 210031, China
3These authors contributed equally: Min Yang, Xiangdong Kong
*Corresponding author. E-mail: yangmin@gdaas.cn,weiyuerong@gdaas.cn

Horticulture Research 12,
Article number: uhaf045 (2025)
doi: https://doi.org/10.1093/hr/uhaf045
Views: 1361

Received: 17 Oct 2024
Accepted: 25 Jan 2025
Published online: 25 Jan 2025

Abstract

Papaya (Carica papaya L.) is an important tropical and subtropical fruit crop, and understanding its genome is essential for breeding. In this study, we assembled a high-quality genome of 344.17 Mb for the newly cultivated papaya ‘Zihui’, which contains 22 250 protein-coding genes. By integrating 201 resequenced papaya genomes, we identified four distinct papaya groups and a 34 Mb genomic region with strong domestication selection signals. Within these regions, two key genes associated with papaya yield were discovered: Cp_zihui06549, encoding a leucine-rich receptor-like protein kinase, and Cp_zihui06768, encoding the accumulation of photosystem one 1 (APO1) protein. Heterologous expression of Cp_zihui06549 in tomato confirmed that the total number of fruits in transgenic lines more than doubled compared to wild-type plants, resulting in a significant yield increase. Furthermore, we constructed a pan-genome of papaya and obtained a 77.41 Mb nonreference sequence containing 1543 genes. Within this pan-genome, 2483 variable genes, we detected, including four genes annotated as the ‘terpene synthase activity’ Gene Ontology term, which were lost in cultivars during domestication. Finally, gene retention analyses were performed using gene presence and absence variation data and differentially expressed genes across various tissues and organs. This study provides valuable insights into the genes and loci associated with phenotypes and domestication processes, laying a solid foundation for future papaya breeding efforts.