TY - JOUR
T1 - Multiple cancer type classification by small RNA expression profiles with plasma samples from multiple facilities
AU - small RNA based cancer classification project
AU - Suzuki, Kuno
AU - Igata, Hideyoshi
AU - Abe, Motoki
AU - Yamamoto, Yusuke
AU - Iwanaga, Terunao
AU - Kanzaki, Hiroaki
AU - Kato, Naoya
AU - Tanaka, Nobuko
AU - Kawasaki, Kenji
AU - Matsushita, Kazuyuki
AU - Samashima, Ryu
AU - Tsukino, Keiji
AU - Yokomizo, Akira
AU - Miyashita, Yosuke
AU - Sumiyoshi, Issei
AU - Takahashi, Kazuhisa
AU - Serizawa, Nobuko
AU - Tomishima, Ko
AU - Nagahara, Akihito
AU - Ishizuka, Yumiko
AU - Horimoto, Yoshiya
AU - Nagata, Masayoshi
AU - Ishikawa, Keisuke
AU - Horie, Shigeo
AU - Shiina, Shuichiro
AU - Nasu, Motomi
AU - Hashimoto, Takashi
AU - Mine, Shinji
AU - Kawano, Shingo
AU - Sugimoto, Kiichi
AU - Sakamoto, Kazuhiro
AU - Takemura, Hiroyuki
AU - Wakita, Mitsuru
AU - Tabe, Yoko
AU - Kato, Shunsuke
AU - Miyagi, Yohei
AU - Adachi, Hiroyuki
AU - Isaka, Tetsuya
AU - Ito, Hiroyuki
AU - Yamanaka, Takashi
AU - Yoshida, Tatsuya
AU - Yamashita, Toshinari
AU - Ogata, Takashi
AU - Yamada, Takanobu
AU - Oshima, Takashi
AU - Yamamoto, Naoto
AU - Shigeyasu, Kunitoshi
AU - Noma, Kazuhiro
AU - Fujiwara, Toshiyoshi
AU - Morita, Mizuki
N1 - Funding Information:
We would like to thank the participants in this study for their involvement. We would like to thank Hiroyuki Hamada and Yuko Yamakage for NGS data acquisition; Hiroyuki Mano (National Cancer Center), Kosei Hasegawa (Saitama Medical University), Motohide Uemura (Osaka University), Nobuyuki Ota (Preferred Networks America), Shinya Fukumoto (Osaka City University), Shoji Nagao (Hyogo Cancer Center), andTamotsu Sudo (Hyogo Cancer Center), for research advice; and Kiyohide Ishikura, who is a liaison officer in medical facility relations.
Publisher Copyright:
© 2022 The Authors. Cancer Science published by John Wiley & Sons Australia, Ltd on behalf of Japanese Cancer Association.
PY - 2022/6
Y1 - 2022/6
N2 - Liquid biopsy is expected to be a promising cancer screening method because of its low invasiveness and the possibility of detecting multiple types in a single test. In the last decade, many studies on cancer detection using small RNAs in blood have been reported. To put small RNA tests into practical use as a multiple cancer type screening test, it is necessary to develop a method that can be applied to multiple facilities. We collected samples of eight cancer types and healthy controls from 20 facilities to evaluate the performance of cancer type classification. A total of 2,475 cancer samples and 496 healthy control samples were collected using a standardized protocol. After obtaining a small RNA expression profile, we constructed a classification model and evaluated its performance. First, we investigated the classification performance using samples from five single facilities. Each model showed areas under the receiver curve (AUC) ranging from 0.67 to 0.89. Second, we performed principal component analysis (PCA) to examine the characteristics of the facilities. The degree of hemolysis and the data acquisition period affected the expression profiles. Finally, we constructed the classification model by reducing the influence of these factors, and its performance had an AUC of 0.76. The results reveal that small RNA can be used for the classification of cancer types in samples from a single facility. However, interfacility biases will affect the classification of samples from multiple facilities. These findings will provide important insights to improve the performance of multiple cancer type classifications using small RNA expression profiles acquired from multiple facilities.
AB - Liquid biopsy is expected to be a promising cancer screening method because of its low invasiveness and the possibility of detecting multiple types in a single test. In the last decade, many studies on cancer detection using small RNAs in blood have been reported. To put small RNA tests into practical use as a multiple cancer type screening test, it is necessary to develop a method that can be applied to multiple facilities. We collected samples of eight cancer types and healthy controls from 20 facilities to evaluate the performance of cancer type classification. A total of 2,475 cancer samples and 496 healthy control samples were collected using a standardized protocol. After obtaining a small RNA expression profile, we constructed a classification model and evaluated its performance. First, we investigated the classification performance using samples from five single facilities. Each model showed areas under the receiver curve (AUC) ranging from 0.67 to 0.89. Second, we performed principal component analysis (PCA) to examine the characteristics of the facilities. The degree of hemolysis and the data acquisition period affected the expression profiles. Finally, we constructed the classification model by reducing the influence of these factors, and its performance had an AUC of 0.76. The results reveal that small RNA can be used for the classification of cancer types in samples from a single facility. However, interfacility biases will affect the classification of samples from multiple facilities. These findings will provide important insights to improve the performance of multiple cancer type classifications using small RNA expression profiles acquired from multiple facilities.
KW - liquid biopsy
KW - machine learning
KW - multiple cancer type classification
KW - multiple facilities
KW - NGS
KW - small RNA
UR - http://www.scopus.com/inward/record.url?scp=85132455290&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85132455290&partnerID=8YFLogxK
U2 - 10.1111/cas.15309
DO - 10.1111/cas.15309
M3 - Article
C2 - 35218669
AN - SCOPUS:85132455290
SN - 1347-9032
VL - 113
SP - 2144
EP - 2166
JO - Cancer Science
JF - Cancer Science
IS - 6
ER -