Author affiliations:
[Liu, Rui; Liu, R; Yao, Xinjing; Wang, Yujun] Cent China Normal Univ, Sch Informat Management, Wuhan, Hubei, Peoples R China.;[Liu, Chang] Chinese Acad Med Sci, Inst Med Plant Dev, Beijing, Peoples R China.
Corresponding affiliations:
[Liu, R; Liu, C] Cent China Normal Univ, Sch Informat Management, Wuhan, Hubei, Peoples R China.;Chinese Acad Med Sci, Inst Med Plant Dev, Beijing, Peoples R China.
Abstract:
A DNA barcode is a short, standardized DNA sequence used for species determination and discrimination. A compact representation of DNA barcodes is essential for their application in the transportation and recognition of biological materials. Previously, we compared different strategies for representing DNA barcodes. In the present study, we developed a compression algorithm based on a binary coding or Huffman coding scheme, followed by conversion of the binary digits into Base64 digits. Combining this compression algorithm with the QR representation yields the dynamic DNA QR coding algorithm (DDQR). We tested the DDQR algorithm on simulated data and on real DNA barcode sequences from the commonly used plant and animal DNA barcode markers: rbcL, matK, trnH-psbA, ITS2, and COI. We compared the compression efficiency of DDQR with that of another state-of-the-art DNA compression algorithm, GeCo3, for sequences with various base compositions and lengths. We found that DDQR had a higher compression rate than GeCo3 for DNA sequences shorter than 800 bp, which is the typical size range for DNA barcodes. We also upgraded a web server (http://www.1kmpg.cn/ddqr) that provides three functions: retrieval of DNA barcode sequences, encoding of DNA barcode sequences into DDQR codes, and decoding of DDQR codes into DNA barcode sequences. The DDQR algorithm and the web server will be invaluable for applying DNA barcode technology in the food and traditional medicine industries.
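The binary-coding-then-Base64 idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the DDQR implementation: the 2-bit mapping, the one-byte padding header, and the function names `encode_dna` and `decode_dna` are assumptions made for illustration, and the Huffman variant and the QR step are omitted.

```python
import base64

# Assumed 2-bit mapping; the abstract does not state the actual code table.
BASE_TO_BITS = {"A": 0b00, "C": 0b01, "G": 0b10, "T": 0b11}
BITS_TO_BASE = "ACGT"


def encode_dna(seq: str) -> str:
    """Pack each base into 2 bits, then express the bytes as Base64 text."""
    bits = 0
    for base in seq.upper():
        bits = (bits << 2) | BASE_TO_BITS[base]
    pad = (-len(seq)) % 4          # filler bases needed to complete the last byte
    bits <<= 2 * pad               # left-align the sequence within whole bytes
    n_bytes = (len(seq) + pad) // 4
    # Hypothetical header: one byte recording how many filler bases were added.
    payload = bytes([pad]) + bits.to_bytes(n_bytes, "big")
    return base64.b64encode(payload).decode("ascii")


def decode_dna(text: str) -> str:
    """Reverse encode_dna: Base64 text back to the original base string."""
    payload = base64.b64decode(text)
    pad, body = payload[0], payload[1:]
    bases = []
    for byte in body:
        for shift in (6, 4, 2, 0):
            bases.append(BITS_TO_BASE[(byte >> shift) & 0b11])
    return "".join(bases[: len(bases) - pad])
```

Under these assumptions, four bases fit in one byte and three bytes become four Base64 characters, so a sequence of N bases needs roughly N/3 characters, which is what makes short barcodes small enough for a QR code. Ambiguity codes such as N are not handled in this sketch.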
Abstract:
Users' search performance indicates how effectively and successfully their information needs are met, and it is calculated from relevance judgments made by the users themselves. This study explores the prediction of users' search performance in the context of cross-device search. A user experiment was performed to collect users' relevance judgments and search behaviors in cross-device search. Based on these relevance judgments, search performance was evaluated by calculating the percentage of valid clicks, effective search time, nDCG@n, and satisfaction. A simple linear regression model was adopted as the prediction model. The final results showed that combining users' search performance in pre-switch sessions with their search behavior in post-switch sessions attains the best prediction accuracy. The features found to be important for predicting search performance in cross-device search shed light on how search systems can be improved to help users complete tasks efficiently.
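Of the performance measures listed above, nDCG@n is the most formula-driven, so a small sketch may help. The abstract does not say which gain formulation the study used; the version below assumes linear gain with a log2 rank discount, and the function names are illustrative.

```python
import math


def dcg_at_n(relevances, n):
    """Discounted cumulative gain over the top-n results (linear gain assumed)."""
    return sum(
        rel / math.log2(rank + 1)
        for rank, rel in enumerate(relevances[:n], start=1)
    )


def ndcg_at_n(relevances, n):
    """DCG normalized by the DCG of the ideal (descending) ordering."""
    ideal = dcg_at_n(sorted(relevances, reverse=True), n)
    return dcg_at_n(relevances, n) / ideal if ideal > 0 else 0.0
```

Here `relevances` is the list of a user's relevance judgments in the order the results were ranked. A ranking that already places the most relevant results first scores 1.0, and any other ordering of the same judgments scores lower.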
Abstract:
To reduce the conceptual ambiguity in interdisciplinary knowledge organization systems (KOSs) and enhance interdisciplinary KOS management, this paper proposes a framework for interdisciplinary semantic drift (ISD) detection based on the normal cloud model (NCM). In this framework, we first analyze the features of interdisciplinary concepts and propose a novel interdisciplinary concept extraction method based on cross-discipline statistical information. Secondly, the high-performance knowledge representation model NCM is adopted to represent each interdisciplinary concept with uncertainty, and a new ISD degree calculation method is then proposed based on the similarity cloud algorithm. Thirdly, to identify the direction of ISD after the degree calculation, we propose an ISD direction identification method according to the theory of knowledge potential energy (KPE). Fourthly, based on the above procedure, we propose an ISD detection algorithm to identify and visualize the ISD process. Finally, we evaluate the proposed framework on the concept of "information entropy" and compare its performance with three baselines. Experimental results demonstrate that our framework outperforms all the baselines, and the result is comparable to experts' judgments (0.808 on Spearman correlation, p<0.001). The research indicates that the meaning of an interdisciplinary concept will drift from the high-KPE discipline to the low-KPE discipline as long as interdisciplinary knowledge potential differences (KPD) exist between the two related disciplines. We further identify three key factors that affect the degree of ISD: the length of the discipline chain along which an interdisciplinary concept is transferred, the number of source disciplines that an interdisciplinary concept comes from, and the knowledge distance between the source discipline and the target discipline. © 2023 The Author(s). Published by Elsevier B.V. on behalf of King Saud University. This is an open access