版权说明 操作指南
首页 > 成果 > 详情

Two halves of a meaningful text are statistically different

认领
导出
Link by DOI
反馈
分享
QQ微信 微博
成果类型:
期刊论文
作者:
Deng, Weibing;Xie, Rongrong;Deng, Shegfeng;Allahverdyan, Armen E.*
通讯作者:
Allahverdyan, Armen E.
作者机构:
[Deng, Shegfeng; Deng, Weibing; Xie, Rongrong] Cent China Normal Univ, Key Lab Quark & Lepton Phys MOE, Wuhan 430079, Peoples R China.
[Deng, Shegfeng; Deng, Weibing; Xie, Rongrong] Cent China Normal Univ, Inst Particle Phys, Wuhan 430079, Peoples R China.
[Allahverdyan, Armen E.] Yerevan Phys Inst, Alikhanian Bros St 2, Yerevan 375036, Armenia.
通讯机构:
[Allahverdyan, Armen E.] Y
Yerevan Phys Inst, Alikhanian Bros St 2, Yerevan 375036, Armenia.
语种:
英文
关键词:
data mining;inference in socio-economic system;scaling in socio-economic systems
期刊:
Journal of Statistical Mechanics: Theory and Experiment
ISSN:
1742-5468
年:
2021
卷:
2021
期:
3
基金类别:
Fundamental Research Funds for the Central UniversitiesFundamental Research Funds for the Central Universities; Program of Introducing Talents of Discipline to UniversitiesMinistry of Education, China - 111 Project [B08033]; National Natural Science Foundation of ChinaNational Natural Science Foundation of China (NSFC) [11505071, 11905163]; SCS of Armenia [18RF-015, 18T-1C090]
机构署名:
本校为第一机构
院系归属:
物理科学与技术学院
摘要:
Which statistical features distinguish a meaningful text (possibly written in an unknown system) from a meaningless set of symbols? Here we answer this question by comparing features of the first half of a text to its second half. This comparison can uncover hidden effects, because the halves have the same values of many parameters (style, genre, etc). We found that the first half has more different words and more rare words than the second half. Also, words in the first half are distributed less homogeneously over the text. These differences hold for the significant majority of several hundre...

反馈

验证码:
看不清楚,换一个
确定
取消

成果认领

标题:
用户 作者 通讯作者
请选择
请选择
确定
取消

提示

该栏目需要登录且有访问权限才可以访问

如果您有访问权限,请直接 登录访问

如果您没有访问权限,请联系管理员申请开通

管理员联系邮箱:yun@hnwdkj.com