Пожалуйста, используйте этот идентификатор, чтобы цитировать или ссылаться на этот ресурс: http://elar.urfu.ru/handle/10995/73934
Полная запись метаданных
Поле DCЗначениеЯзык
dc.contributor.authorZenkov, A. V.en
dc.date.accessioned2019-06-26T07:41:37Z-
dc.date.available2019-06-26T07:41:37Z-
dc.date.issued2017-
dc.identifier.citationZenkov A. V. A novel method of stylometry based on the statistic of numerals / A. V. Zenkov // Computer Research and Modeling. — 2017. — Vol. 9. — Iss. 5. — P. 837-850. — DOI: 10.20537/2076-7633-2017-9-5-837-850.en
dc.identifier.issn2076-7633-
dc.identifier.issn2077-6853-
dc.identifier.otherhttp://crm.ics.org.ru/uploads/crmissues/crm_2017_5/2017_05_12.pdfpdf
dc.identifier.other1good_DOI
dc.identifier.other89960eec-485b-447e-b771-df84fc3d420bpure_uuid
dc.identifier.otherhttp://www.scopus.com/inward/record.url?partnerID=8YFLogxK&scp=85044141384m
dc.identifier.urihttp://elar.urfu.ru/handle/10995/73934-
dc.description.abstractA new method of statistical analysis of texts is suggested. The frequency distribution of the first significant digits in numerals of English-language texts is considered. We have taken into account cardinal as well as ordinal numerals expressed both in figures, and verbally. To identify the author's use of numerals, we previously deleted from the text all idiomatic expressions and set phrases accidentally containing numerals, as well as itemizations and page numbers, etc. Benford's law is found to hold approximately for the frequencies of various first significant digits of compound literary texts by different authors; a marked predominance of the digit 1 is observed. In coherent authorial texts, characteristic deviations from Benford's law arise which are statistically stable significant author peculiarities that allow, under certain conditions, to consider the problem of authorship and distinguish between texts by different authors. The text should be large enough (at least about 200 kB). At the end of {1, 2, ⋯, 9} digits row, the frequency distribution is subject to strong fluctuations and thus unrepresentative for our purpose. The aim of the theoretical explanation of the observed empirical regularity is not intended, which, however, does not preclude the applicability of the proposed methodology for text attribution. The approach suggested and the conclusions are backed by the examples of the computer analysis of works by W.M. Thackeray, M. Twain, R. L. Stevenson, J. Joyce, sisters Bront.e, and J.Austen. On the basis of technique suggested, we examined the authorship of a text earlier ascribed to L. F. Baum (the result agrees with that obtained by different means). We have shown that the authorship of Harper Lee's "To Kill a Mockingbird" pertains to her, whereas the primary draft, "Go Set a Watchman", seems to have been written in collaboration with Truman Capote. All results are confirmed on the basis of parametric Pearson's chi-squared test as well as non-parametric Mann-Whitney U test and Kruskal-Wallis test. Copyright © 2018 Institute of Computer Science.en
dc.format.mimetypeapplication/pdfen
dc.language.isoruen
dc.publisherIzhevsk Institute of Computer Scienceen
dc.rightscc-by-ndother
dc.sourceComputer Research and Modelingen
dc.subjectFIRST SIGNIFICANT DIGIT OF NUMERALSen
dc.subjectTEXT ATTRIBUTIONen
dc.titleA novel method of stylometry based on the statistic of numeralsen
dc.typeinfo:eu-repo/semantics/publishedVersionen
dc.typeinfo:eu-repo/semantics/articleen
dc.typeArticleen
dc.identifier.rsi30604788-
dc.identifier.doi10.20537/2076-7633-2017-9-5-837-850-
dc.identifier.scopus85044141384-
local.affiliationUral Federal University, Mira st. 19, Ekaterinburg, 620002, Russian Federationen
local.affiliationUral State University of Economics, 8th of March st. 62, Ekaterinburg, 620144, Russian Federationen
local.contributor.employeeЗенков Андрей Вячеславовичru
local.description.firstpage837-
local.description.lastpage850-
local.issue5-
local.volume9-
local.contributor.departmentИнститут "Высшая школа экономики и менеджмента"ru
local.identifier.pure7018732-
local.identifier.eid2-s2.0-85044141384-
Располагается в коллекциях:Научные публикации ученых УрФУ, проиндексированные в SCOPUS и WoS CC

Файлы этого ресурса:
Файл Описание РазмерФормат 
10.20537-2076-7633-2017-9-5-837-850.pdf19,9 MBAdobe PDFПросмотреть/Открыть


Все ресурсы в архиве электронных ресурсов защищены авторским правом, все права сохранены.