Please use this identifier to cite or link to this item: http://elar.urfu.ru/handle/10995/111669
Full metadata record
DC FieldValueLanguage
dc.contributor.authorKosolobov, D.en
dc.contributor.authorValenzuela, D.en
dc.date.accessioned2022-05-12T08:20:18Z-
dc.date.available2022-05-12T08:20:18Z-
dc.date.issued2021-
dc.identifier.citationKosolobov D. Lempel-Ziv Parsing for Sequences of Blocks / D. Kosolobov, D. Valenzuela // Algorithms. — 2021. — Vol. 14. — Iss. 12. — 359.en
dc.identifier.issn1999-4893-
dc.identifier.otherAll Open Access, Gold, Green3
dc.identifier.urihttp://elar.urfu.ru/handle/10995/111669-
dc.description.abstractThe Lempel-Ziv parsing (LZ77) is a widely popular construction lying at the heart of many compression algorithms. These algorithms usually treat the data as a sequence of bytes, i.e., blocks of fixed length 8. Another common option is to view the data as a sequence of bits. We investigate the following natural question: what is the relationship between the LZ77 parsings of the same data interpreted as a sequence of fixed-length blocks and as a sequence of bits (or other “elementary” letters)? In this paper, we prove that, for any integer b > 1, the number z of phrases in the LZ77 parsing of a string of length n and the number zb of phrases in the LZ77 parsing of the same string in which blocks of length b are interpreted as separate letters (e.g., b = 8 in case of bytes) are related as zb = O(bz lognz ). The bound holds for both “overlapping” and “non-overlapping” versions of LZ77. Further, we establish a tight bound zb = O(bz) for the special case when each phrase in the LZ77 parsing of the string has a “phrase-aligned” earlier occurrence (an occurrence equal to the concatenation of consecutive phrases). The latter is an important particular case of parsing produced, for instance, by grammar-based compression methods. © 2021 by the authors. Licensee MDPI, Basel, Switzerland.en
dc.description.sponsorshipFunding: This research was funded by the Ministry of Science and Higher Education of the Russian Federation (Ural Mathematical Center project No. 075-02-2021-1387).en
dc.format.mimetypeapplication/pdfen
dc.language.isoenen
dc.publisherMDPIen1
dc.publisherMDPI AGen
dc.rightsinfo:eu-repo/semantics/openAccessen
dc.sourceAlgorithms2
dc.sourceAlgorithmsen
dc.subjectBLOCKSen
dc.subjectGRAMMARen
dc.subjectLEMPEL-ZIVen
dc.subjectLZ77en
dc.subjectSLPen
dc.subjectBLOCKen
dc.subjectCOMPRESSION ALGORITHMSen
dc.subjectCOMPRESSION METHODSen
dc.subjectGRAMMARen
dc.subjectGRAMMAR-BASED COMPRESSIONen
dc.subjectLEMPEL-ZIVen
dc.subjectLZ77en
dc.subjectSLPen
dc.subjectTIGHT BOUNDen
dc.titleLempel-Ziv Parsing for Sequences of Blocksen
dc.typeArticleen
dc.typeinfo:eu-repo/semantics/articleen
dc.typeinfo:eu-repo/semantics/publishedVersionen
dc.identifier.doi10.3390/a14120359-
dc.identifier.scopus85121426311-
local.contributor.employeeKosolobov, D., Department of Physical and Mathematical Sciences, Ural Federal University, Ekaterinburg, 620000, Russian Federation; Valenzuela, D., Department of Computer Science, University of Helsinki, Helsinki, FI-00014, Finlanden
local.issue12-
local.volume14-
dc.identifier.wos000913789500001-
local.contributor.departmentDepartment of Physical and Mathematical Sciences, Ural Federal University, Ekaterinburg, 620000, Russian Federation; Department of Computer Science, University of Helsinki, Helsinki, FI-00014, Finlanden
local.identifier.pure29146409-
local.description.order359-
local.identifier.eid2-s2.0-85121426311-
local.identifier.wosWOS:000913789500001-
Appears in Collections:Научные публикации ученых УрФУ, проиндексированные в SCOPUS и WoS CC

Files in This Item:
File Description SizeFormat 
2-s2.0-85121426311.pdf330,09 kBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.