Skip to main navigation Skip to search Skip to main content

An encoding for order-preserving matching

  • Travis Gagie
  • , Giovanni Manzini
  • , Rossano Venturini

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Encoding data structures store enough information to answer the queries they are meant to support but not enough to recover their underlying datasets. In this paper we give the first encoding data structure for the challenging problem of order-preserving pattern matching. This problem was introduced only a few years ago but has already attracted significant attention because of its applications in data analysis. Two strings are said to be an order-preserving match if the relative order of their characters is the same: E.g., 4, 1, 3, 2 and 10, 3, 7, 5 are an order preserving match. We show how, given a string S[1..n] over an arbitrary alphabet of size σ and a constant c ≥ 1, we can build an O(n log log n)-bit encoding such that later, given a pattern P[1..m] with m ≤ logcn, we can return the number of order-preserving occurrences of P in S in O(m) time. Within the same time bound we can also return the starting position of some order preserving match for P in S (if such a match exists). We prove that our space bound is within a constant factor of optimal if logσ=ω (log log n); our query time is optimal if logσ= (log n). Our space bound contrasts with the ω (n log n) bits needed in the worst case to store S itself, an index for order-preserving pattern matching with no restrictions on the pattern length, or an index for standard pattern matching even with restrictions on the pattern length. Moreover, we can build our encoding knowing only how each character compares to O(logc n) neighbouring characters.

Original languageEnglish
Title of host publication25th European Symposium on Algorithms, ESA 2017
EditorsChristian Sohler, Christian Sohler, Kirk Pruhs
PublisherSchloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing
ISBN (Electronic)9783959770491
DOIs
Publication statusPublished - 1 Sept 2017
Externally publishedYes
Event25th European Symposium on Algorithms, ESA 2017 - Vienna, Austria
Duration: 4 Sept 20176 Sept 2017

Publication series

NameLeibniz International Proceedings in Informatics, LIPIcs
Volume87
ISSN (Print)1868-8969

Conference

Conference25th European Symposium on Algorithms, ESA 2017
Country/TerritoryAustria
CityVienna
Period4/09/176/09/17

Keywords

  • Compact data structures
  • Encodings
  • Order-preserving matching

Fingerprint

Dive into the research topics of 'An encoding for order-preserving matching'. Together they form a unique fingerprint.

Cite this