Subtrajectory Clustering: Finding Set Covers for Set Systems of Subcurves

Authors

  • Frederik Brüning University of Bonn
  • Hugo Akitaya University of Massachusetts Lowel
  • Erin Chambers Saint Louis University
  • Anne Driemel University of Bonn

DOI:

https://doi.org/10.57717/cgt.v2i1.7

Abstract

We study subtrajectory clustering under the Fréchet distance. Given one or more trajectories, the task is to split the trajectories into several parts, such that the parts have a good clustering structure. We approach this problem via a new set cover formulation, which we think provides a natural formalization of the problem as it is studied in many applications. Given a polygonal curve P with n vertices in fixed dimension, integers k, ℓ ≥ 1, and a real value Δ > 0, the goal is to find k center curves of complexity at most ℓ such that every point on P is covered by a subtrajectory that has small Fréchet distance to one of the k center curves (≤ Δ). In many application scenarios, one is interested in finding clusters of small complexity, which is controlled by the parameter ℓ. Our main result is a bicriterial approximation algorithm: if there exists a solution for given parameters k, ℓ, and Δ, then our algorithm finds a set of k' center curves of complexity at most ℓ with covering radius Δ' with k' in O(kℓ2 log (kℓ)), and Δ' ≤ 19Δ. Moreover, within these approximation bounds, we can minimize k while keeping the other parameters fixed. If ℓ is a constant independent of n, then, the approximation factor for the number of clusters k is O(log k) and the approximation factor for the radius Δ is constant. In this case, the algorithm has expected running time in Õ(km2 + mn) and uses space in O(n + m), where m=⌈L/Δ⌉ and L is the total arclength of the curve P.

Downloads

Published

2023-02-23

How to Cite

Brüning, F., Akitaya, H., Chambers, E., & Driemel, A. (2023). Subtrajectory Clustering: Finding Set Covers for Set Systems of Subcurves. Computing in Geometry and Topology, 2(1), 1:1–1:48. https://doi.org/10.57717/cgt.v2i1.7

Issue

Section

Original Research Articles

Categories