The Normalized Edit Distance with Uniform Operation Costs Is a Metric

Dana Fisman, Joshua Grogin, Oded Margalit, Gera Weiss

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

We prove that the normalized edit distance proposed in [Marzal and Vidal 1993] is a metric when the cost of all the edit operations are the same. This closes a long standing gap in the literature where several authors noted that this distance does not satisfy the triangle inequality in the general case, and that it was not known whether it is satisfied in the uniform case - where all the edit costs are equal. We compare this metric to two normalized metrics proposed as alternatives in the literature, when people thought that Marzal's and Vidal's distance is not a metric, and identify key properties that explain why the original distance, now known to also be a metric, is better for some applications. Our examination is from a point of view of formal verification, but the properties and their significance are stated in an application agnostic way.

Original languageAmerican English
Title of host publication33rd Annual Symposium on Combinatorial Pattern Matching, CPM 2022
EditorsHideo Bannai, Jan Holub
PublisherSchloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing
ISBN (Electronic)9783959772341
DOIs
StatePublished - 1 Jun 2022
Event33rd Annual Symposium on Combinatorial Pattern Matching, CPM 2022 - Prague, Czech Republic
Duration: 27 Jun 202229 Jun 2022

Publication series

NameLeibniz International Proceedings in Informatics, LIPIcs
Volume223

Conference

Conference33rd Annual Symposium on Combinatorial Pattern Matching, CPM 2022
Country/TerritoryCzech Republic
CityPrague
Period27/06/2229/06/22

Keywords

  • edit distance
  • metric
  • normalized distance
  • triangle inequality

All Science Journal Classification (ASJC) codes

  • Software

Fingerprint

Dive into the research topics of 'The Normalized Edit Distance with Uniform Operation Costs Is a Metric'. Together they form a unique fingerprint.

Cite this