TY - JOUR
T1 - The edit-distance between a regular language and a context-free language
AU - Han, Yo Sub
AU - Ko, Sang Ki
AU - Salomaa, Kai
PY - 2013/11
Y1 - 2013/11
N2 - The edit-distance between two strings is the smallest number of operations required to transform one string into the other. The distance between languages L1 and L2 is the smallest edit-distance between string wi ∈ Li, i = 1, 2. We consider the problem of computing the edit-distance of a given regular language and a given context-free language. First, we present an algorithm that finds for the languages an optimal alignment, that is, a sequence of edit operations that transforms a string in one language to a string in the other. The length of the optimal alignment, in the worst case, is exponential in the size of the given grammar and finite automaton. Then, we investigate the problem of computing only the edit-distance of the languages without explicitly producing an optimal alignment. We design a polynomial time algorithm that calculates the edit-distance based on unary homomorphisms.
AB - The edit-distance between two strings is the smallest number of operations required to transform one string into the other. The distance between languages L1 and L2 is the smallest edit-distance between string wi ∈ Li, i = 1, 2. We consider the problem of computing the edit-distance of a given regular language and a given context-free language. First, we present an algorithm that finds for the languages an optimal alignment, that is, a sequence of edit operations that transforms a string in one language to a string in the other. The length of the optimal alignment, in the worst case, is exponential in the size of the given grammar and finite automaton. Then, we investigate the problem of computing only the edit-distance of the languages without explicitly producing an optimal alignment. We design a polynomial time algorithm that calculates the edit-distance based on unary homomorphisms.
KW - Context-free languages
KW - Edit-distance
KW - Levenshtein distance
KW - Regular languages
UR - http://www.scopus.com/inward/record.url?scp=84897653290&partnerID=8YFLogxK
U2 - 10.1142/S0129054113400315
DO - 10.1142/S0129054113400315
M3 - Article
AN - SCOPUS:84897653290
SN - 0129-0541
VL - 24
SP - 1067
EP - 1082
JO - International Journal of Foundations of Computer Science
JF - International Journal of Foundations of Computer Science
IS - 7
ER -