Efficient Enumeration of Regular Expressions for Faster Regular Expression Synthesis

Su Hyeon Kim, Hyeonseung Im, Sang Ki Ko

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations

Abstract

We study the problem of synthesizing regular expressions from a set of positive and negative strings. The previous synthesis algorithm proposed by Lee et al. [12] relies on the best-first enumeration of regular expressions. To improve the performance of the enumeration process, we define a new normal form of regular expressions called the concise normal form which allows us to significantly reduce the search space by pruning those not in the normal form while still capturing the whole class of regular languages. We conduct experiments with two benchmark datasets and demonstrate that our synthesis algorithm based on the proposed normal form outperforms the previous algorithm in terms of runtime complexity and scalability.

Original languageEnglish
Title of host publicationImplementation and Application of Automata - 25th International Conference, CIAA 2021, Proceedings
EditorsSebastian Maneth
PublisherSpringer Science and Business Media Deutschland GmbH
Pages65-76
Number of pages12
ISBN (Print)9783030791209
DOIs
StatePublished - 2021
Event25th International Conference on Implementation and Application of Automata, CIAA 2021 - Virtual, Online
Duration: 19 Jul 202122 Jul 2021

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume12803 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference25th International Conference on Implementation and Application of Automata, CIAA 2021
CityVirtual, Online
Period19/07/2122/07/21

Keywords

  • Enumerative search
  • Normal form
  • Program synthesis
  • Regular expression

Fingerprint

Dive into the research topics of 'Efficient Enumeration of Regular Expressions for Faster Regular Expression Synthesis'. Together they form a unique fingerprint.

Cite this