Unambiguous finite automaton explained

In automata theory, an unambiguous finite automaton (UFA) is a nondeterministic finite automaton (NFA) such that each word has at most one accepting path. Each deterministic finite automaton (DFA) is a UFA, but not vice versa. DFAs, UFAs, and NFAs recognize exactly the same class of formal languages. On the one hand, an NFA can be exponentially smaller than an equivalent DFA. On the other hand, some problems are easily solved on DFAs and not on UFAs. For example, given an automaton $A$, an automaton $A'$ which accepts the complement of the language of $A$ can be computed in linear time when $A$ is a DFA, whereas it is known that this cannot be done in polynomial time for UFAs. Hence UFAs are a mix of the worlds of DFAs and NFAs; in some cases, they lead to smaller automata than DFAs and quicker algorithms than NFAs.

Formal definition

An NFA is represented formally by a 5-tuple, $A=(Q,\Sigma,\Delta,q_0,F)$. A UFA is an NFA such that, for each word $w=a_1a_2\ldots a_n$, there exists at most one sequence of states $r_0,r_1,\ldots,r_n$ in $Q$ with the following conditions:

  1. $r_0=q_0$;
  2. $r_{i+1}\in\Delta(r_i,a_{i+1})$ for $i=0,\ldots,n-1$;
  3. $r_n\in F$.

In words, those conditions state that, if $w$ is accepted by $A$, there is exactly one accepting path, that is, one path from an initial state to a final state that is labelled by $w$.
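The definition can be made concrete by counting the sequences of states that satisfy the three conditions. The following is a minimal sketch, assuming a hypothetical encoding in which $\Delta$ is a dictionary from (state, letter) pairs to sets of successor states; the names `count_accepting_runs`, `ufa` and `nfa` are illustrative only. An automaton is unambiguous exactly when this count never exceeds 1.

```python
def count_accepting_runs(delta, q0, finals, word):
    """Count sequences r_0..r_n with r_0 = q0,
    r_{i+1} in delta(r_i, a_{i+1}) and r_n in finals."""
    runs = {q0: 1}  # runs[q] = number of runs on the prefix read so far ending in q
    for letter in word:
        nxt = {}
        for q, k in runs.items():
            for r in delta.get((q, letter), ()):
                nxt[r] = nxt.get(r, 0) + k
        runs = nxt
    return sum(k for q, k in runs.items() if q in finals)


# Two automata for "words over {a, b} that end with a" (illustrative examples).
# ufa: p loops on every letter and guesses the last letter with p -a-> f.
ufa = {("p", "a"): {"p", "f"}, ("p", "b"): {"p"}}
# nfa: same language, but f also loops on a, so the guess can be made early.
nfa = {("p", "a"): {"p", "f"}, ("p", "b"): {"p"}, ("f", "a"): {"f"}}

print(count_accepting_runs(ufa, "p", {"f"}, "baaa"))  # 1
print(count_accepting_runs(nfa, "p", {"f"}, "baaa"))  # 3
```

Both automata accept the same words, but only the first one has at most one accepting run per word.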

Example

Let $L$ be the set of words over the alphabet $\{a,b\}$ whose $n$th last letter is an $a$. The figures show a DFA and a UFA accepting this language for $n=2$.

The minimal DFA accepting $L$ has $2^n$ states, one for each subset of $\{1,\ldots,n\}$. There is a UFA with $n+1$ states which accepts $L$: it guesses the $n$th last letter, and then verifies that exactly $n-1$ letters remain. It is indeed unambiguous, as there exists only one $n$th last letter.
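The size gap in this example can be observed on small instances. The sketch below assumes the two-letter alphabet {a, b} and numbers the UFA states 0 to n (both are illustrative choices). It builds the guessing automaton and counts the subsets reached by the classical subset construction. That count is only an upper bound on the size of the minimal DFA, but for this language it coincides with $2^n$.

```python
def nth_last_ufa(n):
    """UFA with states 0..n for 'the n-th last letter is a' over {a, b}."""
    delta = {(0, "a"): {0, 1}, (0, "b"): {0}}  # state 0 waits, or guesses on a
    for i in range(1, n):
        delta[(i, "a")] = {i + 1}  # after the guess, count the n-1 remaining letters
        delta[(i, "b")] = {i + 1}
    return delta, 0, {n}


def reachable_subsets(delta, q0, alphabet="ab"):
    """States of the subset construction reachable from {q0}."""
    start = frozenset({q0})
    seen = {start}
    stack = [start]
    while stack:
        subset = stack.pop()
        for letter in alphabet:
            target = frozenset(
                r for q in subset for r in delta.get((q, letter), ())
            )
            if target not in seen:
                seen.add(target)
                stack.append(target)
    return seen


for n in (1, 2, 3, 4):
    delta, q0, finals = nth_last_ufa(n)
    print(n, n + 1, len(reachable_subsets(delta, q0)))
# 1 2 2
# 2 3 4
# 3 4 8
# 4 5 16
```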

Inclusion, universality, equivalence

Three problems that are PSPACE-hard for general NFAs but belong to PTIME for DFAs are now considered.

Inclusion

It is decidable in polynomial time whether a UFA's language is a subset of another UFA's language.

Let $A$ and $B$ be two UFAs, and let $L(A)$ and $L(B)$ be the languages accepted by those automata. Then $L(A)\subseteq L(B)$ if and only if $L(A\cap B)=L(A)$, where $A\cap B$ denotes the Cartesian product automaton, which can be proven to be also unambiguous. Now, $L(A\cap B)$ is a subset of $L(A)$ by construction; hence both sets are equal if and only if, for each length $n\in\mathbb{N}$, the number of words of length $n$ in $L(A\cap B)$ is equal to the number of words of length $n$ in $L(A)$. It can be proved that it is sufficient to check each $n$ up to the product of the number of states of $A$ and $B$.

The number of words of length n accepted by an automaton can be computed in polynomial time using dynamic programming, which ends the proof.[1]
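The argument above can be sketched as follows, assuming both inputs are already known to be unambiguous and are encoded as hypothetical tuples `(delta, initial, finals, number_of_states)`. Because each accepted word has exactly one accepting run, counting runs counts words. The length bound used here, $|Q_A| + |Q_A|\cdot|Q_B|$, is a deliberately conservative assumption of this sketch and not the bound stated in the text. It is safe because both counting sequences satisfy linear recurrences whose orders are at most $|Q_A|$ and $|Q_A|\cdot|Q_B|$ respectively.

```python
def count_words(delta, q0, finals, max_len):
    """counts[n] = number of accepting runs of length n, which equals the
    number of accepted words of length n when the automaton is unambiguous."""
    runs = {q0: 1}
    counts = []
    for _ in range(max_len + 1):
        counts.append(sum(k for q, k in runs.items() if q in finals))
        nxt = {}
        for (q, _letter), succs in delta.items():
            k = runs.get(q, 0)
            if k:
                for r in succs:
                    nxt[r] = nxt.get(r, 0) + k
        runs = nxt
    return counts


def product(delta_a, delta_b):
    """Cartesian product automaton: both components read the same letter."""
    prod = {}
    for (p, x), succ_a in delta_a.items():
        for (q, y), succ_b in delta_b.items():
            if x == y:
                prod[((p, q), x)] = {(r, s) for r in succ_a for s in succ_b}
    return prod


def ufa_included(a, b):
    """Decide L(A) ⊆ L(B) for two UFAs by comparing word counts."""
    delta_a, init_a, fin_a, size_a = a
    delta_b, init_b, fin_b, size_b = b
    delta_p = product(delta_a, delta_b)
    fin_p = {(p, q) for p in fin_a for q in fin_b}
    bound = size_a + size_a * size_b  # conservative bound (assumption)
    return (count_words(delta_a, init_a, fin_a, bound)
            == count_words(delta_p, (init_a, init_b), fin_p, bound))


# Illustrative UFAs over {a, b}: words ending with "aa" and words ending with "a".
ends_aa = ({(0, "a"): {0, 1}, (0, "b"): {0}, (1, "a"): {2}}, 0, {2}, 3)
ends_a = ({("p", "a"): {"p", "f"}, ("p", "b"): {"p"}}, "p", {"f"}, 2)

print(ufa_included(ends_aa, ends_a))  # True
print(ufa_included(ends_a, ends_aa))  # False
```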

Universality, equivalence

The problems of universality[2] and equivalence[3] also belong to PTIME, by reduction to the inclusion problem.

Checking whether an automaton is unambiguous

For a nondeterministic finite automaton $A$ with $n$ states and an $m$-letter alphabet, it is decidable in time $O(n^2m)$ whether $A$ is unambiguous.[4] It suffices to use a fixpoint algorithm to compute the set of pairs of states $q$ and $q'$ such that there exists a word $w$ which leads both to $q$ and to $q'$ along two distinct paths. The automaton is unambiguous if and only if there is no such pair in which both states are accepting. There are $\Theta(n^2)$ state pairs, and for each pair there are $m$ letters to consider to resume the fixpoint algorithm, hence the computation time.
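A pair-based check of this kind can be sketched as follows. The sketch assumes an NFA without ε-transitions, with a single initial state, encoded as a dictionary from (state, letter) to sets of successors; the function name and encoding are illustrative. It uses a closely related formulation rather than the exact procedure of the cited reference: the automaton is ambiguous exactly when some pair of two different states is reachable from $(q_0,q_0)$ by a common word and can still reach a pair of accepting states by a common word. This sketch is not tuned to meet the stated time bound.

```python
def is_unambiguous(delta, q0, finals):
    letters = {letter for (_state, letter) in delta}

    # Forward fixpoint: pairs of states reachable by reading the same word.
    start = (q0, q0)
    reach = {start}
    pred = {}  # reverse edges between reachable pairs
    stack = [start]
    while stack:
        p, q = stack.pop()
        for x in letters:
            for r in delta.get((p, x), ()):
                for s in delta.get((q, x), ()):
                    pred.setdefault((r, s), set()).add((p, q))
                    if (r, s) not in reach:
                        reach.add((r, s))
                        stack.append((r, s))

    # Backward fixpoint: reachable pairs that can still reach two accepting states.
    useful = {(p, q) for (p, q) in reach if p in finals and q in finals}
    stack = list(useful)
    while stack:
        pair = stack.pop()
        for prev in pred.get(pair, ()):
            if prev not in useful:
                useful.add(prev)
                stack.append(prev)

    # Two distinct accepting runs on one word must differ at some position,
    # which yields a useful pair of two different states.
    return all(p == q for (p, q) in useful)


ufa = {("p", "a"): {"p", "f"}, ("p", "b"): {"p"}}
nfa = {("p", "a"): {"p", "f"}, ("p", "b"): {"p"}, ("f", "a"): {"f"}}
# Two paths on "ab" that split and then merge in the same accepting state.
diamond = {(0, "a"): {1, 2}, (1, "b"): {3}, (2, "b"): {3}}

print(is_unambiguous(ufa, "p", {"f"}))    # True
print(is_unambiguous(nfa, "p", {"f"}))    # False
print(is_unambiguous(diamond, 0, {3}))    # False
```

The last example shows why it is not enough to look for two different accepting states: both runs on "ab" end in the same state.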

Some properties

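Counting the words of length $n$ accepted by a UFA can be done in polynomial time by dynamic programming: for each state $q$ and each $i\in\{0,\ldots,n\}$, compute the number of words of size $n-i$ having a run starting at $q$ and ending in a final state. By contrast, the same problem is #P-hard for NFAs.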
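The table described above can be sketched as follows, with a hypothetical dictionary encoding of the transition relation. Strictly speaking the table counts runs. For a UFA each accepted word has exactly one accepting run, so the entry for the initial state at $i=0$ is the number of accepted words of length $n$. For an ambiguous NFA the same table overcounts, which the brute-force comparison at the end illustrates.

```python
from itertools import product


def runs_to_final(delta, states, finals, n):
    """table[i][q] = number of runs of length n - i from q to a final state."""
    table = [dict() for _ in range(n + 1)]
    table[n] = {q: int(q in finals) for q in states}
    for i in range(n - 1, -1, -1):
        for q in states:
            table[i][q] = sum(
                table[i + 1][r]
                for (p, _letter), succs in delta.items() if p == q
                for r in succs
            )
    return table


def accepted_words(delta, q0, finals, alphabet, n):
    """Exponential brute force: words of length n with at least one accepting run."""
    total = 0
    for word in product(alphabet, repeat=n):
        current = {q0}
        for letter in word:
            current = {r for q in current for r in delta.get((q, letter), ())}
        total += bool(current & finals)
    return total


# Same language (words over {a, b} ending with a), unambiguous and ambiguous.
ufa = {("p", "a"): {"p", "f"}, ("p", "b"): {"p"}}
nfa = {("p", "a"): {"p", "f"}, ("p", "b"): {"p"}, ("f", "a"): {"f"}}

print(runs_to_final(ufa, {"p", "f"}, {"f"}, 3)[0]["p"])  # 4
print(runs_to_final(nfa, {"p", "f"}, {"f"}, 3)[0]["p"])  # 7
print(accepted_words(nfa, "p", {"f"}, "ab", 3))          # 4
```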

State complexity

See main article: State complexity.

Mathematical proofs that every UFA for a language needs a certain number of states were pioneered by Schmidt.[6] Leung proved that a DFA equivalent to an $n$-state UFA requires $2^n$ states in the worst case, and that a UFA equivalent to a finitely ambiguous[7] $n$-state NFA requires $2^n-1$ states in the worst case.[8]

Jirásek, Jirásková and Šebej[9] researched the state complexity of basic regular operations on languages represented by UFAs. They proved in particular that for every $n$-state UFA where $n\geq 7$, the complement of the language it accepts is accepted by a UFA with at most $2^{0.79n+\log n}$ states. This result was later improved by Indzhev and Kiefer[10] to at most $\sqrt{n+1}\cdot 2^{0.5n}$ states for all $n\geq 0$.

Raskin[11] showed that UFAs cannot be complemented in polynomial time, even into NFAs: in the worst case, complementing a UFA with $n$ states into an NFA requires a superpolynomial number of states. This lower bound was later improved by Göös, Kiefer, and Yuan.[12]

For a one-letter alphabet, Okhotin proved that a DFA equivalent to an $n$-state UFA requires $\exp\left(\Theta\left(\sqrt[3]{n(\ln n)^2}\right)\right)$ states in the worst case.[13]

References

  1. Christof Löding, Unambiguous Finite Automata, Slide 8
  2. i.e.: given a UFA, does it accept every string of Σ*?
  3. i.e.: given two UFAs, do they accept the same set of strings?
  4. Sakarovitch, Jacques (October 2009). Elements of Automata Theory. Translated by Thomas, Reuben. Cambridge: Cambridge University Press. p. 75. ISBN 978-0-521-84425-3.
  5. Christof Löding, Unambiguous Finite Automata, Slide 8
  6. Schmidt, Erik M. (1978). Succinctness of Description of Context-Free, Regular and Unambiguous Languages (Ph.D. thesis). Cornell University.
  7. Having finitely many accepting paths for every accepted word.
  8. Leung, Hing (2005). "Descriptional complexity of NFA of different ambiguity". International Journal of Foundations of Computer Science. 16 (5): 975–984. ISSN 0129-0541. doi:10.1142/S0129054105003418.
  9. Jirásek, Jozef; Jirásková, Galina; Šebej, Juraj (2016). "Operations on Unambiguous Finite Automata". Developments in Language Theory. Lecture Notes in Computer Science. Vol. 9840. pp. 243–255. ISSN 0302-9743. ISBN 978-3-662-53131-0. doi:10.1007/978-3-662-53132-7_20.
  10. Indzhev, Emil; Kiefer, Stefan (2021). "On Complementing Unambiguous Automata and Graphs With Many Cliques and Cocliques". arXiv:2105.07470 [cs.FL].
  11. Raskin, Mikhail (2018). "A Superpolynomial Lower Bound for the Size of Non-Deterministic Complement of an Unambiguous Automaton". Schloss Dagstuhl - Leibniz-Zentrum für Informatik. doi:10.4230/LIPIcs.ICALP.2018.138.
  12. Göös, Mika; Kiefer, Stefan; Yuan, Weiqiang (2022). "Lower Bounds for Unambiguous Automata via Communication Complexity". Schloss Dagstuhl - Leibniz-Zentrum für Informatik. doi:10.4230/LIPIcs.ICALP.2022.126.
  13. Okhotin, Alexander (2012). "Unambiguous finite automata over a unary alphabet". Information and Computation. 212: 15–36. ISSN 0890-5401. doi:10.1016/j.ic.2012.01.003.