Rational expressions¶

# We disable autosave for technical reasons.
# Replace 0 by 120 in next line to restore default.
%autosave 0

Autosave disabled

import awalipy # If import fails, check that 
               # Python version used as Jupyter
               # kernel matches the one
               # Awalipy was compiled with.

[Warning] The python module awalipy relies on compilation executed "on-the-fly" depending on the context (type of weights, of labels, etc.). As a result, the very first call to a given function in a given context may take up to 10 seconds.

Creating a RatExp¶

When parsing a rational expression operator precedence is : star > concatenation > union . In other words,

a+(b*) = a+b* != (a+b)*
a(b*) = ab* != (ab)*
a+(bc) = a+bc != (a+b)c

e = awalipy.RatExp("(a+bc)c*(ab)*")
e

(a+bc)c*(ab)*

By default, the alphabet of a rational expression is the set of all letters appearing in it. However the alphabet may be increased artifically as follows.

f = awalipy.RatExp("(a+b)(c*+a)*", alphabet="abcd")
f

(a+b)(c*+a)*

Displaying a rational expression as a tree.

e.display()

Union¶

e+f

(a+bc)c*(ab)*+(a+b)(c*+a)*

e+=e
e

(a+bc)c*(ab)*+(a+bc)c*(ab)*

Concatenation¶

e^f

((a+bc)c*(ab)*+(a+bc)c*(ab)*)((a+b)(c*+a)*)

e^="abc*"
e

((a+bc)c*(ab)*+(a+bc)c*(ab)*)(abc*)

Star¶

e.star()

(((a+bc)c*(ab)*+(a+bc)c*(ab)*)(abc*))*

e.star_here()
e

(((a+bc)c*(ab)*+(a+bc)c*(ab)*)(abc*))*

Star normal form and star height¶

e.star_height()

2

e.star_normal_form()

(((a+bc)c*(ab)*+(a+bc)c*(ab)*)(abc*))*

Expand¶

The method expand distribute union and concatenation as much as possible.

awalipy.RatExp("(a+bc)(d+e)(f+g)*").expand()

ad(f+g)*+ae(f+g)*+bcd(f+g)*+bce(f+g)*

Expressions to automata¶

By default, awali uses the derived term algorithm.

A = e.exp_to_aut()
A.display()

The states of A are indeed all the derived expressions of e. It may be displayed by setting to True the optional argument history.

A.display(horizontal=False,history=True)

For convenience, one may give an expression to the constructor of an automaton. Derived term is called.

A = awalipy.Automaton(awalipy.RatExp("01*0*"))
A.display()

Awali implements other algorithms for transforming expressions to automata, such as thompson or standard

g = awalipy.RatExp("1*0")
g.thompson().display()

g.standard().display()

Weighted rational expression¶

Weights must be put between "<>" and weights takes precedence over other operators:

<-1>a* = (<-1>a)* != <-1>(a*)
<-1>ab = (<-1>a)b != <-1>(ab)
<-1>a+b = (<-1>a)+b != <-1>(a+b)

The weighset must be given as a second argument at creation.

h = awalipy.RatExp("(<1>a*+<-1>(b*))","Z")
h

a*+<-1>(b*)

h.display()

For the sake of convenience, a weight alone (ie. "<-1>") is considered as a valid representation of the word epsilon with the given weight (ie. "<-1>\e").

awalipy.RatExp("<-2>","Z")

<-2>\e

Union, concatenation and star works in the same way for weighted rational expressions.

i = h ^ h + ("<-1>" ^ h).star()
i

(a*+<-1>(b*))(a*+<-1>(b*)+<-1>(a*+<-1>(b*))*)

Weighted expression to weighted automaton¶

For aut_to_exp or standard to work, the rational expression needs to be valid. An expression is valid if, in every sub-expression, the weight of $\epsilon$ is well defined. For instance the expression *(< 2 >\e)** is not valid (with weightset $(\mathbb{Z},+,\times$))

i.is_valid()

True

i.exp_to_aut().display()

The method thompson() is not suitable for weighted expressions.

Indeed, let us consider the following valid expression g:

g = awalipy.RatExp("(<1>(a*)+<-1>(b*))*","Z")
g.is_valid()

True

G = g.thompson()
G.display(horizontal=False)

In this case, thompson produces an automaton that is not valid.

G.is_valid()

False

Other functions¶

The method constant_term gives the weight of epsilon

j = awalipy.RatExp("(<1/4>(a*)+<1/4>(b*))*","Q")
j

(<1/4>(a*)+<1/4>(b*))*

j.constant_term()

'2'

j.get_weightset()

Q