Decision-theoretic rough sets explained

In the mathematical theory of decisions, decision-theoretic rough sets (DTRS) is a probabilistic extension of rough set classification. First created in 1990 by Dr. Yiyu Yao,[1] the extension makes use of loss functions to derive the region parameters $\alpha$ and $\beta$. Like rough sets, the lower and upper approximations of a set are used.

Definitions

The following contains the basic principles of decision-theoretic rough sets.

Conditional risk

Using the Bayesian decision procedure, the decision-theoretic rough set (DTRS) approach allows for minimum-risk decision making based on observed evidence. Let $A = \{a_1, \ldots, a_m\}$ be a finite set of $m$ possible actions and let $\Omega = \{w_1, \ldots, w_s\}$ be a finite set of $s$ states. $P(w_j \mid [x])$ is calculated as the conditional probability of an object $x$ being in state $w_j$ given the object description $[x]$. $\lambda(a_i \mid w_j)$ denotes the loss, or cost, for performing action $a_i$ when the state is $w_j$. The expected loss (conditional risk) associated with taking action $a_i$ is given by:

$$R(a_i \mid [x]) = \sum_{j=1}^{s} \lambda(a_i \mid w_j)\, P(w_j \mid [x]).$$
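The conditional risk formula can be illustrated with a minimal Python sketch. The function name `conditional_risk`, the list-of-lists layout of the loss table, and the example numbers are assumptions made for illustration; they do not come from the DTRS literature.

```python
from typing import List, Sequence


def conditional_risk(losses: Sequence[Sequence[float]],
                     posteriors: Sequence[float]) -> List[float]:
    """Return R(a_i | [x]) for every action a_i.

    losses[i][j] holds lambda(a_i | w_j) and posteriors[j] holds P(w_j | [x]).
    """
    if abs(sum(posteriors) - 1.0) > 1e-9:
        raise ValueError("posteriors must sum to 1")
    # Expected loss of each action: sum over states of loss times probability.
    return [sum(l * p for l, p in zip(row, posteriors)) for row in losses]


# Hypothetical example with two actions and three states.
losses = [[0.0, 2.0, 4.0],   # lambda(a_1 | w_j)
          [1.0, 1.0, 1.0]]   # lambda(a_2 | w_j)
posteriors = [0.5, 0.25, 0.25]

risks = conditional_risk(losses, posteriors)
# The Bayesian decision procedure picks the action with the smallest risk.
best_action = min(range(len(risks)), key=risks.__getitem__)
```

With these illustrative numbers the second action (index 1) has the lower expected loss, so it would be the minimum-risk choice.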

Object classification with the approximation operators can be fitted into the Bayesian decision framework. The set of actions is given by $A = \{a_P, a_N, a_B\}$, where $a_P$, $a_N$, and $a_B$ represent the three actions in classifying an object into POS($A$), NEG($A$), and BND($A$) respectively. To indicate whether an element is in $A$ or not in $A$, the set of states is given by $\Omega = \{A, A^c\}$. Let $\lambda(a_\diamond \mid A)$ denote the loss incurred by taking action $a_\diamond$ when an object belongs to $A$, and let $\lambda(a_\diamond \mid A^c)$ denote the loss incurred by taking the same action when the object belongs to $A^c$.

Loss functions

Let $\lambda_{PP}$ denote the loss for classifying an object in $A$ into the POS region, $\lambda_{BP}$ the loss for classifying an object in $A$ into the BND region, and $\lambda_{NP}$ the loss for classifying an object in $A$ into the NEG region. A loss function $\lambda_{\diamond N}$ denotes the loss of classifying an object that does not belong to $A$ into the region specified by $\diamond$.

Taking an individual action $a_\diamond$ is associated with the expected loss $R(a_\diamond \mid [x])$, which can be expressed as:

$$R(a_P \mid [x]) = \lambda_{PP}\, P(A \mid [x]) + \lambda_{PN}\, P(A^c \mid [x]),$$

$$R(a_N \mid [x]) = \lambda_{NP}\, P(A \mid [x]) + \lambda_{NN}\, P(A^c \mid [x]),$$

$$R(a_B \mid [x]) = \lambda_{BP}\, P(A \mid [x]) + \lambda_{BN}\, P(A^c \mid [x]),$$

where $\lambda_{\diamond P} = \lambda(a_\diamond \mid A)$, $\lambda_{\diamond N} = \lambda(a_\diamond \mid A^c)$, and $\diamond = P$, $N$, or $B$.
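The three expected losses can be evaluated directly and compared. The following Python sketch assumes a dictionary keyed by the subscripts of the loss functions ("PP", "PN", and so on); the helper name `three_way_risks` and the loss values are hypothetical and chosen only to make the arithmetic easy to follow.

```python
from typing import Dict


def three_way_risks(p_a: float, loss: Dict[str, float]) -> Dict[str, float]:
    """Return R(a_P | [x]), R(a_N | [x]) and R(a_B | [x]).

    p_a is P(A | [x]); loss maps 'PP', 'PN', 'NP', 'NN', 'BP', 'BN'
    to the corresponding lambda values.
    """
    if not 0.0 <= p_a <= 1.0:
        raise ValueError("p_a must be a probability")
    p_c = 1.0 - p_a  # P(A^c | [x])
    return {
        "P": loss["PP"] * p_a + loss["PN"] * p_c,
        "N": loss["NP"] * p_a + loss["NN"] * p_c,
        "B": loss["BP"] * p_a + loss["BN"] * p_c,
    }


# Hypothetical losses: correct decisions cost 0, deferring costs 1,
# and a wrong decision costs 4.
example_loss = {"PP": 0.0, "BP": 1.0, "NP": 4.0,
                "NN": 0.0, "BN": 1.0, "PN": 4.0}

risks_uncertain = three_way_risks(0.5, example_loss)
risks_confident = three_way_risks(0.875, example_loss)

# The minimum-risk action is the key with the smallest expected loss.
choice_uncertain = min(risks_uncertain, key=risks_uncertain.get)
choice_confident = min(risks_confident, key=risks_confident.get)
```

Under these assumed losses, an object with $P(A \mid [x]) = 0.5$ is cheapest to place in the boundary region, while one with $P(A \mid [x]) = 0.875$ is cheapest to place in the positive region.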

Minimum-risk decision rules

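If we consider the loss functions $\lambda_{PP} \leq \lambda_{BP} < \lambda_{NP}$ and $\lambda_{NN} \leq \lambda_{BN} < \lambda_{PN}$, the following decision rules are formulated (P, N, B):

(P) If $P(A \mid [x]) \geq \gamma$ and $P(A \mid [x]) \geq \alpha$, decide POS($A$);

(N) If $P(A \mid [x]) \leq \beta$ and $P(A \mid [x]) \leq \gamma$, decide NEG($A$);

(B) If $\beta \leq P(A \mid [x]) \leq \alpha$, decide BND($A$);

where

$$\alpha = \frac{\lambda_{PN} - \lambda_{BN}}{(\lambda_{BP} - \lambda_{BN}) - (\lambda_{PP} - \lambda_{PN})},$$

$$\gamma = \frac{\lambda_{PN} - \lambda_{NN}}{(\lambda_{NP} - \lambda_{NN}) - (\lambda_{PP} - \lambda_{PN})},$$

$$\beta = \frac{\lambda_{BN} - \lambda_{NN}}{(\lambda_{NP} - \lambda_{NN}) - (\lambda_{BP} - \lambda_{BN})}.$$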
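The threshold $\alpha$ can be recovered by comparing two of the expected losses. The step below is a reconstruction under the stated loss ordering, not a quotation from the cited paper; it writes $p$ for $P(A \mid [x])$, so that $P(A^c \mid [x]) = 1 - p$.

```latex
\begin{aligned}
R(a_P \mid [x]) \le R(a_B \mid [x])
  &\iff \lambda_{PP}\,p + \lambda_{PN}(1-p) \le \lambda_{BP}\,p + \lambda_{BN}(1-p) \\
  &\iff (\lambda_{PN}-\lambda_{BN})(1-p) \le (\lambda_{BP}-\lambda_{PP})\,p \\
  &\iff p \ge \frac{\lambda_{PN}-\lambda_{BN}}
                   {(\lambda_{BP}-\lambda_{BN})-(\lambda_{PP}-\lambda_{PN})} = \alpha .
\end{aligned}
```

The last step divides by $(\lambda_{PN}-\lambda_{BN}) + (\lambda_{BP}-\lambda_{PP})$, which is positive because $\lambda_{BN} < \lambda_{PN}$ and $\lambda_{PP} \leq \lambda_{BP}$. Comparing $R(a_P \mid [x])$ with $R(a_N \mid [x])$ in the same way yields $\gamma$, and comparing $R(a_N \mid [x])$ with $R(a_B \mid [x])$ yields $\beta$.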

The $\alpha$, $\beta$, and $\gamma$ values define the three different regions, giving us an associated risk for classifying an object. When $\alpha > \beta$, we get $\alpha > \gamma > \beta$ and can simplify (P, N, B) into (P1, N1, B1):

(P1) If $P(A \mid [x]) \geq \alpha$, decide POS($A$);

(N1) If $P(A \mid [x]) \leq \beta$, decide NEG($A$);

(B1) If $\beta < P(A \mid [x]) < \alpha$, decide BND($A$).
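The simplified rules (P1, N1, B1) lend themselves to a short Python sketch that first computes the thresholds from the loss functions and then assigns a region. The function names `thresholds` and `classify`, the dictionary layout, and the loss values are assumptions for illustration only.

```python
from typing import Dict, Tuple


def thresholds(loss: Dict[str, float]) -> Tuple[float, float, float]:
    """Compute (alpha, beta, gamma) from the six loss values.

    loss maps 'PP', 'PN', 'NP', 'NN', 'BP', 'BN' to lambda values and is
    assumed to satisfy PP <= BP < NP and NN <= BN < PN.
    """
    alpha = (loss["PN"] - loss["BN"]) / (
        (loss["BP"] - loss["BN"]) - (loss["PP"] - loss["PN"]))
    gamma = (loss["PN"] - loss["NN"]) / (
        (loss["NP"] - loss["NN"]) - (loss["PP"] - loss["PN"]))
    beta = (loss["BN"] - loss["NN"]) / (
        (loss["NP"] - loss["NN"]) - (loss["BP"] - loss["BN"]))
    return alpha, beta, gamma


def classify(p_a: float, alpha: float, beta: float) -> str:
    """Apply rules (P1), (N1), (B1); only valid when alpha > beta."""
    if not alpha > beta:
        raise ValueError("rules (P1, N1, B1) require alpha > beta")
    if p_a >= alpha:
        return "POS"
    if p_a <= beta:
        return "NEG"
    return "BND"


# Hypothetical losses: correct decisions cost 0, deferring costs 1,
# and a wrong decision costs 4.
example_loss = {"PP": 0.0, "BP": 1.0, "NP": 4.0,
                "NN": 0.0, "BN": 1.0, "PN": 4.0}

alpha, beta, gamma = thresholds(example_loss)
regions = [classify(p, alpha, beta) for p in (0.9, 0.5, 0.1)]
```

For these assumed losses the thresholds come out as $\alpha = 0.75$, $\gamma = 0.5$, and $\beta = 0.25$, which is consistent with the ordering $\alpha > \gamma > \beta$ stated above.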

When $\alpha = \beta = \gamma$, we can simplify the rules (P, N, B) into (P2, N2, B2), which divide the regions based solely on $\alpha$:

(P2) If $P(A \mid [x]) > \alpha$, decide POS($A$);

(N2) If $P(A \mid [x]) < \alpha$, decide NEG($A$);

(B2) If $P(A \mid [x]) = \alpha$, decide BND($A$).

Data mining, feature selection, information retrieval, and classification are among the applications in which the DTRS approach has been successfully used.

References

  1. Yao, Y.Y.; Wong, S.K.M.; Lingras, P. (1990). A decision-theoretic rough set model. Methodologies for Intelligent Systems, 5, Proceedings of the 5th International Symposium on Methodologies for Intelligent Systems. Knoxville, Tennessee, USA: North-Holland. pp. 17–25.
