Rough set explained
In computer science, a rough set, first described by Polish computer scientist Zdzisław I. Pawlak, is a formal approximation of a crisp set (i.e., conventional set) in terms of a pair of sets which give the lower and the upper approximation of the original set. In the standard version of rough set theory described in Pawlak (1991),[1] the lower- and upper-approximation sets are crisp sets, but in other variations, the approximating sets may be fuzzy sets.
Definitions
The following section contains an overview of the basic framework of rough set theory, as originally proposed by Zdzisław I. Pawlak, along with some of the key definitions. More formal properties and boundaries of rough sets can be found in the cited references. The initial and basic theory of rough sets is sometimes referred to as "Pawlak Rough Sets" or "classical rough sets", as a means to distinguish it from more recent extensions and generalizations.
Information system framework
Let $I=(U,A)$ be an information system (attribute–value system), where $U$ is a non-empty, finite set of objects (the universe) and $A$ is a non-empty, finite set of attributes such that $a:U\to V_a$ for every $a\in A$. $V_a$ is the set of values that attribute $a$ may take. The information table assigns a value $a(x)$ from $V_a$ to each attribute $a$ and object $x$ in the universe $U$.
With any $P\subseteq A$ there is an associated equivalence relation $\mathrm{IND}(P)$:
\mathrm{IND}(P)=\left\{(x,y)\in U^2\mid\forall a\in P,\ a(x)=a(y)\right\}
The relation $\mathrm{IND}(P)$ is called a $P$-indiscernibility relation. The partition of $U$ is a family of all equivalence classes of $\mathrm{IND}(P)$ and is denoted by $U/\mathrm{IND}(P)$ (or $U/P$).
If $(x,y)\in\mathrm{IND}(P)$, then $x$ and $y$ are indiscernible (or indistinguishable) by attributes from $P$.
The equivalence classes of the $P$-indiscernibility relation are denoted $[x]_P$.
Example: equivalence-class structure
For example, consider the following information table:
Sample Information System
| Object | $P_1$ | $P_2$ | $P_3$ | $P_4$ | $P_5$ |
|---|---|---|---|---|---|
| $O_1$ | 1 | 2 | 0 | 1 | 1 |
| $O_2$ | 1 | 2 | 0 | 1 | 1 |
| $O_3$ | 2 | 0 | 0 | 1 | 0 |
| $O_4$ | 0 | 0 | 1 | 2 | 1 |
| $O_5$ | 2 | 1 | 0 | 2 | 1 |
| $O_6$ | 0 | 0 | 1 | 2 | 2 |
| $O_7$ | 2 | 0 | 0 | 1 | 0 |
| $O_8$ | 0 | 1 | 2 | 2 | 1 |
| $O_9$ | 2 | 1 | 0 | 2 | 2 |
| $O_{10}$ | 2 | 0 | 0 | 1 | 0 |
When the full set of attributes $P=\{P_1,P_2,P_3,P_4,P_5\}$ is considered, we see that we have the following seven equivalence classes:
\begin{cases}\{O_1,O_2\}\\
\{O_3,O_7,O_{10}\}\\
\{O_4\}\\
\{O_5\}\\
\{O_6\}\\
\{O_8\}\\
\{O_9\}\end{cases}
Thus, the two objects within the first equivalence class, $\{O_1,O_2\}$, cannot be distinguished from each other based on the available attributes, and the three objects within the second equivalence class, $\{O_3,O_7,O_{10}\}$, cannot be distinguished from one another based on the available attributes. The remaining five objects are each discernible from all other objects.
It is apparent that different attribute subset selections will in general lead to different indiscernibility classes. For example, if attribute $P=\{P_1\}$ alone is selected, we obtain the following, much coarser, equivalence-class structure:
\begin{cases}
\{O_1,O_2\}\\
\{O_3,O_5,O_7,O_9,O_{10}\}\\
\{O_4,O_6,O_8\}\end{cases}
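The grouping into indiscernibility classes can be reproduced mechanically. The following is a minimal Python sketch rather than a reference implementation: it assumes the rows of the sample table are labelled O1 to O10 in order and the columns P1 to P5, and the names `ATTRS`, `ROWS` and `partition` are illustrative.

```python
from collections import defaultdict

# Sample table; object labels O1..O10 and column names P1..P5 are assumed.
ATTRS = ("P1", "P2", "P3", "P4", "P5")
ROWS = {
    "O1": (1, 2, 0, 1, 1), "O2": (1, 2, 0, 1, 1),
    "O3": (2, 0, 0, 1, 0), "O4": (0, 0, 1, 2, 1),
    "O5": (2, 1, 0, 2, 1), "O6": (0, 0, 1, 2, 2),
    "O7": (2, 0, 0, 1, 0), "O8": (0, 1, 2, 2, 1),
    "O9": (2, 1, 0, 2, 2), "O10": (2, 0, 0, 1, 0),
}


def partition(attrs):
    """Return U/IND(attrs) as a set of frozensets of object labels."""
    cols = [ATTRS.index(a) for a in attrs]
    classes = defaultdict(set)
    for obj, values in ROWS.items():
        # Objects sharing the same value tuple on `attrs` are indiscernible.
        classes[tuple(values[c] for c in cols)].add(obj)
    return {frozenset(members) for members in classes.values()}


full = partition(ATTRS)        # all five attributes
coarse = partition(("P1",))    # attribute P1 alone
print(len(full), len(coarse))
for cls in coarse:
    print(sorted(cls))
```

Grouping by the tuple of attribute values is equivalent to testing $a(x)=a(y)$ for every selected attribute, so each dictionary key corresponds to exactly one equivalence class.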
Definition of a rough set
Let $X\subseteq U$ be a target set that we wish to represent using attribute subset $P$; that is, we are told that an arbitrary set of objects $X$ comprises a single class, and we wish to express this class (i.e., this subset) using the equivalence classes induced by attribute subset $P$. In general, $X$ cannot be expressed exactly, because the set may include and exclude objects which are indistinguishable on the basis of attributes $P$.
For example, consider the target set $X=\{O_1,O_2,O_3,O_4\}$, and let attribute subset $P=\{P_1,P_2,P_3,P_4,P_5\}$, the full available set of features. The set $X$ cannot be expressed exactly, because in $[x]_P$, objects $\{O_3,O_7,O_{10}\}$ are indiscernible. Thus, there is no way to represent any set $X$ which includes $O_3$ but excludes objects $O_7$ and $O_{10}$.
However, the target set $X$ can be approximated using only the information contained within $P$ by constructing the $P$-lower and $P$-upper approximations of $X$:
\underline{P}X=\{x\mid[x]_P\subseteq X\}
\overline{P}X=\{x\mid[x]_P\cap X\neq\emptyset\}
Lower approximation and positive region
The $P$-lower approximation, or positive region, is the union of all equivalence classes in $[x]_P$ which are contained by (i.e., are subsets of) the target set; in the example, $\underline{P}X=\{O_1,O_2\}\cup\{O_4\}$, the union of the two equivalence classes in $[x]_P$ which are contained in the target set. The lower approximation is the complete set of objects in $U/P$ that can be positively (i.e., unambiguously) classified as belonging to target set $X$.
Upper approximation and negative region
The $P$-upper approximation is the union of all equivalence classes in $[x]_P$ which have non-empty intersection with the target set; in the example, $\overline{P}X=\{O_1,O_2\}\cup\{O_4\}\cup\{O_3,O_7,O_{10}\}$, the union of the three equivalence classes in $[x]_P$ that have non-empty intersection with the target set. The upper approximation is the complete set of objects in $U/P$ that cannot be positively (i.e., unambiguously) classified as belonging to the complement ($\overline{X}$) of the target set $X$. In other words, the upper approximation is the complete set of objects that are possibly members of the target set $X$.
The set $U-\overline{P}X$ therefore represents the negative region, containing the set of objects that can be definitely ruled out as members of the target set.
Boundary region
The boundary region, given by set difference $\overline{P}X-\underline{P}X$, consists of those objects that can neither be ruled in nor ruled out as members of the target set $X$.
In summary, the lower approximation of a target set is a conservative approximation consisting of only those objects which can positively be identified as members of the set. (These objects have no indiscernible "clones" which are excluded by the target set.) The upper approximation is a liberal approximation which includes all objects that might be members of the target set. (Some objects in the upper approximation may not be members of the target set.) From the perspective of $U/P$, the lower approximation contains objects that are members of the target set with certainty (probability = 1), while the upper approximation contains objects that are members of the target set with non-zero probability (probability > 0).
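The four regions can be computed directly from the equivalence classes. The sketch below is illustrative only: it assumes the sample table with rows labelled O1 to O10, the target set {O1, O2, O3, O4}, and the full attribute set; the helper names `partition`, `lower` and `upper` are not taken from any library.

```python
# Sample table; object labels O1..O10 and column names P1..P5 are assumed.
ATTRS = ("P1", "P2", "P3", "P4", "P5")
ROWS = {
    "O1": (1, 2, 0, 1, 1), "O2": (1, 2, 0, 1, 1),
    "O3": (2, 0, 0, 1, 0), "O4": (0, 0, 1, 2, 1),
    "O5": (2, 1, 0, 2, 1), "O6": (0, 0, 1, 2, 2),
    "O7": (2, 0, 0, 1, 0), "O8": (0, 1, 2, 2, 1),
    "O9": (2, 1, 0, 2, 2), "O10": (2, 0, 0, 1, 0),
}


def partition(attrs):
    """Equivalence classes of the indiscernibility relation on `attrs`."""
    cols = [ATTRS.index(a) for a in attrs]
    groups = {}
    for obj, values in ROWS.items():
        groups.setdefault(tuple(values[c] for c in cols), set()).add(obj)
    return list(groups.values())


def lower(attrs, target):
    """Union of the classes wholly contained in the target set."""
    return set().union(*(c for c in partition(attrs) if c <= target))


def upper(attrs, target):
    """Union of the classes that intersect the target set."""
    return set().union(*(c for c in partition(attrs) if c & target))


X = {"O1", "O2", "O3", "O4"}          # assumed target set
low, up = lower(ATTRS, X), upper(ATTRS, X)
boundary = up - low                    # neither ruled in nor ruled out
negative = set(ROWS) - up              # definitely not in X
print(sorted(low), sorted(up), sorted(boundary), sorted(negative))
```

The lower approximation is always a subset of the target set and the target set is always a subset of the upper approximation, which is a cheap sanity check on any implementation.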
The rough set
The tuple $\langle\underline{P}X,\overline{P}X\rangle$ composed of the lower and upper approximation is called a rough set; thus, a rough set is composed of two crisp sets, one representing a lower boundary of the target set $X$, and the other representing an upper boundary of the target set $X$.
The accuracy of the rough-set representation of the set $X$ can be given[1] by the following:
\alpha_P(X)=\frac{\left|\underline{P}X\right|}{\left|\overline{P}X\right|}
That is, the accuracy of the rough set representation of $X$, $\alpha_P(X)$, $0\leq\alpha_P(X)\leq1$, is the ratio of the number of objects which can positively be placed in $X$ to the number of objects that can possibly be placed in $X$; this provides a measure of how closely the rough set is approximating the target set. Clearly, when the upper and lower approximations are equal (i.e., boundary region empty), then $\alpha_P(X)=1$, and the approximation is perfect; at the other extreme, whenever the lower approximation is empty, the accuracy is zero (regardless of the size of the upper approximation).
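As a worked instance, assume the target set is $X=\{O_1,O_2,O_3,O_4\}$ and $P$ is the full attribute set of the sample table, so that the lower approximation contains three objects and the upper approximation six. The accuracy then works out as follows:

```latex
\alpha_P(X)
  = \frac{\left|\{O_1,O_2,O_4\}\right|}
         {\left|\{O_1,O_2,O_3,O_4,O_7,O_{10}\}\right|}
  = \frac{3}{6}
  = 0.5
```

Half of the objects that might belong to $X$ can be placed in it with certainty; the other half form the boundary region.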
Objective analysis
Rough set theory is one of many methods that can be employed to analyse uncertain (including vague) systems, although it is less common than more traditional methods such as probability, statistics, entropy and Dempster–Shafer theory. However, a key difference, and a unique strength, of using classical rough set theory is that it provides an objective form of analysis.[2] Unlike the other methods listed above, classical rough set analysis requires no additional information, external parameters, models, functions, grades or subjective interpretations to determine set membership; instead it only uses the information presented within the given data.[3] More recent adaptations of rough set theory, such as dominance-based, decision-theoretic and fuzzy rough sets, have introduced more subjectivity to the analysis.
Definability
In general, the upper and lower approximations are not equal; in such cases, we say that target set $X$ is undefinable or roughly definable on attribute set $P$. When the upper and lower approximations are equal (i.e., the boundary is empty), $\overline{P}X=\underline{P}X$, then the target set $X$ is definable on attribute set $P$. We can distinguish the following special cases of undefinability:
- Set $X$ is internally undefinable if $\underline{P}X=\emptyset$ and $\overline{P}X\neq U$. This means that on attribute set $P$, there are no objects which we can be certain belong to target set $X$, but there are objects which we can definitively exclude from set $X$.
- Set $X$ is externally undefinable if $\underline{P}X\neq\emptyset$ and $\overline{P}X=U$. This means that on attribute set $P$, there are objects which we can be certain belong to target set $X$, but there are no objects which we can definitively exclude from set $X$.
- Set $X$ is totally undefinable if $\underline{P}X=\emptyset$ and $\overline{P}X=U$. This means that on attribute set $P$, there are no objects which we can be certain belong to target set $X$, and there are no objects which we can definitively exclude from set $X$. Thus, on attribute set $P$, we cannot decide whether any object is, or is not, a member of $X$.
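The case distinction can be written as a small decision procedure over the two approximations. This is a sketch under stated assumptions: the function name `definability` and the toy universe are hypothetical, and the fall-through label "roughly definable" is used here for the remaining case in which the lower approximation is non-empty and the upper approximation is smaller than the universe.

```python
def definability(lower, upper, universe):
    """Classify a target set from its lower and upper approximations."""
    if lower == upper:
        return "definable"                  # empty boundary region
    if not lower and upper == universe:
        return "totally undefinable"        # nothing ruled in, nothing ruled out
    if not lower:
        return "internally undefinable"     # nothing ruled in, something ruled out
    if upper == universe:
        return "externally undefinable"     # something ruled in, nothing ruled out
    return "roughly definable"


# Toy inputs chosen only to exercise each branch.
U = {1, 2, 3, 4}
print(definability({1, 2}, {1, 2}, U))
print(definability(set(), {1, 2}, U))
print(definability({1}, U, U))
print(definability(set(), U, U))
print(definability({1}, {1, 2}, U))
```

The order of the tests matters: the totally undefinable case must be checked before the internal and external cases, because it satisfies one condition of each.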
Reduct and core
An interesting question is whether there are attributes in the information system (attribute–value table) which are more important to the knowledge represented in the equivalence class structure than other attributes. Often, we wonder whether there is a subset of attributes which can, by itself, fully characterize the knowledge in the database; such an attribute set is called a reduct.
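Formally, a reduct is a subset of attributes $\mathrm{RED}\subseteq P$ such that
- $[x]_{\mathrm{RED}}=[x]_P$, that is, the equivalence classes induced by the reduced attribute set $\mathrm{RED}$ are the same as the equivalence class structure induced by the full attribute set $P$.
- the attribute set $\mathrm{RED}$ is minimal, in the sense that $[x]_{(\mathrm{RED}-\{a\})}\neq[x]_P$ for any attribute $a\in\mathrm{RED}$; in other words, no attribute can be removed from set $\mathrm{RED}$ without changing the equivalence classes $[x]_P$.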
A reduct can be thought of as a sufficient set of features; sufficient, that is, to represent the category structure. In the example table above, attribute set $\{P_3,P_4,P_5\}$ is a reduct: the information system projected on just these attributes possesses the same equivalence class structure as that expressed by the full attribute set:
\begin{cases}\{O_1,O_2\}\\
\{O_3,O_7,O_{10}\}\\
\{O_4\}\\
\{O_5\}\\
\{O_6\}\\
\{O_8\}\\
\{O_9\}\end{cases}
Attribute set $\{P_3,P_4,P_5\}$ is a reduct because eliminating any of these attributes causes a collapse of the equivalence-class structure, with the result that $[x]_{\mathrm{RED}}\neq[x]_P$.
The reduct of an information system is not unique: there may be many subsets of attributes which preserve the equivalence-class structure (i.e., the knowledge) expressed in the information system. In the example information system above, another reduct is $\{P_1,P_2,P_5\}$, producing the same equivalence-class structure as $[x]_P$.
The set of attributes which is common to all reducts is called the core: the core is the set of attributes which is possessed by every reduct, and therefore consists of attributes which cannot be removed from the information system without causing collapse of the equivalence-class structure. The core may be thought of as the set of necessary attributes; necessary, that is, for the category structure to be represented. In the example, the only such attribute is $\{P_5\}$; any one of the other attributes can be removed singly without damaging the equivalence-class structure, and hence these are all dispensable. However, removing $\{P_5\}$ by itself does change the equivalence-class structure, and thus $\{P_5\}$ is the indispensable attribute of this information system, and hence the core.
It is possible for the core to be empty, which means that there is no indispensable attribute: any single attribute in such an information system can be deleted without altering the equivalence-class structure. In such cases, there is no essential or necessary attribute which is required for the class structure to be represented.
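For a table this small, reducts and the core can be found by exhaustive search. The sketch below is a brute-force illustration, not an efficient algorithm (the number of attribute subsets grows exponentially); it assumes the sample table with rows labelled O1 to O10, and the names `partition` and `reducts` are illustrative.

```python
from itertools import combinations

# Sample table; object labels O1..O10 and column names P1..P5 are assumed.
ATTRS = ("P1", "P2", "P3", "P4", "P5")
ROWS = {
    "O1": (1, 2, 0, 1, 1), "O2": (1, 2, 0, 1, 1),
    "O3": (2, 0, 0, 1, 0), "O4": (0, 0, 1, 2, 1),
    "O5": (2, 1, 0, 2, 1), "O6": (0, 0, 1, 2, 2),
    "O7": (2, 0, 0, 1, 0), "O8": (0, 1, 2, 2, 1),
    "O9": (2, 1, 0, 2, 2), "O10": (2, 0, 0, 1, 0),
}


def partition(attrs):
    """Equivalence classes induced by `attrs`, as a set of frozensets."""
    cols = [ATTRS.index(a) for a in attrs]
    groups = {}
    for obj, values in ROWS.items():
        groups.setdefault(tuple(values[c] for c in cols), set()).add(obj)
    return {frozenset(g) for g in groups.values()}


def reducts():
    """All minimal attribute subsets that preserve the full partition."""
    full = partition(ATTRS)
    found = []
    # Visiting subsets by increasing size means any smaller preserving
    # subset has already been recorded when a superset is examined.
    for size in range(1, len(ATTRS) + 1):
        for subset in combinations(ATTRS, size):
            if partition(subset) != full:
                continue
            if not any(set(r) <= set(subset) for r in found):
                found.append(subset)
    return found


all_reducts = reducts()
core = set.intersection(*(set(r) for r in all_reducts))
print(all_reducts)
print(core)
```

On the sample table this search should report the two reducts named in the text together with $\{P_1,P_3,P_5\}$ and $\{P_2,P_3,P_5\}$, all of which contain $P_5$.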
Attribute dependency
One of the most important aspects of database analysis or data acquisition is the discovery of attribute dependencies; that is, we wish to discover which variables are strongly related to which other variables. Generally, it is these strong relationships that will warrant further investigation, and that will ultimately be of use in predictive modeling.
In rough set theory, the notion of dependency is defined very simply. Let us take two (disjoint) sets of attributes, set $P$ and set $Q$, and inquire what degree of dependency obtains between them. Each attribute set induces an (indiscernibility) equivalence class structure, the equivalence classes induced by $P$ given by $[x]_P$, and the equivalence classes induced by $Q$ given by $[x]_Q$.
Let $[x]_Q=\{Q_1,Q_2,Q_3,\dots,Q_N\}$, where $Q_i$ is a given equivalence class from the equivalence-class structure induced by attribute set $Q$. Then, the dependency of attribute set $Q$ on attribute set $P$, $\gamma_P(Q)$, is given by
\gamma_P(Q)=\frac{\sum_{i=1}^{N}\left|\underline{P}Q_i\right|}{\left|U\right|}\leq1
That is, for each equivalence class $Q_i$ in $[x]_Q$, we add up the size of its lower approximation by the attributes in $P$, i.e., $\underline{P}Q_i$. This approximation (as above, for arbitrary set $X$) is the number of objects which on attribute set $P$ can be positively identified as belonging to target set $Q_i$. Added across all equivalence classes in $[x]_Q$, the numerator above represents the total number of objects which, based on attribute set $P$, can be positively categorized according to the classification induced by attributes $Q$. The dependency ratio therefore expresses the proportion (within the entire universe) of such classifiable objects. The dependency $\gamma_P(Q)$ "can be interpreted as a proportion of such objects in the information system for which it suffices to know the values of attributes in $P$ to determine the values of attributes in $Q$".
Another, intuitive, way to consider dependency is to take the partition induced by $Q$ as the target class $C$, and consider $P$ as the attribute set we wish to use in order to "re-construct" the target class $C$. If $P$ can completely reconstruct $C$, then $Q$ depends totally upon $P$; if $P$ results in a poor and perhaps a random reconstruction of $C$, then $Q$ does not depend upon $P$ at all.
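The dependency degree can be computed by summing the sizes of the lower approximations of each class of the dependent attribute set. The sketch below is illustrative: it assumes the sample table with rows labelled O1 to O10, and the function names `partition` and `gamma` are hypothetical.

```python
# Sample table; object labels O1..O10 and column names P1..P5 are assumed.
ATTRS = ("P1", "P2", "P3", "P4", "P5")
ROWS = {
    "O1": (1, 2, 0, 1, 1), "O2": (1, 2, 0, 1, 1),
    "O3": (2, 0, 0, 1, 0), "O4": (0, 0, 1, 2, 1),
    "O5": (2, 1, 0, 2, 1), "O6": (0, 0, 1, 2, 2),
    "O7": (2, 0, 0, 1, 0), "O8": (0, 1, 2, 2, 1),
    "O9": (2, 1, 0, 2, 2), "O10": (2, 0, 0, 1, 0),
}


def partition(attrs):
    """Equivalence classes induced by `attrs`."""
    cols = [ATTRS.index(a) for a in attrs]
    groups = {}
    for obj, values in ROWS.items():
        groups.setdefault(tuple(values[c] for c in cols), set()).add(obj)
    return list(groups.values())


def gamma(p_attrs, q_attrs):
    """Degree to which attribute set Q depends on attribute set P."""
    p_classes = partition(p_attrs)
    # A P-class lies inside at most one Q-class, so nothing is counted twice.
    positive = sum(
        len(p) for q in partition(q_attrs) for p in p_classes if p <= q
    )
    return positive / len(ROWS)


print(gamma(("P1",), ("P4",)))             # P4 on P1
print(gamma(("P4",), ("P1",)))             # P1 on P4: the measure is not symmetric
print(gamma(("P1", "P2", "P3"), ("P4",)))  # P4 on {P1, P2, P3}
```

Swapping the two arguments generally changes the result, which is the asymmetry of the measure in concrete form.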
Thus, this measure of dependency expresses the degree of functional (i.e., deterministic) dependency of attribute set $Q$ on attribute set $P$; it is not symmetric. The relationship of this notion of attribute dependency to more traditional information-theoretic (i.e., entropic) notions of attribute dependence has been discussed in a number of sources, e.g. Pawlak, Wong, & Ziarko (1988),[4] Yao & Yao (2002),[5] Wong, Ziarko, & Ye (1986),[6] and Quafafou & Boussouf (2000).[7]
Rule extraction
The category representations discussed above are all extensional in nature; that is, a category or complex class is simply the sum of all its members. To represent a category is, then, just to be able to list or identify all the objects belonging to that category. However, extensional category representations have very limited practical use, because they provide no insight for deciding whether novel (never-before-seen) objects are members of the category.
What is generally desired is an intensional description of the category, a representation of the category based on a set of rules that describe the scope of the category. The choice of such rules is not unique, and therein lies the issue of inductive bias. See Version space and Model selection for more about this issue.
There are a few rule-extraction methods. We will start from a rule-extraction procedure based on Ziarko & Shan (1995).[8]
Decision matrices
Let us say that we wish to find the minimal set of consistent rules (logical implications) that characterize our sample system. For a set of condition attributes $\mathcal{P}=\{P_1,P_2,P_3,\dots,P_n\}$ and a decision attribute $Q$, $Q\notin\mathcal{P}$, these rules should have the form $P_i^aP_j^b\dots P_k^c\to Q^d$, or, spelled out,
(P_i=a)\land(P_j=b)\land\dots\land(P_k=c)\to(Q=d)
where $\{a,b,c,\dots\}$ are legitimate values from the domains of their respective attributes. This is a form typical of association rules, and the number of items in $U$ which match the condition/antecedent is called the support for the rule. The method for extracting such rules given in Ziarko & Shan (1995) is to form a decision matrix corresponding to each individual value $d$ of decision attribute $Q$. Informally, the decision matrix for value $d$ of decision attribute $Q$ lists all attribute–value pairs that differ between objects having $Q=d$ and $Q\neq d$.
This is best explained by example (which also avoids a lot of notation). Consider the table above, and let $P_4$ be the decision variable (i.e., the variable on the right side of the implications) and let $\{P_1,P_2,P_3\}$ be the condition variables (on the left side of the implication). We note that the decision variable $P_4$ takes on two different values, namely $\{1,2\}$. We treat each case separately.
First, we look at the case $P_4=1$, and we divide up $U$ into objects that have $P_4=1$ and those that have $P_4\neq1$. (Note that objects with $P_4\neq1$ in this case are simply the objects that have $P_4=2$, but in general, $P_4\neq1$ would include all objects having any value for $P_4$ other than $P_4=1$, and there may be several such classes of objects (for example, those having $P_4=2,3,4,$ etc.).) In this case, the objects having $P_4=1$ are $\{O_1,O_2,O_3,O_7,O_{10}\}$ while the objects which have $P_4\neq1$ are $\{O_4,O_5,O_6,O_8,O_9\}$. The decision matrix for $P_4=1$ lists all the differences between the objects having $P_4=1$ and those having $P_4\neq1$; that is, the decision matrix lists all the differences between $\{O_1,O_2,O_3,O_7,O_{10}\}$ and $\{O_4,O_5,O_6,O_8,O_9\}$. We put the "positive" objects ($P_4=1$) as the rows, and the "negative" objects ($P_4\neq1$) as the columns.
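Decision matrix for $P_4=1$ (an entry $P_i^v$ means that the row object has $P_i=v$ and differs from the column object on $P_i$)
| Object | $O_4$ | $O_5$ | $O_6$ | $O_8$ | $O_9$ |
|---|---|---|---|---|---|
| $O_1$ | $P_1^1,P_2^2,P_3^0$ | $P_1^1,P_2^2$ | $P_1^1,P_2^2,P_3^0$ | $P_1^1,P_2^2,P_3^0$ | $P_1^1,P_2^2$ |
| $O_2$ | $P_1^1,P_2^2,P_3^0$ | $P_1^1,P_2^2$ | $P_1^1,P_2^2,P_3^0$ | $P_1^1,P_2^2,P_3^0$ | $P_1^1,P_2^2$ |
| $O_3$ | $P_1^2,P_3^0$ | $P_2^0$ | $P_1^2,P_3^0$ | $P_1^2,P_2^0,P_3^0$ | $P_2^0$ |
| $O_7$ | $P_1^2,P_3^0$ | $P_2^0$ | $P_1^2,P_3^0$ | $P_1^2,P_2^0,P_3^0$ | $P_2^0$ |
| $O_{10}$ | $P_1^2,P_3^0$ | $P_2^0$ | $P_1^2,P_3^0$ | $P_1^2,P_2^0,P_3^0$ | $P_2^0$ |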
To read this decision matrix, look, for example, at the intersection of row $O_3$ and column $O_6$, showing $P_1^2,P_3^0$ in the cell. This means that with regard to decision value $P_4=1$, object $O_3$ differs from object $O_6$ on attributes $P_1$ and $P_3$, and the particular values on these attributes for the positive object $O_3$ are $P_1=2$ and $P_3=0$. This tells us that the correct classification of $O_3$ as belonging to decision class $P_4=1$ rests on attributes $P_1$ and $P_3$; although one or the other might be dispensable, we know that at least one of these attributes is indispensable.
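A decision matrix of this kind can be built by comparing every positive object with every negative object on the condition attributes. The following sketch is an illustration only: it assumes the sample table with rows labelled O1 to O10, P4 as the decision attribute and P1 to P3 as condition attributes; `decision_matrix` is a hypothetical helper name.

```python
# Sample table; object labels O1..O10 and column names P1..P5 are assumed.
ATTRS = ("P1", "P2", "P3", "P4", "P5")
ROWS = {
    "O1": (1, 2, 0, 1, 1), "O2": (1, 2, 0, 1, 1),
    "O3": (2, 0, 0, 1, 0), "O4": (0, 0, 1, 2, 1),
    "O5": (2, 1, 0, 2, 1), "O6": (0, 0, 1, 2, 2),
    "O7": (2, 0, 0, 1, 0), "O8": (0, 1, 2, 2, 1),
    "O9": (2, 1, 0, 2, 2), "O10": (2, 0, 0, 1, 0),
}
CONDITIONS = ("P1", "P2", "P3")   # assumed condition attributes
DECISION = "P4"                   # assumed decision attribute


def decision_matrix(value):
    """Map (positive, negative) object pairs to differing (attribute, value) pairs."""
    d = ATTRS.index(DECISION)
    positives = [o for o, v in ROWS.items() if v[d] == value]
    negatives = [o for o, v in ROWS.items() if v[d] != value]
    matrix = {}
    for pos in positives:
        for neg in negatives:
            # Record the positive object's value wherever the two objects differ.
            matrix[pos, neg] = [
                (a, ROWS[pos][ATTRS.index(a)])
                for a in CONDITIONS
                if ROWS[pos][ATTRS.index(a)] != ROWS[neg][ATTRS.index(a)]
            ]
    return matrix


matrix = decision_matrix(1)
print(matrix["O3", "O6"])   # [('P1', 2), ('P3', 0)]
print(matrix["O1", "O5"])   # [('P1', 1), ('P2', 2)]
```

Each cell is a list of alternatives, any one of which suffices to tell the positive object apart from that particular negative object.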
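Next, from each decision matrix we form a set of Boolean expressions, one expression for each row of the matrix. The items within each cell are aggregated disjunctively, and the individual cells are then aggregated conjunctively. Thus, for the above table we have the following five Boolean expressions:
\begin{cases}
(P_1^1\lor P_2^2\lor P_3^0)\land(P_1^1\lor P_2^2)\land(P_1^1\lor P_2^2\lor P_3^0)\land(P_1^1\lor P_2^2\lor P_3^0)\land(P_1^1\lor P_2^2)\\
(P_1^1\lor P_2^2\lor P_3^0)\land(P_1^1\lor P_2^2)\land(P_1^1\lor P_2^2\lor P_3^0)\land(P_1^1\lor P_2^2\lor P_3^0)\land(P_1^1\lor P_2^2)\\
(P_1^2\lor P_3^0)\land(P_2^0)\land(P_1^2\lor P_3^0)\land(P_1^2\lor P_2^0\lor P_3^0)\land(P_2^0)\\
(P_1^2\lor P_3^0)\land(P_2^0)\land(P_1^2\lor P_3^0)\land(P_1^2\lor P_2^0\lor P_3^0)\land(P_2^0)\\
(P_1^2\lor P_3^0)\land(P_2^0)\land(P_1^2\lor P_3^0)\land(P_1^2\lor P_2^0\lor P_3^0)\land(P_2^0)
\end{cases}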
Each statement here is essentially a highly specific (probably too specific) rule governing the membership in class $P_4=1$ of the corresponding object. For example, the last statement, corresponding to object $O_{10}$, states that all the following must be satisfied:
- Either $P_1$ must have value 2, or $P_3$ must have value 0, or both.
- $P_2$ must have value 0.
- Either $P_1$ must have value 2, or $P_3$ must have value 0, or both.
- Either $P_1$ must have value 2, or $P_2$ must have value 0, or $P_3$ must have value 0, or any combination thereof.
- $P_2$ must have value 0.
It is clear that there is a large amount of redundancy here, and the next step is to simplify using traditional Boolean algebra. The statement
(P_1^1\lor P_2^2\lor P_3^0)\land(P_1^1\lor P_2^2)\land(P_1^1\lor P_2^2\lor P_3^0)\land(P_1^1\lor P_2^2\lor P_3^0)\land(P_1^1\lor P_2^2)
corresponding to objects $\{O_1,O_2\}$ simplifies to $P_1^1\lor P_2^2$, which yields the implication
(P_1=1)\lor(P_2=2)\to(P_4=1)
Likewise, the statement
(P_1^2\lor P_3^0)\land(P_2^0)\land(P_1^2\lor P_3^0)\land(P_1^2\lor P_2^0\lor P_3^0)\land(P_2^0)
corresponding to objects $\{O_3,O_7,O_{10}\}$ simplifies to $P_1^2P_2^0\lor P_3^0P_2^0$. This gives us the implication
(P_1=2\land P_2=0)\lor(P_3=0\land P_2=0)\to(P_4=1)
The above implications can also be written as the following rule set:
\begin{cases}
(P_1=1)\to(P_4=1)\\
(P_2=2)\to(P_4=1)\\
(P_1=2)\land(P_2=0)\to(P_4=1)\\
(P_3=0)\land(P_2=0)\to(P_4=1)\end{cases}
It can be noted that each of the first two rules has a support of 2 (i.e., the antecedent matches two objects, $O_1$ and $O_2$), while each of the last two rules has a support of 3 (the antecedent matches $O_3$, $O_7$ and $O_{10}$). To finish writing the rule set for this knowledge system, the same procedure as above (starting with writing a new decision matrix) should be followed for the case of $P_4=2$, thus yielding a new set of implications for that decision value (i.e., a set of implications with $P_4=2$ as the consequent). In general, the procedure will be repeated for each possible value of the decision variable.
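Extracted rules can be checked against the table by counting the objects each antecedent matches (its support, as defined above) and confirming that every matched object carries the stated decision value. This is a minimal sketch under the assumption that the sample table has rows labelled O1 to O10; the `RULES` encoding and the helper `matched` are illustrative.

```python
# Sample table; object labels O1..O10 and column names P1..P5 are assumed.
ATTRS = ("P1", "P2", "P3", "P4", "P5")
ROWS = {
    "O1": (1, 2, 0, 1, 1), "O2": (1, 2, 0, 1, 1),
    "O3": (2, 0, 0, 1, 0), "O4": (0, 0, 1, 2, 1),
    "O5": (2, 1, 0, 2, 1), "O6": (0, 0, 1, 2, 2),
    "O7": (2, 0, 0, 1, 0), "O8": (0, 1, 2, 2, 1),
    "O9": (2, 1, 0, 2, 2), "O10": (2, 0, 0, 1, 0),
}

# Each rule: (antecedent as attribute -> value, decision value for P4).
RULES = [
    ({"P1": 1}, 1),
    ({"P2": 2}, 1),
    ({"P1": 2, "P2": 0}, 1),
    ({"P3": 0, "P2": 0}, 1),
]


def matched(antecedent):
    """Objects whose attribute values satisfy every condition of the antecedent."""
    return [
        obj for obj, values in ROWS.items()
        if all(values[ATTRS.index(a)] == v for a, v in antecedent.items())
    ]


d = ATTRS.index("P4")
supports = []
consistent = []
for antecedent, decision in RULES:
    objs = matched(antecedent)
    supports.append(len(objs))
    consistent.append(all(ROWS[o][d] == decision for o in objs))
    print(antecedent, "->", decision, "matches", objs)
```

A rule is consistent with the table when no matched object contradicts its consequent; an inconsistent rule would indicate an error in the simplification step.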
LERS rule induction system
The data system LERS (Learning from Examples based on Rough Sets)[9] may induce rules from inconsistent data, i.e., data with conflicting objects. Two objects are conflicting when they are characterized by the same values of all attributes, but they belong to different concepts (classes). LERS uses rough set theory to compute lower and upper approximations for concepts involved in conflicts with other concepts.
Rules induced from the lower approximation of the concept certainly describe the concept, hence such rules are called certain. On the other hand, rules induced from the upper approximation of the concept describe the concept possibly, so these rules are called possible. For rule induction LERS uses three algorithms: LEM1, LEM2, and IRIM.
The LEM2 algorithm of LERS is frequently used for rule induction and is used not only in LERS but also in other systems, e.g., in RSES.[10] LEM2 explores the search space of attribute–value pairs. Its input data set is a lower or upper approximation of a concept, so its input data set is always consistent. In general, LEM2 computes a local covering and then converts it into a rule set. We will quote a few definitions to describe the LEM2 algorithm.
The LEM2 algorithm is based on an idea of an attribute–value pair block. Let $X$ be a nonempty lower or upper approximation of a concept represented by a decision-value pair $(d,w)$. Set $X$ depends on a set $T$ of attribute–value pairs $t=(a,v)$ if and only if
\emptyset\neq[T]=\bigcap_{t\in T}[t]\subseteq X.
Set $T$ is a minimal complex of $X$ if and only if $X$ depends on $T$ and no proper subset $S$ of $T$ exists such that $X$ depends on $S$. Let $\mathbb{T}$ be a nonempty collection of nonempty sets of attribute–value pairs. Then $\mathbb{T}$ is a local covering of $X$ if and only if the following three conditions are satisfied:
- each member $T$ of $\mathbb{T}$ is a minimal complex of $X$,
- $\bigcup_{T\in\mathbb{T}}[T]=X$, and
- $\mathbb{T}$ is minimal, i.e., $\mathbb{T}$ has the smallest possible number of members.
For our sample information system, LEM2 will induce the following rules:
\begin{cases}
(P_1,1)\to(P_4,1)\\
(P_5,0)\to(P_4,1)\\
(P_1,0)\to(P_4,2)\\
(P_2,1)\to(P_4,2)
\end{cases}
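The notions of block and dependence can be checked on these rules. The sketch below does not implement LEM2 itself; it only verifies two of the local-covering conditions for the rule antecedents listed above. It assumes the sample table with rows labelled O1 to O10, takes the block $[t]$ of a pair $t=(a,v)$ to be the set of objects with value $v$ on attribute $a$, and uses hypothetical helper names.

```python
# Sample table; object labels O1..O10 and column names P1..P5 are assumed.
ATTRS = ("P1", "P2", "P3", "P4", "P5")
ROWS = {
    "O1": (1, 2, 0, 1, 1), "O2": (1, 2, 0, 1, 1),
    "O3": (2, 0, 0, 1, 0), "O4": (0, 0, 1, 2, 1),
    "O5": (2, 1, 0, 2, 1), "O6": (0, 0, 1, 2, 2),
    "O7": (2, 0, 0, 1, 0), "O8": (0, 1, 2, 2, 1),
    "O9": (2, 1, 0, 2, 2), "O10": (2, 0, 0, 1, 0),
}


def block(pair):
    """[t]: all objects taking the given value on the given attribute."""
    attr, value = pair
    col = ATTRS.index(attr)
    return {obj for obj, values in ROWS.items() if values[col] == value}


def block_of(pairs):
    """[T]: intersection of the blocks of all pairs in a nonempty T."""
    return set.intersection(*(block(t) for t in pairs))


def depends_on(target, pairs):
    """True when [T] is nonempty and contained in the target set."""
    covered = block_of(pairs)
    return bool(covered) and covered <= target


# The table has no conflicting objects, so each concept equals its approximations.
concept_1 = block(("P4", 1))
concept_2 = block(("P4", 2))
covering_1 = [[("P1", 1)], [("P5", 0)]]
covering_2 = [[("P1", 0)], [("P2", 1)]]

for concept, covering in ((concept_1, covering_1), (concept_2, covering_2)):
    print(all(depends_on(concept, T) for T in covering))
    print(set().union(*(block_of(T) for T in covering)) == concept)
```

Because every antecedent here is a single pair, each one is trivially a minimal complex once dependence holds; checking that the collection has the smallest possible number of members would require a search that this sketch omits.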
Other rule-learning methods can be found, e.g., in Pawlak (1991),[1] Stefanowski (1998),[11] Bazan et al. (2004),[10] etc.
Incomplete data
Rough set theory is useful for rule induction from incomplete data sets. Using this approach we can distinguish between three types of missing attribute values: lost values (the values that were recorded but currently are unavailable), attribute-concept values (these missing attribute values may be replaced by any attribute value limited to the same concept), and "do not care" conditions (the original values were irrelevant). A concept (class) is a set of all objects classified (or diagnosed) the same way.
Two special data sets with missing attribute values were extensively studied: in the first case, all missing attribute values were lost,[12] in the second case, all missing attribute values were "do not care" conditions.[13]
In the attribute-concept interpretation of a missing attribute value, the missing value may be replaced by any value of the attribute domain restricted to the concept to which the object with the missing value belongs.[14] For example, suppose the value of the attribute Temperature is missing for a patient who is sick with flu, and all remaining patients sick with flu have the values high or very-high for Temperature. Interpreting the missing value as an attribute-concept value, we replace it with high and very-high. Additionally, the characteristic relation makes it possible to process data sets with all three kinds of missing attribute values at the same time: lost values, "do not care" conditions, and attribute-concept values.
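The attribute-concept interpretation can be illustrated with a few records. The patient data below is invented purely for illustration and mirrors the Temperature example; the function name `attribute_concept_values` and the use of `None` to mark a missing value are assumptions of this sketch.

```python
# Hypothetical records; None marks a missing attribute value.
patients = [
    {"Temperature": None, "Flu": "yes"},
    {"Temperature": "high", "Flu": "yes"},
    {"Temperature": "very-high", "Flu": "yes"},
    {"Temperature": "normal", "Flu": "no"},
]


def attribute_concept_values(records, attribute, decision, index):
    """Candidate values for a missing attribute, restricted to the record's concept."""
    concept = records[index][decision]
    return {
        r[attribute]
        for r in records
        if r[decision] == concept and r[attribute] is not None
    }


candidates = attribute_concept_values(patients, "Temperature", "Flu", 0)
print(sorted(candidates))   # ['high', 'very-high']
```

Values seen only outside the concept (here, normal) are excluded, which is what distinguishes this interpretation from a "do not care" condition ranging over the whole attribute domain.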
Applications
Rough set methods can be applied as a component of hybrid solutions in machine learning and data mining. They have been found to be particularly useful for rule induction and feature selection (semantics-preserving dimensionality reduction). Rough set-based data analysis methods have been successfully applied in bioinformatics, economics and finance, medicine, multimedia, web and text mining, signal and image processing, software engineering, robotics, and engineering (e.g. power systems and control engineering). More recently, the three regions of rough sets have been interpreted as regions of acceptance, rejection and deferment. This leads to a three-way decision-making approach, which can potentially lead to interesting future applications.
History
The idea of the rough set was proposed by Pawlak (1981) as a new mathematical tool to deal with vague concepts. Comer, Grzymala-Busse, Iwinski, Nieminen, Novotny, Pawlak, Obtulowicz, and Pomykala have studied algebraic properties of rough sets. Different algebraic semantics have been developed by P. Pagliani, I. Duntsch, M. K. Chakraborty, M. Banerjee and A. Mani; these have been extended to more generalized rough sets by D. Cattaneo and A. Mani, in particular. Rough sets can be used to represent ambiguity, vagueness and general uncertainty.
Extensions and generalizations
Since the development of rough sets, extensions and generalizations have continued to evolve. Initial developments focused on the relationship (both similarities and differences) with fuzzy sets. While some literature contends these concepts are different, other literature considers that rough sets are a generalization of fuzzy sets, as represented through either fuzzy rough sets or rough fuzzy sets. Pawlak (1995) considered that fuzzy and rough sets should be treated as being complementary to each other, addressing different aspects of uncertainty and vagueness.
Three notable extensions of classical rough sets are:
- Dominance-based rough set approach (DRSA) is an extension of rough set theory for multi-criteria decision analysis (MCDA), introduced by Greco, Matarazzo and Słowiński (2001).[15] The main change in this extension of classical rough sets is the substitution of the indiscernibility relation by a dominance relation, which permits the formalism to deal with inconsistencies typical in consideration of criteria and preference-ordered decision classes.
- Decision-theoretic rough sets (DTRS) is a probabilistic extension of rough set theory introduced by Yao, Wong, and Lingras (1990).[16] It utilizes a Bayesian decision procedure for minimum risk decision making. Elements are included into the lower and upper approximations based on whether their conditional probability is above thresholds $\alpha$ and $\beta$. These upper and lower thresholds determine region inclusion for elements. This model is unique and powerful since the thresholds themselves are calculated from a set of six loss functions representing classification risks.
- Game-theoretic rough sets (GTRS) is a game theory-based extension of rough sets that was introduced by Herbert and Yao (2011).[17] It utilizes a game-theoretic environment to optimize certain criteria of rough set-based classification or decision making in order to obtain effective region sizes.
Rough membership
Rough sets can also be defined, as a generalisation, by employing a rough membership function instead of objective approximation. The rough membership function expresses a conditional probability that $x$ belongs to $X$ given $R$. This can be interpreted as a degree that $x$ belongs to $X$ in terms of information about $x$ expressed by $R$.
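One common way to make this concrete is to take the membership of an object to be the fraction of its equivalence class that lies inside the target set. The sketch below uses that form as an assumption, together with the sample table (rows labelled O1 to O10), the target set {O1, O2, O3, O4} and the full attribute set; `rough_membership` is an illustrative name.

```python
from fractions import Fraction

# Sample table; object labels O1..O10 and column names P1..P5 are assumed.
ATTRS = ("P1", "P2", "P3", "P4", "P5")
ROWS = {
    "O1": (1, 2, 0, 1, 1), "O2": (1, 2, 0, 1, 1),
    "O3": (2, 0, 0, 1, 0), "O4": (0, 0, 1, 2, 1),
    "O5": (2, 1, 0, 2, 1), "O6": (0, 0, 1, 2, 2),
    "O7": (2, 0, 0, 1, 0), "O8": (0, 1, 2, 2, 1),
    "O9": (2, 1, 0, 2, 2), "O10": (2, 0, 0, 1, 0),
}


def rough_membership(obj, target, attrs):
    """Share of obj's equivalence class (under attrs) that lies in target."""
    cols = [ATTRS.index(a) for a in attrs]
    key = tuple(ROWS[obj][c] for c in cols)
    cls = {o for o, v in ROWS.items() if tuple(v[c] for c in cols) == key}
    return Fraction(len(cls & target), len(cls))


X = {"O1", "O2", "O3", "O4"}   # assumed target set
for obj in ("O1", "O3", "O7", "O5"):
    print(obj, rough_membership(obj, X, ATTRS))
```

Under this form, objects with membership 1 make up the lower approximation, objects with membership greater than 0 make up the upper approximation, and objects with membership 0 fall in the negative region.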
Rough membership primarily differs from the fuzzy membership in that the membership of union and intersection of sets cannot, in general, be computed from their constituent membership as is the case of fuzzy sets. In this, rough membership is a generalization of fuzzy membership. Furthermore, the rough membership function is grounded more in probability than the conventionally held concepts of the fuzzy membership function.
Other generalizations
Several generalizations of rough sets have been introduced, studied and applied to solving problems. Here are some of these generalizations:
- Rough multisets[18]
- Fuzzy rough sets extend the rough set concept through the use of fuzzy equivalence classes[19]
- Alpha rough set theory (α-RST), a generalization of rough set theory that allows approximation using fuzzy concepts[20]
- Intuitionistic fuzzy rough sets[21]
- Generalized rough fuzzy sets[22] [23]
- Rough intuitionistic fuzzy sets[24]
- Soft rough fuzzy sets and soft fuzzy rough sets[25]
- Composite rough sets[26]
Further reading
- Gianpiero Cattaneo and Davide Ciucci, "Heyting Wajsberg Algebras as an Abstract Environment Linking Fuzzy and Rough Sets" in J.J. Alpigini et al. (Eds.): RSCTC 2002, LNAI 2475, pp. 77–84, 2002.
- Pawlak, Zdzisław (1982). "Rough sets". International Journal of Parallel Programming. 11 (5): 341–356. doi:10.1007/BF01001956. S2CID 9240608.
- Pawlak, Zdzisław (1981). Rough Sets. Research Report PAS 431, Institute of Computer Science, Polish Academy of Sciences.
- Dubois, D.; Prade, H. (1990). "Rough fuzzy sets and fuzzy rough sets". International Journal of General Systems. 17 (2–3): 191–209. doi:10.1080/03081079008935107.
- Slezak, Dominik; Wroblewski, Jakub; Eastwood, Victoria; Synak, Piotr (2008). "Brighthouse: an analytic data warehouse for ad-hoc queries". Proceedings of the VLDB Endowment. 1 (2): 1337–1345. doi:10.14778/1454159.1454174.
- Ziarko, Wojciech (1998). "Rough sets as a methodology for data mining". Rough Sets in Knowledge Discovery 1: Methodology and Applications. Heidelberg: Physica-Verlag. pp. 554–576.
- Pawlak, Zdzisław (1999). "Decision rules, Bayes' rule and rough sets". New Direction in Rough Sets, Data Mining, and Granular-soft Computing. pp. 1–9. doi:10.1007/978-3-540-48061-7_1.
- Pawlak, Zdzisław. Rough relations, reports. Institute of Computer Science. 435(3):205–218.
- Orłowska, E. (1987). "Reasoning about vague concepts". Bulletin of the Polish Academy of Sciences. 35: 643–652.
- Polkowski, L. (2002). "Rough sets: Mathematical foundations". Advances in Soft Computing.
- Skowron, A. (1996). "Rough sets and vague concepts". Fundamenta Informaticae. pp. 417–431.
- Zhang J., Wong J-S, Pan Y, Li T. (2015). A parallel matrix-based method for computing approximations in incomplete information systems, IEEE Transactions on Knowledge and Data Engineering, 27(2): 326-339
- Burgin M. (1990). Theory of Named Sets as a Foundational Basis for Mathematics, In Structures in mathematical theories: Reports of the San Sebastian international symposium, September 25–29, 1990 (http://www.blogg.org/blog-30140-date-2005-10-26.html)
- Burgin, M. (2004). Unified Foundations of Mathematics, Preprint Mathematics LO/0403186, p39. (electronic edition: https://arxiv.org/ftp/math/papers/0403/0403186.pdf)
- Burgin, M. (2011). Theory of Named Sets, Mathematics Research Developments, Nova Science Pub Inc.
- Chen H., Li T., Luo C., Horng S-J., Wang G. (2015). A decision-theoretic rough set approach for dynamic data mining. IEEE Transactions on Fuzzy Systems, 23(6): 1958-1970
- Chen H., Li T., Luo C., Horng S-J., Wang G. (2014). A rough set-based method for updating decision rules on attribute values' coarsening and refining, IEEE Transactions on Knowledge and Data Engineering, 26(12): 2886-2899
- Chen H., Li T., Ruan D., Lin J., Hu C, (2013) A rough-set based incremental approach for updating approximations under dynamic maintenance environments. IEEE Transactions on Knowledge and Data Engineering, 25(2): 274-284
Notes and References
- Book: Pawlak
, Zdzisław
. Zdzisław Pawlak . Rough Sets: Theoretical Aspects of Reasoning About Data . Kluwer Academic Publishing . 1991 . Dordrecht . 978-0-7923-1472-1.
- Pawlak . Zdzisław . Zdzisław Pawlak . Grzymala-Busse . Jerzy . Słowiński . Roman . Ziarko . Wojciech . 1 November 1995 . Rough sets . Communications of the ACM . 38 . 11 . 88-95 . 10.1145/219717.219791 .
- Düntsch . Ivo . Gediga . Günther . 1995 . Rough set dependency analysis in evaluation studies: An application in the study of repeated heart attacks . University of Ulster . Informatics Research Reports . 10 . 25-30 .
- Pawlak, Zdzisław; Wong, S. K. M.; Ziarko, Wojciech (1988). Rough sets: Probabilistic versus deterministic approach. International Journal of Man-Machine Studies, 29(1): 81–95. doi:10.1016/S0020-7373(88)80032-4.
- Yao, J. T.; Yao, Y. Y. (2002). Induction of classification rules by granular computing. Proceedings of the Third International Conference on Rough Sets and Current Trends in Computing (RSCTC'02). London, UK: Springer-Verlag. pp. 331–338. doi:10.1007/3-540-45813-1_43.
- Wong, S. K. M.; Ziarko, Wojciech; Ye, R. Li (1986). Comparison of rough-set and statistical methods in inductive learning. International Journal of Man-Machine Studies, 24: 53–72. doi:10.1016/S0020-7373(86)80033-5.
- Quafafou, Mohamed; Boussouf, Moussa (1 January 2000). Generalized rough sets based feature selection. Intelligent Data Analysis, 4(1): 3–17. doi:10.3233/IDA-2000-4102.
- Ziarko, Wojciech; Shan, Ning (1995). Discovering attribute relationships, dependencies and rules by using rough sets. Proceedings of the 28th Annual Hawaii International Conference on System Sciences (HICSS'95). Hawaii. pp. 293–299.
- Grzymala-Busse, Jerzy (1997). A new version of the rule induction system LERS. Fundamenta Informaticae, 31(1): 27–39. doi:10.3233/FI-1997-3113.
- Bazan, Jan; Szczuka, Marcin; Wojna, Arkadiusz; Wojnarski, Marcin (2004). On the Evolution of Rough Set Exploration System. Rough Sets and Current Trends in Computing. Lecture Notes in Computer Science, vol. 3066. pp. 592–601. doi:10.1007/978-3-540-25929-9_73. ISBN 978-3-540-22117-3. CiteSeerX 10.1.1.60.3957.
- Stefanowski, Jerzy (1998). On rough set based approaches to induction of decision rules. In Polkowski, Lech (ed.). Rough Sets in Knowledge Discovery 1: Methodology and Applications. Heidelberg: Physica-Verlag. pp. 500–529. ISBN 978-3-7908-1884-0.
- Stefanowski, Jerzy; Tsoukias, Alexis (2001). Incomplete information tables and rough classification. Computational Intelligence, 17(3): 545–566. doi:10.1111/0824-7935.00162. S2CID 22795201.
- Kryszkiewicz, Marzena (1999). Rules in incomplete systems. Information Sciences, 113(3–4): 271–292. doi:10.1016/S0020-0255(98)10065-8.
- Grzymala-Busse, Jerzy; Grzymala-Busse, Witold (2007). An Experimental Comparison of Three Rough Set Approaches to Missing Attribute Values. Transactions on Rough Sets, vol. VI. Lecture Notes in Computer Science. pp. 31–50. doi:10.1007/978-3-540-71200-8_3. ISBN 978-3-540-71198-8.
- Greco, Salvatore; Matarazzo, Benedetto; Słowiński, Roman (2001). Rough sets theory for multicriteria decision analysis. European Journal of Operational Research, 129(1): 1–47. doi:10.1016/S0377-2217(00)00167-3. S2CID 12045346.
- Yao, Y. Y.; Wong, S. K. M.; Lingras, P. (1990). A decision-theoretic rough set model. Methodologies for Intelligent Systems, 5, Proceedings of the 5th International Symposium on Methodologies for Intelligent Systems. Knoxville, Tennessee, USA: North-Holland. pp. 17–25.
- Herbert, Joseph P.; Yao, JingTao (2011). Game-theoretic rough sets. Fundamenta Informaticae, 108(3–4): 267–286. doi:10.3233/FI-2011-423.
- Grzymala-Busse, Jerzy (1 December 1987). In Raś, Zbigniew W.; Zemankova, Maria (eds.). Proceedings of the Second International Symposium on Methodologies for Intelligent Systems. Charlotte, NC, USA. Amsterdam, Netherlands: North-Holland Publishing Co. pp. 325–332. ISBN 978-0-444-01295-1.
- Nakamura, A. (1988). Fuzzy rough sets. Notes on Multiple-Valued Logic in Japan, 9(1): 1–8.
- Quafafou, Mohamed (May 2000). α-RST: a generalization of rough set theory. Information Sciences, 124(1–4): 301–316. doi:10.1016/S0020-0255(99)00075-4.
- Cornelis, Chris; De Cock, Martine; Kerre, Etienne E. (November 2003). Intuitionistic fuzzy rough sets: at the crossroads of imperfect knowledge. Expert Systems, 20(5): 260–270. doi:10.1111/1468-0394.00250.
- Feng, Feng (2009). Generalized Rough Fuzzy Sets Based on Soft Sets. 2009 International Workshop on Intelligent Systems and Applications. Wuhan, China: IEEE. pp. 1–4. doi:10.1109/IWISA.2009.5072885.
- Feng, Feng; Li, Changxing; Davvaz, B.; Ali, M. Irfan (July 2010). Soft sets combined with fuzzy sets and rough sets: a tentative approach. Soft Computing, 14(9): 899–911. doi:10.1007/s00500-009-0465-6.
- Thomas, K. V.; Nair, Latha S. (2011). Rough intuitionistic fuzzy sets in a lattice. International Mathematics Forum, 6(27): 1327–1335. Retrieved 24 October 2024.
- Meng, Dan; Zhang, Xiaohong; Qin, Keyun (December 2011). Soft rough fuzzy sets and soft fuzzy rough sets. Computers & Mathematics with Applications, 62(12): 4635–4645. doi:10.1016/j.camwa.2011.10.049.
- Zhang, Junbo; Li, Tianrui; Chen, Hongmei (1 February 2014). Composite rough sets for dynamic data mining. Information Sciences, 257: 81–100. doi:10.1016/j.ins.2013.08.016.