General Concept Lattice Explained
The General Concept Lattice (GCL) proposes a novel general construction of concept hierarchy from formal context, where the conventional Formal Concept Lattice based on Formal Concept Analysis (FCA) only serves as a substructure.[1]
The formal context is a data table of heterogeneous relations illustrating how objects carry attributes. By analogy with a truth-value table, every formal context can be developed into a fully extended version that includes all the columns corresponding to attributes constructed, by means of Boolean operations, out of the given attribute set. The GCL is based on the extended formal context, which captures the full information content of the formal context in the sense that it incorporates whatever the formal context consistently implies. Notably, different formal contexts may give rise to the same extended formal context.
Background
The GCL claims to take the extended formal context into account in order to preserve information content. Consider describing a three-ball system (3BS) with three distinct colours, e.g., red, green and blue. According to Table 1, one may refer to different attribute sets to reach different formal contexts. The concept hierarchy for the 3BS is supposed to be unique regardless of how the 3BS is described. However, the FCA exhibits different formal concept lattices subject to the chosen formal contexts for the 3BS, see Fig. 1. In contrast, the GCL is an invariant lattice structure with respect to these formal contexts since they can infer each other and ultimately entail the same information content.
In information science, the Formal Concept Analysis (FCA) promises practical applications in various fields based on the following fundamental characteristics.
- It orders the formal concepts in a hierarchy i.e. the formal concept lattice (FCL) which can be visualized as a line diagram that may be helpful for understanding the data.
- It enables the attribute exploration,[2] a knowledge acquisition technique based on implications. It is possible to acquire the canonical (Guigues-Duquenne[3]) basis, the non-redundant collection of informative implications based on which valid implications available from the formal context can be derived by the Armstrong rules.
The FCL does not appear to be the only lattice applicable to the interpretation of data table. Alternative concept lattices subject to different derivation operators based on the notions relevant to the Rough Set Analysis have also been proposed.[4] [5] Specifically, the object-oriented concept lattice, which is referred to as the rough set lattice[6] (RSL) afterwards, is found to be particularly instructive to supplement the standard FCA in further understandings of the formal context.
- The FCL categorises object classes according to the properties they have in common, while the RSL categorises them according to the properties that other classes do not possess.
- The RSL provides an alternative scheme for implications available from the formal context which are beyond the scope of FCL, as will be clarified later.
Consequently, there are two crucial points to be contemplated.
- The FCL and RSL reflect different concept hierarchies interpreting the same formal context in a complementary way. However, similar to the case of FCL, RSL also suffers from different lattice structures varying with respect to the chosen formal contexts, see Fig. 2.
- The implication relations extracted via the RSL from the formal context signify a different part of the logic content from the ones extractable via the FCL. The treatment via the RSL would require a further construction, namely a Guigues-Duquenne basis for the RSL. Moreover, there is no guarantee that the implications of the two together exhaust the full logic content.
The GCL provides a sound theoretical foundation for the concept hierarchies acquired from a formal context. Maintaining the generality that preserves the information, the GCL underlies both the FCL and RSL, which correspond to substructures at particular restrictions. Technically, the GCL reduces to the FCL and RSL when restricted to conjunctions and disjunctions, respectively, of elements in the referred attribute set (M). In addition, the GCL unveils extra information complementary to the results via the FCL and RSL. Surprisingly, the implementation of a formal context via the GCL is much more manageable than those via the FCL and RSL.
Related mathematical formulations
Algebras of derivation operators
The derivation operators constitute the building blocks of concept lattices and thus deserve distinctive notations. Subject to a formal context concerning the object set G and attribute set M, the operators
I :
\begin{array}{l}
X \subseteq G \mapsto X^{I} = \lbrace m \in M \mid gRm,\ \forall g \in X \rbrace \subseteq M \\
Y \subseteq M \mapsto Y^{I} = \lbrace g \in G \mid gRm,\ \forall m \in Y \rbrace \subseteq G
\end{array},
\Box :
\begin{array}{l}
X \subseteq G \mapsto X^{\Box} = \lbrace m \in M \mid \forall g \in G,\ gRm \implies g \in X \rbrace \subseteq M \\
Y \subseteq M \mapsto Y^{\Box} = \lbrace g \in G \mid \forall m \in M,\ gRm \implies m \in Y \rbrace \subseteq G
\end{array},
\Diamond :
\begin{array}{l}
X \subseteq G \mapsto X^{\Diamond} = \lbrace m \in M \mid \exists g \in G,\ (gRm,\ g \in X) \rbrace \subseteq M \\
Y \subseteq M \mapsto Y^{\Diamond} = \lbrace g \in G \mid \exists m \in M,\ (gRm,\ m \in Y) \rbrace \subseteq G
\end{array}
are considered as different modal operators (Sufficiency, Necessity and Possibility, respectively) that generalise the FCA. For notations, I, the operator adopted in the standard FCA, follows B. Ganter[7] and R. Wille; \Box as well as \Diamond follows Y. Y. Yao. By gRm, i.e., (g, m) \in R, the object g carries the attribute m as its property, which is also referred to as g \in m^{R}, where m^{R} is the set of all objects carrying the attribute m.
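The three operators can be tried out on a small example. The following is a minimal Python sketch that transcribes the set-builder definitions above. It assumes a hypothetical four-object context: the objects g1 to g4, the attributes a, b, c and all function names are illustrative and are not taken from the article's tables.

```python
# Hypothetical toy context, not one of the article's tables.
G = {"g1", "g2", "g3", "g4"}
M = {"a", "b", "c"}
R = {("g1", "a"), ("g1", "b"), ("g2", "b"),
     ("g3", "b"), ("g3", "c"), ("g4", "c")}


def suff_objects(X):
    """X^I: attributes carried by every object in X (sufficiency)."""
    return {m for m in M if all((g, m) in R for g in X)}


def suff_attributes(Y):
    """Y^I: objects carrying every attribute in Y."""
    return {g for g in G if all((g, m) in R for m in Y)}


def nec_objects(X):
    """X^Box: attributes whose carriers all lie in X (necessity)."""
    return {m for m in M if all(g in X for g in G if (g, m) in R)}


def nec_attributes(Y):
    """Y^Box: objects whose attributes all lie in Y."""
    return {g for g in G if all(m in Y for m in M if (g, m) in R)}


def pos_objects(X):
    """X^Diamond: attributes carried by at least one object in X (possibility)."""
    return {m for m in M if any((g, m) in R for g in X)}


def pos_attributes(Y):
    """Y^Diamond: objects carrying at least one attribute in Y."""
    return {g for g in G if any((g, m) in R for m in Y)}


X = {"g1", "g2"}
print(sorted(suff_objects(X)), sorted(nec_objects(X)), sorted(pos_objects(X)))
# prints: ['b'] ['a'] ['a', 'b']
```

For the object set {g1, g2}, only b is shared by both objects, only a is carried exclusively inside the set, and a and b are each carried by some member.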
With X, X_1, X_2 \subseteq G and X^{c} := G \backslash X it is straightforward to check that
X^{III} = X^{I},
\begin{array}{c}
X^{\Box\Diamond\Box} = X^{\Box} \\
X^{\Diamond\Box\Diamond} = X^{\Diamond}
\end{array},
\begin{array}{c}
X^{c\Box c} = X^{\Diamond} \\
X^{c\Diamond c} = X^{\Box}
\end{array},
X_1 \subseteq X_2 \implies X_2^{I} \subseteq X_1^{I},
\begin{array}{c}
X_1 \subseteq X_2 \implies X_1^{\Box} \subseteq X_2^{\Box} \\
X_1 \subseteq X_2 \implies X_1^{\Diamond} \subseteq X_2^{\Diamond}
\end{array},
where the same relations hold if given in terms of Y, Y_1, Y_2 \subseteq M and Y^{c} := M \backslash Y.
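The algebraic relations among the derivation operators can be checked by brute force on a small context. The sketch below is only an illustration under assumptions: the toy context and the helper names (`up_*` for object sets, `down_*` for attribute sets) are hypothetical, and the check covers this one context rather than proving the relations in general.

```python
from itertools import combinations

# Hypothetical toy context, not one of the article's tables.
G = frozenset({"g1", "g2", "g3", "g4"})
M = frozenset({"a", "b", "c"})
R = {("g1", "a"), ("g1", "b"), ("g2", "b"),
     ("g3", "b"), ("g3", "c"), ("g4", "c")}


def powerset(S):
    items = sorted(S)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]


# Operators on object sets (results are attribute sets).
def up_I(X):
    return frozenset(m for m in M if all((g, m) in R for g in X))

def up_box(X):
    return frozenset(m for m in M if all(g in X for g in G if (g, m) in R))

def up_dia(X):
    return frozenset(m for m in M if any((g, m) in R for g in X))


# Operators on attribute sets (results are object sets).
def down_I(Y):
    return frozenset(g for g in G if all((g, m) in R for m in Y))

def down_box(Y):
    return frozenset(g for g in G if all(m in Y for m in M if (g, m) in R))

def down_dia(Y):
    return frozenset(g for g in G if any((g, m) in R for m in Y))


def identities_hold():
    """Check the closure and duality identities for every X in the power set of G."""
    for X in powerset(G):
        Xc = G - X
        if up_I(down_I(up_I(X))) != up_I(X):
            return False
        if up_box(down_dia(up_box(X))) != up_box(X):
            return False
        if up_dia(down_box(up_dia(X))) != up_dia(X):
            return False
        if M - up_box(Xc) != up_dia(X) or M - up_dia(Xc) != up_box(X):
            return False
    return True


def monotonicity_holds():
    """X1 <= X2 reverses inclusion under I and preserves it under Box and Diamond."""
    for X1 in powerset(G):
        for X2 in powerset(G):
            if X1 <= X2 and not (up_I(X2) <= up_I(X1)
                                 and up_box(X1) <= up_box(X2)
                                 and up_dia(X1) <= up_dia(X2)):
                return False
    return True


print(identities_hold(), monotonicity_holds())
```

In this toy context the converse of monotonicity fails for arbitrary subsets: the necessity image of {g2} is empty and hence contained in that of {g1}, although {g2} is not a subset of {g1}.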
Two Galois lattices
Galois connections
From the above algebras, there exist different types of Galois connections, e.g., (1) X \subseteq Y^{I} \iff Y \subseteq X^{I}, (2) Y^{\Diamond} \subseteq X \iff Y \subseteq X^{\Box} and (3) X^{\Diamond} \subseteq Y \iff X \subseteq Y^{\Box}, which corresponds to (2) when the roles of X and Y are interchanged. Note that (1) and (2) enable different object-oriented constructions for the concept hierarchies FCL and RSL, respectively. Note that (3) corresponds to the attribute-oriented construction where the roles of object and attribute in the RSL are exchanged. The FCL and RSL apply to different 2-tuple (X, Y) concept collections that manifest different well-defined partial orderings.
Two concept hierarchies
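Given as a concept, the 2-tuple (X, Y) is in general constituted by an extent X \subseteq G and an intent Y \subseteq M, which should be distinguished when applied to FCL and RSL. The concept (X, Y)_{fcl} is furnished by X^{I} = Y and Y^{I} = X based on (1), while (X, Y)_{rsl} is furnished by X^{\Box} = Y and Y^{\Diamond} = X based on (2). In essence, there are two Galois lattices based on different orderings of the two collections of concepts as follows.
(X_1, Y_1)_{fcl} \leq (X_2, Y_2)_{fcl} entails X_1 \subseteq X_2 and Y_1 \supseteq Y_2, since X_1 \subseteq X_2 iff X_1^{I} \supseteq X_2^{I}, and Y_1 \subseteq Y_2 iff Y_1^{I} \supseteq Y_2^{I}.
(X_1, Y_1)_{rsl} \leq (X_2, Y_2)_{rsl} entails X_1 \subseteq X_2 and Y_1 \subseteq Y_2, since X_1 \subseteq X_2 iff X_1^{\Box} \subseteq X_2^{\Box}, and Y_1 \subseteq Y_2 iff Y_1^{\Diamond} \subseteq Y_2^{\Diamond}.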
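To make the two concept hierarchies concrete, both collections of concepts can be enumerated for a small context. The following Python sketch assumes a hypothetical toy context (the objects, attributes and function names are illustrative only). It builds every FCL concept as (Y^I, Y^II) and every RSL concept as (Y^Diamond, Y^(Diamond Box)) for Y ranging over the subsets of M.

```python
from itertools import combinations

# Hypothetical toy context, not one of the article's tables.
G = ("g1", "g2", "g3", "g4")
M = ("a", "b", "c")
R = {("g1", "a"), ("g1", "b"), ("g2", "b"),
     ("g3", "b"), ("g3", "c"), ("g4", "c")}


def subsets(items):
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]


def up_I(X):      # X^I
    return frozenset(m for m in M if all((g, m) in R for g in X))

def down_I(Y):    # Y^I
    return frozenset(g for g in G if all((g, m) in R for m in Y))

def up_box(X):    # X^Box
    return frozenset(m for m in M if all(g in X for g in G if (g, m) in R))

def down_dia(Y):  # Y^Diamond
    return frozenset(g for g in G if any((g, m) in R for m in Y))


# FCL concepts (X, Y) satisfy X^I = Y and Y^I = X.
fcl = {(down_I(Y), up_I(down_I(Y))) for Y in subsets(M)}
# RSL concepts (X, Y) satisfy X^Box = Y and Y^Diamond = X.
rsl = {(down_dia(Y), up_box(down_dia(Y))) for Y in subsets(M)}

fcl_extents = {X for X, _ in fcl}
rsl_extents = {X for X, _ in rsl}

for X in sorted(fcl_extents ^ rsl_extents, key=sorted):
    side = "FCL only" if X in fcl_extents else "RSL only"
    print(sorted(X), side)
```

In this toy context each lattice has six concepts, five extents are shared, and each lattice has one extent the other lacks.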
Common extents of FCL and RSL
Every attribute listed in the formal context provides an extent for the FCL and RSL simultaneously via the object set carrying the attribute. Though the extents for the FCL and for the RSL do not coincide totally, every m^{R} for m \in M is known to be a common extent of the FCL and RSL. This follows from the main results in the FCL and RSL: every Y^{I} (Y \subseteq M) is an extent for the FCL and Y^{\Diamond} is an extent for the RSL. Note that choosing Y = \lbrace m \rbrace gives rise to Y^{I} = Y^{\Diamond} = m^{R}.
Two types of informative implications
The attribute set-to-set implication from A to B (A, B \subseteq M) via the FCL has an intuitive interpretation: every object possessing all the attributes in A possesses all the attributes in B, in other words A^{I} \subseteq B^{I}. Alternatively, one may consider an implication from A to B based on the RSL in a similar manner: the set of all objects carrying any of the attributes in A is contained in the set of all objects carrying any of the attributes in B, in other words A^{\Diamond} \subseteq B^{\Diamond}. It is apparent that the two types of implication relate different pairs of attribute sets and are incapable of expressing each other.
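The difference between the two set-to-set readings can be seen on a small example. The sketch below assumes a hypothetical toy context, and the function names are illustrative. It tests the FCL-type condition A^I ⊆ B^I and the RSL-type condition A^Diamond ⊆ B^Diamond for the same pairs of attribute sets.

```python
# Hypothetical toy context, not one of the article's tables.
G = ("g1", "g2", "g3", "g4")
R = {("g1", "a"), ("g1", "b"), ("g2", "b"),
     ("g3", "b"), ("g3", "c"), ("g4", "c")}


def carrying_all(Y):
    """Y^I: objects carrying every attribute in Y."""
    return {g for g in G if all((g, m) in R for m in Y)}


def carrying_any(Y):
    """Y^Diamond: objects carrying at least one attribute in Y."""
    return {g for g in G if any((g, m) in R for m in Y)}


def holds_fcl_type(A, B):
    """FCL-type implication: A^I is contained in B^I."""
    return carrying_all(A) <= carrying_all(B)


def holds_rsl_type(A, B):
    """RSL-type implication: A^Diamond is contained in B^Diamond."""
    return carrying_any(A) <= carrying_any(B)


for A, B in [({"a"}, {"b"}), ({"a", "c"}, {"b"}), ({"a"}, {"b", "c"})]:
    print(sorted(A), "->", sorted(B), holds_fcl_type(A, B), holds_rsl_type(A, B))
```

With single-element sets the two readings coincide. For {a, c} and {b} only the FCL-type implication holds (vacuously, since no object carries both a and c), and for {a} and {b, c} only the RSL-type implication holds.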
Extension of formal context
For every formal context one may acquire its extended version deduced in the sense of completing a truth-value table. It is instructive to explicitly label the object/attribute dependence for the formal context, say, F(G,M) rather than F, since one may have to investigate more than one formal context. As is illustrated in Table 1, F_{\scriptscriptstyle 3BS}(G,M) can be employed to deduce the extended version F^{\ast}_{\scriptscriptstyle 3BS}(G,M^{\ast}), where M^{\ast} is the set of all attributes constructed out of elements in M
by means of Boolean operations. Note that
includes three columns reflecting the use of
and
the attribute set
.
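The idea that different formal contexts can share one extended formal context can be illustrated computationally. The sketch below relies on an assumption that is not spelled out in the text: a Boolean combination of attributes depends only on an object's row, so the columns of the extended context are exactly the unions of classes of objects with identical rows. The three-ball data used here is a hypothetical stand-in (one colour per ball) and is not claimed to reproduce Table 1.

```python
from itertools import combinations


def discernible_classes(objects, context):
    """Group objects whose rows in the context are identical."""
    classes = {}
    for g in objects:
        row = frozenset(m for m, carriers in context.items() if g in carriers)
        classes.setdefault(row, set()).add(g)
    return [frozenset(c) for c in classes.values()]


def extended_columns(objects, context):
    """Object sets of all Boolean combinations of attributes: unions of classes."""
    classes = discernible_classes(objects, context)
    columns = set()
    for r in range(len(classes) + 1):
        for combo in combinations(classes, r):
            columns.add(frozenset().union(*combo))
    return columns


balls = ("ball1", "ball2", "ball3")

# Hypothetical description by single colours (attribute -> set of carriers).
by_colour = {"red": {"ball1"}, "green": {"ball2"}, "blue": {"ball3"}}
# Hypothetical description by pairwise disjunctions of colours.
by_pairs = {
    "red+green": {"ball1", "ball2"},
    "green+blue": {"ball2", "ball3"},
    "blue+red": {"ball3", "ball1"},
}
# A coarser description that cannot tell ball2 from ball3.
red_only = {"red": {"ball1"}}

print(extended_columns(balls, by_colour) == extended_columns(balls, by_pairs))
print(len(extended_columns(balls, by_colour)), len(extended_columns(balls, red_only)))
```

The first two descriptions discern the same minimal object sets and therefore yield the same eight columns, whereas the coarser description yields only four.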
Obtaining the general concept lattice
Observations based on mathematical facts
Intents in terms of single attributes
The FCL and RSL will not be altered if their intents are interpreted as single attributes.
(X,Y)_{fcl} can be understood as (X,\mu)_{fcl} with \mu = \prod Y (the conjunction of all elements in Y),
\begin{smallmatrix}\prod X^{I} = \mu \\
\mu^{R} = X\end{smallmatrix}
plays the role of
\begin{smallmatrix}X^{I} = Y \\
Y^{I} = X\end{smallmatrix}
since (\prod Y)^{R} = Y^{I}.
(X,Y)_{rsl} can be understood as (X,\mu)_{rsl} with \mu = \sum Y (the disjunction of all elements in Y),
\begin{smallmatrix}\sum X^{\Box} = \mu \\
\mu^{R} = X\end{smallmatrix}
plays the role of
\begin{smallmatrix}X^{\Box} = Y \\
Y^{\Diamond} = X\end{smallmatrix}
since (\sum Y)^{R} = Y^{\Diamond}.
Here, the dot product \prod stands for the conjunction (the dot is often omitted for compactness) and the summation \sum the disjunction, which are notations in the Curry-Howard style. Note that the orderings become
(X_1,\mu_1)_{fcl} \leq (X_2,\mu_2)_{fcl}
and
(X_1,\mu_1)_{rsl} \leq (X_2,\mu_2)_{rsl}
, both are implemented by
.
Implications from single attribute to single attribute
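Concerning the implications extracted from formal context, \mu_1 \rightarrow \mu_2 serves as the general form of implication relations available from the formal context, which holds for any pair of \mu_1, \mu_2 \in M^{\ast} fulfilling \mu_1^{R} \subseteq \mu_2^{R}. Note that \mu_1 \rightarrow \mu_2 turns out to be trivial if \mu_1 \leq \mu_2, which entails \mu_1^{R} \subseteq \mu_2^{R}. Intuitively, every object carrying \mu_1 is an object carrying \mu_2, which means the implication: any object having the property \mu_1 must also have the property \mu_2. In particular, the FCL-type implication from A to B can be interpreted as \mu_1 \rightarrow \mu_2 with \mu_1 = \prod A and \mu_2 = \prod B, and the RSL-type implication from A to B can be interpreted as \mu_1 \rightarrow \mu_2 with \mu_1 = \sum A and \mu_2 = \sum B, where A^{I} \subseteq B^{I} and A^{\Diamond} \subseteq B^{\Diamond} collapse into \mu_1^{R} \subseteq \mu_2^{R}.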
Lattice of 3-tuple concepts with double Galois connection
When extended to , the algebras of derivation operators remain formally unchanged, apart from the generalisation from to which is signified in terms of the replacements
,
and
. The concepts under consideration become then
and
, where
and
, which are constructions allowable by the two Galois connections i.e.
X\subseteq
\iffY\subseteq
and
, respectively. Henceforth,
and for
,
and for
.
The extents for the two concepts now coincide exactly. All the attributes in are listed in the formal context , each contributes a common extent for FCL and RSL. Furthermore, the collection of these common extents amounts to which exhausts all the possible unions of the minimal object sets discernible by the formal context. Note that each
collects
objects of the same property, see
Table 2. One may then join
and
into a 3-tuple with common extent:
where
,
and
.Note that
are introduced in order to differentiate the two intents. Clearly, the number of these 3-tuples equals the cardinality of set of common extent which counts
. Moreover,
manifests well-defined ordering. For
, where
and
,
iff
and and .
Emergence of the GCL
While it is generically impossible to determine subject to , the structure of concept hierarchy need not rely on these intents directly. An efficient way to implement the concept hierarchy for is to consider intents in terms of single attributes.
Let henceforth and . Upon introducing , one may check that and , . Therefore, , which is a closed interval bounded from below by and from above by since
\forall \mu\ \ \mu^{R} = X \implies \eta(X) \leq \mu \leq \rho(X)
. Moreover,
iff
,
iff
iff
.In addition,
, namely, the collection of intents
exhausts all the generalised attributes
, in comparison to
. Then, the
GCL enters as the lattice structure
based on the formal context via
:
- The collection of all the general concepts constitutes the poset ordered as
iff and and .
(meet) and
(join) operations are applicable for finding further lattice points:
l_1 \wedge l_2 = \left(X_1 \cap X_2, [X_1 \cap X_2]_{F}\right) \in L_{F}
, where
, where
- The GCL appears to be a complete lattice since both and can be found in :
, .
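The interval structure of the intents can be explored on a small context. The sketch below rests on a representation that is an assumption of this example and not notation from the article: a generalised attribute is encoded as the set of truth assignments (rows over M) on which it is true, so that the ordering of attributes becomes set inclusion. The toy context has pairwise distinct rows, so every subset of G is an extent. The join is taken on the union of extents, mirroring the meet on the intersection.

```python
from itertools import combinations, product

M = ("a", "b", "c")
# Hypothetical toy context: each object mapped to the attributes it carries.
ROWS = {
    "g1": frozenset({"a", "b"}),
    "g2": frozenset({"b"}),
    "g3": frozenset({"b", "c"}),
    "g4": frozenset({"c"}),
}
G = frozenset(ROWS)

# Assumed encoding: a generalised attribute is the set of rows where it is true.
ALL_ROWS = frozenset(
    frozenset(m for m, bit in zip(M, bits) if bit)
    for bits in product((0, 1), repeat=len(M))
)
UNREALISED = ALL_ROWS - frozenset(ROWS.values())


def powerset(items):
    items = list(items)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]


def extent(mu):
    """mu^R: objects whose row satisfies the generalised attribute mu."""
    return frozenset(g for g, row in ROWS.items() if row in mu)


def eta(X):
    """Lower bound of the intent: true exactly on the rows of objects in X."""
    return frozenset(ROWS[g] for g in X)


def rho(X):
    """Upper bound of the intent: eta(X) plus every row no object realises."""
    return eta(X) | UNREALISED


def meet(X1, X2):
    X = X1 & X2
    return X, (eta(X), rho(X))


def join(X1, X2):
    X = X1 | X2
    return X, (eta(X), rho(X))


nodes = [(X, (eta(X), rho(X))) for X in powerset(G)]
interval_size = sum(1 for mu in powerset(ALL_ROWS) if eta(G) <= mu <= rho(G))
print(len(nodes), interval_size)  # prints: 16 16
```

Under this encoding each of the 16 nodes carries an interval of 16 attributes, which together account for all 256 truth functions of three attributes.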
Consequence of the general concept lattice
Manageable general lattice
The construction of the FCL is known to rely on efficient algorithms,[8] [9] while the construction of the RSL has not yet received much attention. Intriguingly, though the GCL furnishes the general structure on which both the FCL and RSL can be rediscovered, the GCL can be acquired via simple readout.
Reading out the lattice
The completion of the GCL is equivalent to the completion of the intents of the GCL in terms of the lower and upper bounds.
- The lower bounds can be employed to determine the upper bounds , and vice versa. For concreteness, both and are extents of the GCL, coexists with . Subsequently, and , where .
- The lower bounds of intents corresponding to minimal discernible object sets (s for
) can be employed to determine all the intents. Note that and appears to be a direct readout by means of .The above enables the determinations of the intents depicted as in Fig. 3 for the 3BS given by Table 1, where one can read out that , and . Hence, e.g., , . Note that the GCL also appears to be a Hasse diagram due to the resemblance of its extents to a power set. Moreover, each intent at also exhibits another Hasse diagram isomorphic to the ordering of attributes in the closed interval . It can be shown that where with . Hence, making the cardinality a constant given as . Clearly, one may check that
Rediscovering FCL and RSL on the GCL
The GCL underlies the original FCL and RSL subject to , as one can tell from and . To rediscover a node for FCL, one looks for a conjunction of attributes in contained in , which can be identified within the conjunctive normal form of if exists. Likewise, for the RSL one looks for a disjunction of attributes in contained in , which can be found within the disjunctive normal form of , see Fig 3.
For instance, from the node on the GCL, one finds that . Note that appears to be the only attribute belonging to , which is simultaneously a conjunction and a disjunction. Therefore, both the FCL and RSL have the concept in common. To illustrate a different situation, . Apparently, is the attribute emerging as disjunction of elements in which belongs to , in which no attribute composed by conjunction of elements in is found. Hence, could not be an extent of FCL, it only constitutes the concept for the RSL.
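The search for conjunctions and disjunctions inside the intent intervals can be sketched in code. As an assumption of this example, a generalised attribute is encoded as the set of truth assignments (rows over M) on which it is true, and the toy context is hypothetical. An extent X is kept for the FCL when some conjunction of attributes in M lies between eta(X) and rho(X), and for the RSL when some disjunction does.

```python
from itertools import combinations, product

M = ("a", "b", "c")
# Hypothetical toy context: each object mapped to the attributes it carries.
ROWS = {
    "g1": frozenset({"a", "b"}),
    "g2": frozenset({"b"}),
    "g3": frozenset({"b", "c"}),
    "g4": frozenset({"c"}),
}
G = frozenset(ROWS)

ALL_ROWS = frozenset(
    frozenset(m for m, bit in zip(M, bits) if bit)
    for bits in product((0, 1), repeat=len(M))
)
UNREALISED = ALL_ROWS - frozenset(ROWS.values())


def powerset(items):
    items = list(items)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]


def eta(X):
    """Lower bound of the intent at extent X."""
    return frozenset(ROWS[g] for g in X)


def rho(X):
    """Upper bound of the intent at extent X."""
    return eta(X) | UNREALISED


def conj(Y):
    """Conjunction of the attributes in Y (empty Y gives the tautology)."""
    return frozenset(row for row in ALL_ROWS if Y <= row)


def disj(Y):
    """Disjunction of the attributes in Y (empty Y gives the contradiction)."""
    return frozenset(row for row in ALL_ROWS if Y & row)


def in_interval(mu, X):
    return eta(X) <= mu <= rho(X)


fcl_extents = {X for X in powerset(G)
               if any(in_interval(conj(Y), X) for Y in powerset(M))}
rsl_extents = {X for X in powerset(G)
               if any(in_interval(disj(Y), X) for Y in powerset(M))}

print(len(fcl_extents), len(rsl_extents), len(fcl_extents & rsl_extents))
```

In this toy context six of the sixteen GCL extents reappear in the FCL and six in the RSL, with five in common. The extent {g3} admits only a conjunction (b and c) and {g1, g3, g4} only a disjunction (a or c).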
Information content of a formal context
Informative implications as equivalence due to categorisation
Non-tautological implication relations signify the information contained in the formal context and are referred to as informative implications. In general, entails the implication . The implication is informative if it is (i.e. ).
In case it is strictly , one has where . Then, can be replaced by means of together with the tautology . Therefore, what remains to be taken into account is the equivalence for some . Logically, both attributes are properties carried by the same object class, reflects that equivalence relation.
All attributes in must be mutually implied, which can be implemented, e.g., by (in fact, where is a tautology), i.e., all attributes are equivalent to the lower bound of intent.
A formula that implements all the informative implications
Extraction of the implications of type from the formal context was known to be complicated,[10] [11] [12] [13] [14] it necessitates efforts for constructing a canonical basis, which does not apply to the implications of type . By contrast, the above equivalence only proposes
- the single formula generating all the informative implications:
, which can be restated as ,
is allowed by the formal context iff (or ).Hence, purely algebraic formulae can be employed to determine the implication relations, one need not consult the object-attribute dependence in the formal context, which is the typical effort in finding the canonical basis.
Remarkably, and are referred to as the contextual truth and falsity, respectively.
and
as well as and
similar to the conventional truth 1 and falsity 0 that can be identified with and , respectively.
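A purely algebraic implication test of this kind can be sketched as follows. Everything concrete here is an assumption of the example: generalised attributes are encoded as sets of truth assignments, the three-ball context gives each ball exactly one colour and is not claimed to reproduce Table 1, and `ONE_ETA` and `ZERO_RHO` are hypothetical labels for the contextual truth and falsity, identified with the realised rows and the unrealised rows respectively.

```python
from itertools import combinations, product

COLOURS = ("red", "green", "blue")
# Hypothetical context: each ball carries exactly one colour.
ROWS = {
    "ball1": frozenset({"red"}),
    "ball2": frozenset({"green"}),
    "ball3": frozenset({"blue"}),
}

ALL_ROWS = frozenset(
    frozenset(c for c, bit in zip(COLOURS, bits) if bit)
    for bits in product((0, 1), repeat=len(COLOURS))
)
ONE_ETA = frozenset(ROWS.values())   # contextual truth (assumed encoding)
ZERO_RHO = ALL_ROWS - ONE_ETA        # contextual falsity (assumed encoding)
TRUE, FALSE = ALL_ROWS, frozenset()


def var(name):
    """The attribute `name` as a truth function; & is AND, | is OR."""
    return frozenset(row for row in ALL_ROWS if name in row)


def negate(mu):
    return ALL_ROWS - mu


def extent(mu):
    """mu^R: objects whose row satisfies mu."""
    return frozenset(g for g, row in ROWS.items() if row in mu)


def allowed(mu1, mu2):
    """mu1 -> mu2 holds in the context iff mu1 AND contextual truth <= mu2."""
    return (mu1 & ONE_ETA) <= mu2


def allowed_alt(mu1, mu2):
    """Equivalent form: mu1 <= mu2 OR contextual falsity."""
    return mu1 <= (mu2 | ZERO_RHO)


red, green, blue = var("red"), var("green"), var("blue")
print(allowed(red & green, FALSE))        # no two colours coexist
print(allowed(TRUE, red | green | blue))  # every ball has one of the colours
print(allowed(red, green))                # not valid in this context
```

The first two checks return True and the third returns False. No lookup of object-attribute pairs is needed once the contextual truth is known.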
Beyond the set-to-set implications
and are found to be particular forms of . Assume and for both cases. By , an object set carrying all the attributes in implies carrying all the attributes in simultaneously, i.e. . By , an object set carrying any of the attributes in implies carrying some of the attributes in , therefore . Notably, the point of view conjunction-to-conjunction has also been emphasised by Ganter while dealing with the attribute exploration.
One could overlook significant parts of the logic content in formal context were it not for the consideration based on the GCL. Here, the formal context describing 3BS given in Table 1 suggests an extreme case where no implication of the type could be found. Nevertheless, one ends up, e.g., (or ), whose meaning appears to be ambiguous. Though it is true that , one also notices that as well as . Indeed, by using the above formula with the
provided in Fig. 2 it can be seen that , hence it is and that underlies .
Remarkably, the same formula will lead to (1) (or ) and (2) (or ), where , and can be interchanged. Hence, what one has captured from the 3BS are that (1) no two colours could coexist and that (2) there is no colour other than , and . The two issues are certainly less trivial in the scopes of and .
Rules to assemble or transform implications
The rules to assemble or transform implications of type are of direct consequences of object set inclusion relations. Notably, some of these rules can be reduced to the Armstrong axioms, which pertain to the main considerations of Guigues and Duquenne based on the non-redundant collection of informative implications acquired via FCL. In particular, (1) and since and leads to , i.e., .In the case of , , and , where are sets of attributes, the rule (1) can be re-expressed as Armstrong's composition: (1') and and .
The Armstrong axioms are not suited for which requires . This is in contrast to for which Armstrong's reflexivity is implemented by . Nevertheless, a similar composition may occur but signify a different rule from (1). Note that one also arrives at (2) and since and , which gives rise to (2') and whenever , , and .
Example
For concreteness, consider the example depicted by Table 2, which was originally adopted to clarify the RSL and is worked out here for the GCL.
The GCL structure and the identifications of FCL and RSL on the GCL
- The determinations of the nodes of GCL for Table 2 are straightforward, as is depicted in Fig.4. For example, one may read out
, , , and so forth.
Clearly, one may also check that .
- To rediscover the original FCL and RSL see Fig. 5. Observe, e.g.,
, .
Within the expression of it can be seen that , while within it can be seen . Therefore, one finds out the concepts for FCL and for RSL. By contrast,
,
with gives rise to the concept for FCL however fails to provide an extent for RSL because .
Implication relations in general
- The meanings of and are essentially different.
and denote and , respectively.
For the present case, the above relations can be examined via the auxiliary formula:
(or
),
(or
).
- and are equivalent when both
are reduced to sets of single element.
Both and , according to the formal context of Table 2, are interpreted as , which means based on and based on .
Note that . Moreover, entails both and , which correspond to and , respectively.
- The single formula suffices to generate all the informative implications, where one may choose any attribute in
as the antecedent or consequent. (1) With
one may infer the properties of objects of interest from the condition
by specifying
, thereby incorporating abundant informative implications as equivalent relations between any pair of attributes within the interval
, i.e.,
if
and
. Note that
entails
since
.
For instance, by the relation is neither of the type nor of the type . Nevertheless, one may also derive, e.g., , and , which are , and , respectively. As a further interesting implication entails by means of material implication. Namely, for the objects carrying the property or , must hold and, in addition, objects carrying the property must also carry the property and vice versa.
(1') Alternatively, the equivalent formula
can be employed to specify the objects of particular interest. In effect,
if
and
.
One may be interested in the properties inferring a particular consequent, say, . Consider giving rise to according to Table 2. Clearly, with
one has . This gives rise to many possible antecedents such as , , , and so forth.
(2)
governs all the implications extractable from the formal context by means of (1) and (1'). Indeed, it plays the role of canonical basis with
one single implication relation.
can be understood as or equivalently , which turns out be the only non-redundant implication one needs to deduce all the informative implications from any formal context. The basis or suffices the deduction of all implications as follows. While and , choosing either or gives rise to . Notably, this encompasses (1) and (1') by means of
\leq\rho(\muR)\equiv\mu+0\rho
for any
, where
can be identified with some corresponding to one of the 32 nodes on the GCL in Fig. 4. develops equivalence, at each single node, for all attributes contained within the interval . Moreover, informative implications could also relate different nodes via Hypothetical syllogism by invoking tautology. Typically, whenever . This corresponds to the cases considered in (1'): , , etc. Explicitly, is based upon and where . Note that and while (also ). Therefore, . Similarly, with gives .
Indeed, or equivalently plays the role of canonical basis with one single implication relation.
Notes and References
- Ganter, Bernhard; Wille, Rudolf (1999). Formal Concept Analysis. ISBN 978-3-540-62771-5. doi:10.1007/978-3-642-59830-2. S2CID 262487114. Archived at https://web.archive.org/web/20240225153503/https://link.springer.com/book/10.1007/978-3-642-59830-2 on 2024-02-25. Retrieved 2023-07-16.
- Ganter, Bernhard; Obiedkov, Sergei (2016). Conceptual exploration. Berlin: Springer-Verlag. ISBN 978-3-662-49290-1.
- Guigues, J. L.; Duquenne, V. (1986). "Familles minimales d'implications informatives résultant d'un tableau de données binaires". Mathématiques et Sciences Humaines. 95: 5–18. ISSN 0987-6936. Archived at https://web.archive.org/web/20220419154311/https://eudml.org/doc/94331 on 2022-04-19. Retrieved 2023-07-19.
- Duntsch, N.; Gediga, G. (2002). "Modal-style operators in qualitative data analysis". 2002 IEEE International Conference on Data Mining, 2002. Proceedings. IEEE Comput. Soc. pp. 155–162. doi:10.1109/ICDM.2002.1183898. ISBN 978-0-7695-1754-4. S2CID 13170017. https://ieeexplore.ieee.org/document/1183898. Archived at https://web.archive.org/web/20231206170723/http://ieeexplore.ieee.org/document/1183898/ on 2023-12-06. Retrieved 2024-01-07.
- Yao, Y.Y. (2004). "Concept lattices in rough set theory". IEEE Annual Meeting of the Fuzzy Information, 2004. Processing NAFIPS '04. IEEE. pp. 796-801 Vol.2. doi:10.1109/nafips.2004.1337404. ISBN 0-7803-8376-1. S2CID 6716057. Archived at https://web.archive.org/web/20240225153447/https://ieeexplore.ieee.org/document/1337404/ on 2024-02-25. Retrieved 2023-07-19.
- Liaw, Tsong-Ming; Lin, Simon C. (2020-10-12). "A general theory of concept lattice with tractable implication exploration". Theoretical Computer Science. 837: 84–114. doi:10.1016/j.tcs.2020.05.014. S2CID 219514253. ISSN 0304-3975. Archived at https://web.archive.org/web/20200528022615/https://www.sciencedirect.com/science/article/pii/S0304397520302826 on 2020-05-28. Retrieved 2023-07-19.
- Ganter, Bernhard (1999-04-06). "Attribute exploration with background knowledge". Theoretical Computer Science. ORDAL'96. 217 (2): 215–233. doi:10.1016/S0304-3975(98)00271-0. ISSN 0304-3975.
- Kuznetsov, Sergei O. (2001-12-01). "On Computing the Size of a Lattice and Related Decision Problems". Order. 18 (4): 313–321. doi:10.1023/A:1013970520933. S2CID 11571279. ISSN 1572-9273. Archived at https://web.archive.org/web/20240225153445/https://link.springer.com/article/10.1023/A:1013970520933 on 2024-02-25. Retrieved 2023-08-12.
- Kuznetsov, Sergei O.; Obiedkov, Sergei A. (April 2002). "Comparing performance of algorithms for generating concept lattices". Journal of Experimental & Theoretical Artificial Intelligence. 14 (2–3): 189–216. doi:10.1080/09528130210164170. S2CID 10784843. ISSN 0952-813X. Archived at https://web.archive.org/web/20231017163804/http://www.tandfonline.com/doi/abs/10.1080/09528130210164170 on 2023-10-17. Retrieved 2023-08-12.
- Kuznetsov, Sergei O.; Obiedkov, Sergei (2008-06-06). "Some decision and counting problems of the Duquenne–Guigues basis of implications". Discrete Applied Mathematics. In Memory of Leonid Khachiyan (1952–2005). 156 (11): 1994–2003. doi:10.1016/j.dam.2007.04.014. ISSN 0166-218X.
- Sertkaya, Barış (2009). "Towards the Complexity of Recognizing Pseudo-intents". In Rudolph, Sebastian; Dau, Frithjof; Kuznetsov, Sergei O. (eds.). Conceptual Structures: Leveraging Semantic Technologies. Lecture Notes in Computer Science. Vol. 5662. Berlin, Heidelberg: Springer. pp. 284–292. doi:10.1007/978-3-642-03079-6_22. ISBN 978-3-642-03079-6. https://link.springer.com/chapter/10.1007/978-3-642-03079-6_22. Archived at https://web.archive.org/web/20230904125549/https://link.springer.com/chapter/10.1007/978-3-642-03079-6_22 on 2023-09-04. Retrieved 2023-09-04.
- Distel, Felix (2010). "Hardness of Enumerating Pseudo-intents in the Lectic Order". In Kwuida, Léonard; Sertkaya, Barış (eds.). Formal Concept Analysis. Lecture Notes in Computer Science. Vol. 5986. Berlin, Heidelberg: Springer. pp. 124–137. doi:10.1007/978-3-642-11928-6_9. ISBN 978-3-642-11928-6. https://link.springer.com/chapter/10.1007/978-3-642-11928-6_9. Archived at https://web.archive.org/web/20230904125546/https://link.springer.com/chapter/10.1007/978-3-642-11928-6_9 on 2023-09-04. Retrieved 2023-09-04.
- Distel, Felix; Sertkaya, Barış (2011-03-28). "On the complexity of enumerating pseudo-intents". Discrete Applied Mathematics. 159 (6): 450–466. doi:10.1016/j.dam.2010.12.004. S2CID 17769297. ISSN 0166-218X. Archived at https://web.archive.org/web/20230904125545/https://www.sciencedirect.com/science/article/pii/S0166218X10004105 on 2023-09-04. Retrieved 2023-09-04.
- Babin, Mikhail A.; Kuznetsov, Sergei O. (2013-04-01). "Computing premises of a minimal cover of functional dependencies is intractable". Discrete Applied Mathematics. 161 (6): 742–749. doi:10.1016/j.dam.2012.10.026. ISSN 0166-218X.