Incremental decision tree explained

An incremental decision tree algorithm is an online machine learning algorithm that outputs a decision tree. Many decision tree methods, such as C4.5, construct a tree using a complete dataset. Incremental decision tree methods allow an existing tree to be updated using only new individual data instances, without having to re-process past instances. This may be useful in situations where the entire dataset is not available when the tree is updated (i.e. the data was not stored), the original data set is too large to process or the characteristics of the data change over time.

Applications

Methods

Here is a short list of incremental decision tree methods, organized by their (usually non-incremental) parent algorithms.

CART family

CART[1] (1984) is a nonincremental decision tree inducer for both classification and regression problems. developed in the mathematics and statistics communities. CART traces its roots to AID (1963)[2]

ID3/C4.5 family

ID3 (1986)[4] and C4.5 (1993)[5] were developed by Quinlan and have roots in Hunt's Concept Learning System (CLS, 1966)[6] The ID3 family of tree inducers was developed in the engineering and computer science communities.

note: ID6NB (2009)[12] is not incremental.

Other Incremental Learning Systems

There were several incremental concept learning systems that did not build decision trees, but which predated and influenced the development of the earliest incremental decision tree learners, notably ID4.[7] Notable among these was Schlimmer and Granger's STAGGER (1986),[13] which learned disjunctive concepts incrementally. STAGGER was developed to examine concepts that changed over time (concept drift). Prior to STAGGER, Michalski and Larson (1978)[14] investigated an incremental variant of AQ (Michalski, 1973),[15] a supervised system for learning concepts in disjunctive normal form (DNF). Experience with these earlier systems and others, to include incremental tree-structured unsupervised learning, contributed to a conceptual framework for evaluating incremental decision tree learners specifically, and incremental concept learning generally, along four dimensions that reflect the inherent tradeoffs between learning cost and quality:[7] (1) cost of knowledge base update, (2) the number of observations that are required to converge on a knowledge base with given characteristics, (3) the total effort (as a function of the first two dimensions) that a system exerts, and the (4) quality (often consistency) of the final knowledge base. Some of the historical context in which incremental decision tree learners emerged is given in Fisher and Schlimmer (1988),[16] and which also expands on the four factor framework that was used to evaluate and design incremental learning systems.

VFDT Algorithm

Very Fast Decision Trees learner reduces training time for large incremental data sets by subsampling the incoming data stream.

OLIN and IFN

GAENARI

See also

External links

Notes and References

  1. Book: Breiman . L. . Friedman . J.H. . Olshen . R.A. . Stone . C.J. . [{{GBurl|gLs6DwAAQBAJ|pg=PT5}} Classification and regression trees ]. Wadsworth International . Belmont, CA . 1984 . 978-1-351-46048-4 .
  2. Morgan . J.N . Sondquist . J.A. . Problems in the analysis of survey data, and a proposal . J. Amer. Statist. Assoc. . 58 . 302. 415–434 . 1963 . 10.1080/01621459.1963.10500855.
  3. S.L. . Crawford . Extensions to the CART algorithm . International Journal of Man-Machine Studies . 31 . 2 . 197–217 . 1989 . 10.1016/0020-7373(89)90027-8 .
  4. J.R. . Quinlan . Induction of Decision Trees . Machine Learning . 1 . 1 . 81–106 . 1986 . 10.1007/BF00116251 . 13252401 .
  5. Book: Quinlan, J.R. . [{{GBurl|b3ujBQAAQBAJ|pg=PR5}} C4.5: Programs for machine learning ]. Elsevier . 2014 . 1993 . 978-1-55860-238-0 .
  6. Book: Hunt . E.B. . Marin . J. . Stone . P.J. . Experiments in induction . Academic Press . 1966 . 978-0-12-362350-8 .
  7. Book: Schlimmer . J.C. . Fisher . D. . A case study of incremental concept induction . https://www.researchgate.net/publication/221603307 . AAAI'86: Proceedings of the Fifth National Conference on Artificial Intelligence . Morgan Kaufmann . 1986 . 496–501 .
  8. Book: Utgoff, P.E. . ID5: An incremental ID3 . 10.1016/B978-0-934613-64-4.50017-7 . https://web.cs.umass.edu/publication/docs/1987/UM-CS-1987-095.pdf . Machine Learning Proceedings 1988 Proceedings of the Fifth International Conference on Machine Learning . Morgan Kaufmann . 1988 . 978-0-934613-64-4 . 107–120 . Publishers.
  9. P.E. . Utgoff . Incremental induction of decision trees . Machine Learning . 4 . 2. 161–186 . 1989 . 10.1023/A:1022699900025 . 5293072 .
  10. Kroon, M., Korzec, S., Adriani, P. (2007) ID6MDL: Post-Pruning Incremental Decision Trees.
  11. Utgoff . P.E. . Berkman . N.C. . Clouse . J.A. . Decision tree induction based on efficient tree restructuring . Machine Learning . 29 . 5–44 . 1997 . 10.1023/A:1007413323501 . 2743403 .
  12. Appavu . S. . Rajaram . R. . Knowledge-based system for text classification using ID6NB algorithm ]. Knowledge-Based Systems . 22 . 1 . 1–7 . 2009 . 10.1016/j.knosys.2008.04.006 .
  13. Schlimmer . J.C. . Granger, Jr. . R.H. . Incremental learning from noisy data . Machine Learning . 1 . 317–354 . 1986 . 3 . 10.1007/BF00116895 . 33776987 . free .
  14. Michalski . R.S. . Larson . J.B. . Selection of most representative training examples and incremental generation of VL hypotheses: The underlying methodology and the description of the programs ESEL and AQ11 . 1978 . UIUCDCS-R-78-867 . University of Illinois, Department of Computer Science . 1920/1544/78-03.
  15. Book: Michalski, R.S. . Discovering classification rules using variable-valued logic system VL1 . http://digilib.gmu.edu/jspui/bitstream/handle/1920/1515/73-01.pdf . 1920/1515/73-01 . IJCAI'73: Proceedings of the Third International Joint Conference on Artificial Intelligence . Morgan Kaufmann . Stanford, CA . 1973 . 162–172 .
  16. Fisher . D. . Schlimmer . J. . Models of Incremental Concept Learning: A coupled research proposal . 1988 . CS-88-05 . Vanderbilt University .
  17. Book: Domingos . P. . Hulten . G. . Mining high-speed data streams . 10.1145/347090.347107 . http://web.cs.wpi.edu/~cs525/f13b-EAR/cs525-homepage/lectures/PAPERS/p71-domingos.pdf . Proceedings KDD Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining . ACM Press . 2000 . 1-58113-233-6 . 71–80 . 8810610 .
  18. Book: Hulten . G. . Spencer . L. . Domingos . P. . Mining time-changing data streams . https://www.csse.monash.edu.au/~mgaber/Mining%20TimeChanging.pdf . 10.1145/502512.502529 . Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining . ACM Press . 2001 . 978-1-58113-391-2 . 97–106 . 6416602 .
  19. Gama . J. . Fernandes . R. . Rocha . R. . Decision trees for mining data streams . Intelligent Data Analysis . 10 . 23–45 . 2006 . 10.3233/IDA-2006-10103 .
  20. M. . Last . Online classification of nonstationary data streams . Intell. Data Anal. . 6 . 2 . 129–147 . 2002 . 10.3233/IDA-2002-6203 .
  21. Cohen . L. . Avrahami . G. . Last . M. . Kandel . A. . Info-fuzzy algorithms for mining dynamic data streams . Applied Soft Computing . 8 . 4 . 1283–94 . 2008 . 10.1016/j.asoc.2007.11.003 .
  22. Book: Maimon . O. . Last . M. . The info-fuzzy network (IFN) methodology. Knowledge Discovery and Data Mining . Kluwer . 10.1007/978-1-4757-3296-2 . 2000 . 978-1-4757-3296-2 . 41520652 .