Lazy learning explained

(Not to be confused with the lazy learning regime; see Neural tangent kernel.)

In machine learning, lazy learning is a learning method in which generalization of the training data is, in theory, delayed until a query is made to the system, as opposed to eager learning, where the system tries to generalize the training data before receiving queries.[1]
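
The contrast with eager learning can be sketched in a few lines of Python. This is a minimal illustration and not a reference implementation: the class name `LazyKNN`, its methods, and the choice of Euclidean distance with majority voting are assumptions made for the example. The point is that storing an example involves no model fitting, and all generalization work happens inside `predict`.

```python
import math
from collections import Counter


class LazyKNN:
    """Hypothetical minimal k-NN classifier: 'training' only stores examples."""

    def __init__(self, k=3):
        self.k = k
        self.examples = []  # list of (feature_tuple, label) pairs

    def add(self, features, label):
        # No model is fitted here; the example is simply stored.
        self.examples.append((tuple(features), label))

    def predict(self, query):
        # Generalization is deferred to query time: find the k stored
        # examples closest to the query and take a majority vote.
        nearest = sorted(
            self.examples, key=lambda ex: math.dist(ex[0], query)
        )[: self.k]
        votes = Counter(label for _, label in nearest)
        return votes.most_common(1)[0][0]


knn = LazyKNN(k=3)
for point, label in [((0, 0), "a"), ((0, 1), "a"), ((1, 0), "a"),
                     ((5, 5), "b"), ((5, 6), "b"), ((6, 5), "b")]:
    knn.add(point, label)

print(knn.predict((0.2, 0.1)))
print(knn.predict((5.5, 5.5)))
```

Because nothing is precomputed, a newly added example influences the very next query without any retraining step; the price is that every query scans the stored examples.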

The primary motivation for employing lazy learning, as in the k-nearest neighbors algorithm used by online recommendation systems ("people who viewed/purchased/listened to this movie/item/tune also ..."), is that the data set is continuously updated with new entries (e.g., new items for sale at Amazon, new movies to view at Netflix, new clips on YouTube, new music on Spotify or Pandora). Because of this continuous updating, the "training data" would become obsolete in a relatively short time, especially in areas such as books, movies, and music, where new best-sellers and hits are released continuously. Therefore, one cannot really speak of a "training phase".

Lazy classifiers are most useful for large, continuously changing datasets with few attributes that are commonly queried. Specifically, even if a large set of attributes exists (for example, books have a year of publication, authors, publisher, title, edition, ISBN, selling price, etc.), recommendation queries rely on far fewer attributes, e.g., purchase or viewing co-occurrence data and user ratings of items purchased or viewed.[2]

Advantages

The main advantage of a lazy learning method is that the target function is approximated locally, as in the k-nearest neighbor algorithm. Because the target function is approximated locally for each query to the system, lazy learning systems can simultaneously solve multiple problems and deal successfully with changes in the problem domain. At the same time, they can reuse many theoretical and applied results from linear regression modelling (notably the PRESS statistic) and control.[3] This advantage is said to be realized when predictions from a single training set are needed for only a few objects.[4] The k-NN technique illustrates this: it is instance-based, and the function is only estimated locally.[5][6]
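
As a rough illustration of local approximation, the sketch below fits a separate straight line to the nearest neighbours of each query and also reports the PRESS (leave-one-out) statistic of that local fit. It is a simplified example under stated assumptions, not the method of the cited works: the function name `local_linear_predict`, the one-dimensional inputs, and the fixed neighbourhood size `k` are all choices made here. The PRESS value uses the standard linear-regression identity that a leave-one-out residual equals the ordinary residual divided by one minus the point's leverage.

```python
def local_linear_predict(xs, ys, query, k=4):
    """Fit a line to the k training points nearest to `query` (1-D inputs).

    Returns (prediction, press), where press is the sum of squared
    leave-one-out residuals of the local linear model.
    """
    # Select the k nearest neighbours of the query.
    idx = sorted(range(len(xs)), key=lambda i: abs(xs[i] - query))[:k]
    lx = [xs[i] for i in idx]
    ly = [ys[i] for i in idx]
    n = len(lx)
    x_mean = sum(lx) / n
    y_mean = sum(ly) / n
    sxx = sum((x - x_mean) ** 2 for x in lx)
    if sxx == 0:
        # All neighbours share one x value: fall back to the local mean.
        return y_mean, float("nan")

    # Ordinary least-squares line through the neighbours.
    slope = sum((x - x_mean) * (y - y_mean) for x, y in zip(lx, ly)) / sxx
    intercept = y_mean - slope * x_mean
    prediction = intercept + slope * query

    # PRESS from ordinary residuals and leverages, with no refitting.
    press = 0.0
    for x, y in zip(lx, ly):
        leverage = 1.0 / n + (x - x_mean) ** 2 / sxx
        if 1.0 - leverage < 1e-12:
            # A point that fully determines the fit has no usable
            # leave-one-out residual.
            return prediction, float("inf")
        residual = y - (intercept + slope * x)
        press += (residual / (1.0 - leverage)) ** 2
    return prediction, press


# Two regimes in one data set: y = 2x near x = 0..3, y = 50 - x near x = 10..13.
xs = [0, 1, 2, 3, 10, 11, 12, 13]
ys = [0, 2, 4, 6, 40, 39, 38, 37]
print(local_linear_predict(xs, ys, 1.5))
print(local_linear_predict(xs, ys, 11.5))
```

A single global line would fit neither regime well, whereas each query here gets its own local model. In a fuller implementation the PRESS value could be compared across several neighbourhood sizes to choose `k` per query; that selection step is omitted from this sketch.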

Disadvantages

Theoretical disadvantages with lazy learning include:

There are standard techniques to improve recomputation efficiency so that a particular answer is not recomputed unless the data that affect it have changed (e.g., new items, new purchases, new views). In other words, the stored answers are updated incrementally.

This approach, used by large e-commerce or media sites, has long been used in the Entrez portal of the National Center for Biotechnology Information (NCBI) to precompute similarities between the different items in its large datasets: biological sequences, 3-D protein structures, published-article abstracts, etc. Because "find similar" queries are asked so frequently, the NCBI uses highly parallel hardware to perform nightly recomputation. The recomputation is performed only for new entries in the datasets against each other and against existing entries: the similarity between two existing entries need not be recomputed.
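
The incremental scheme described above can be sketched as follows. This is a hypothetical, single-process illustration and says nothing about how NCBI or any particular site implements it: the class `SimilarityStore`, its methods, and the use of cosine similarity on small vectors are assumptions for the example. When a batch of new entries arrives, similarities are computed only for new-versus-existing and new-versus-new pairs; pairs of existing entries keep their stored values.

```python
import math
from itertools import combinations


def cosine(u, v):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    return dot / (math.hypot(*u) * math.hypot(*v))


class SimilarityStore:
    """Hypothetical cache of pairwise similarities, updated incrementally."""

    def __init__(self, similarity):
        self.similarity = similarity
        self.items = {}        # item id -> feature vector
        self.cache = {}        # frozenset({id_a, id_b}) -> similarity
        self.computations = 0  # number of similarity evaluations so far

    def _store(self, a, b):
        self.cache[frozenset((a, b))] = self.similarity(
            self.items[a], self.items[b]
        )
        self.computations += 1

    def add_batch(self, new_items):
        # Assumes every id in new_items is not already in the store.
        existing_ids = list(self.items)
        new_ids = list(new_items)
        self.items.update(new_items)
        for new_id in new_ids:            # new entries against existing ones
            for old_id in existing_ids:
                self._store(new_id, old_id)
        for a, b in combinations(new_ids, 2):  # new entries against each other
            self._store(a, b)

    def most_similar(self, item_id, top=3):
        # Answered from the stored values; nothing is recomputed here.
        others = [i for i in self.items if i != item_id]
        scored = [(i, self.cache[frozenset((item_id, i))]) for i in others]
        return sorted(scored, key=lambda pair: pair[1], reverse=True)[:top]


store = SimilarityStore(cosine)
store.add_batch({"a": (1.0, 0.0), "b": (0.9, 0.1), "c": (0.0, 1.0)})
store.add_batch({"d": (1.0, 0.05)})  # only pairs involving "d" are computed
print(store.computations)
print(store.most_similar("a", top=1))
```

The first batch of three entries requires three pairwise evaluations and the second batch adds only the three pairs involving the new entry, so the cost of an update grows with the number of new entries rather than with the square of the whole data set.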

Examples of Lazy Learning Methods

Notes and References

  1. Aha, David (29 June 2013). Lazy Learning (illustrated ed.). Springer Science & Business Media. p. 424. ISBN 978-9401720533. Retrieved 30 September 2021.
  2. Tamrakar, Preeti; Roy, Siddharth Singha; Satapathy, Biswajit; Ibrahim, S. P. Syed (2019). Integration of lazy learning associative classification with kNN algorithm. pp. 1–4. doi:10.1109/ViTECoN.2019.8899415. ISBN 978-1-5386-9353-7.
  3. Bontempi, Gianluca; Birattari, Mauro; Bersini, Hugues (1 January 1999). "Lazy learning for local modelling and control design". International Journal of Control. 72 (7–8): 643–658. doi:10.1080/002071799220830.
  4. Sammut, Claude; Webb, Geoffrey I. (2011). Encyclopedia of Machine Learning. New York: Springer Science & Business Media. p. 572. ISBN 9780387307688.
  5. Pal, Saurabh (2017-11-02). Data Mining Applications. A Comparative Study for Predicting Student's Performance. GRIN Verlag. ISBN 9783668561458.
  6. Loncarevic, Zvezdan; Simonic, Mihael; Ude, Ales; Gams, Andrej (2022). Combining Reinforcement Learning and Lazy Learning for Faster Few-Shot Transfer Learning. pp. 285–290. doi:10.1109/Humanoids53995.2022.10000095. ISBN 979-8-3503-0979-9.
  7. Aha, David W. (2013). Lazy Learning. Berlin: Springer Science & Business Media. p. 106. ISBN 9789401720533.