Fat tree explained

The fat tree network is a universal network for provably efficient communication.[1] It was invented by Charles E. Leiserson of MIT in 1985. k-ary n-trees, the type of fat tree used in most high-performance networks, were first formalized in 1997.[2]

In a tree data structure, every branch has the same thickness (bandwidth) regardless of its place in the hierarchy: all branches are "skinny", meaning low-bandwidth. In a fat tree, branches nearer the top of the hierarchy are "fatter" (thicker) than branches further down. In a telecommunications network, the branches are data links; varying the thickness (bandwidth) of those links allows for more efficient and technology-specific use.
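The difference between a skinny and a fat tree can be sketched numerically. The example below is only an illustration under stated assumptions: it models a complete binary tree and assumes that each fat-tree link carries the combined bandwidth of all leaves beneath it, so capacity doubles at every level toward the root. Real designs choose the growth rate to suit their hardware. The function names `link_capacities` and `root_bandwidth` are hypothetical.

```python
def link_capacities(depth, fat=True, leaf_bandwidth=1):
    """Return the capacity of one uplink at each level of a complete binary tree.

    Index 0 is a link just above a leaf; index depth - 1 is a link
    entering the root. Assumption: in the fat variant, a link's capacity
    equals the total bandwidth of the leaves below it.
    """
    if depth < 1:
        raise ValueError("depth must be at least 1")
    capacities = []
    for level in range(depth):
        # An uplink at this level sits above 2**level leaves.
        leaves_below = 2 ** level
        if fat:
            capacities.append(leaf_bandwidth * leaves_below)
        else:
            # Skinny tree: every branch has the same thickness.
            capacities.append(leaf_bandwidth)
    return capacities


def root_bandwidth(depth, fat=True, leaf_bandwidth=1):
    """Total capacity of the two links that enter the root."""
    return 2 * link_capacities(depth, fat, leaf_bandwidth)[-1]


if __name__ == "__main__":
    print("fat:   ", link_capacities(3), "root total:", root_bandwidth(3))
    print("skinny:", link_capacities(3, fat=False),
          "root total:", root_bandwidth(3, fat=False))
```

With 8 leaves (`depth=3`), the root of the skinny tree can carry only 2 units of traffic between its halves, while the fat tree under this doubling assumption carries 8, enough for every leaf to communicate across the root at once.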

Mesh and hypercube topologies have communication requirements that follow a rigid algorithm, and cannot be tailored to specific packaging technologies.[3]

Applications in supercomputers

Supercomputers that use a fat tree network[4] include the two fastest as of late 2018,[5] Summit[6] and Sierra,[7] as well as Tianhe-2,[8] the Meiko Scientific CS-2, Yellowstone, the Earth Simulator, the Cray X2, the Connection Machine CM-5, and various Altix supercomputers.

Mercury Computer Systems applied a variant of the fat tree topology, the hypertree network, to its multicomputers. In this architecture, 2 to 360 compute nodes were arranged in a circuit-switched fat tree network. Each node had local memory that could be mapped by any other node. Each node in this heterogeneous system could be an Intel i860, a PowerPC, or a group of three SHARC digital signal processors.

The fat tree network was particularly well suited to fast Fourier transform computations, which customers used for such signal processing tasks as radar, sonar, and medical imaging.

Related topologies

In August 2008, a team of computer scientists at UCSD published a scalable design for data center network architecture[9] that uses a topology inspired by the fat tree to build networks that scale better than previous hierarchical networks. The architecture uses commodity switches that are cheaper and more power-efficient than high-end modular data center switches.

This topology is actually a special instance of a Clos network rather than a fat tree as described above, because the edges near the root are emulated by many links to separate parents instead of a single high-capacity link to a single parent. However, many authors continue to use the term "fat tree" for this topology.
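As a rough illustration of how such a design scales, the sketch below counts the components of a three-tier network built entirely from identical k-port switches, following the construction described in reference [9]: k pods, each with k/2 edge and k/2 aggregation switches, plus (k/2)^2 core switches. The function name `clos_fat_tree_size` and the dictionary keys are hypothetical, chosen only for this example.

```python
def clos_fat_tree_size(k):
    """Count the parts of a three-tier fat tree built from k-port switches.

    Assumes the construction in reference [9]: every switch has k ports,
    half facing down and half facing up, so k must be even.
    """
    if k < 2 or k % 2 != 0:
        raise ValueError("k must be an even integer of at least 2")
    half = k // 2
    pods = k
    return {
        "pods": pods,
        "edge_switches": pods * half,         # k/2 per pod
        "aggregation_switches": pods * half,  # k/2 per pod
        "core_switches": half * half,         # (k/2)^2
        "hosts": pods * half * half,          # k^3 / 4
        # Each pod reaches the core over many unit-speed links to
        # separate parents, not one fat link to a single parent.
        "uplinks_per_pod": half * half,
    }


if __name__ == "__main__":
    for ports in (4, 48):
        print(ports, clos_fat_tree_size(ports))
```

Each pod has as many uplinks to the core as it has hosts, which is how many thin links to separate parents stand in for the single high-capacity link of the original fat tree.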

Notes and References

  1. Leiserson, Charles E. (October 1985). "Fat-trees: universal networks for hardware-efficient supercomputing". IEEE Transactions on Computers. 34 (10): 892–901. doi:10.1109/TC.1985.6312192. S2CID 8927584.
  2. Petrini, Fabrizio (1997). "K-ary n-trees: High performance networks for massively parallel architectures". Proceedings 11th International Parallel Processing Symposium. pp. 87–93. doi:10.1109/IPPS.1997.580853. ISBN 0-8186-7793-7. S2CID 6608892. https://ieeexplore.ieee.org/document/580853
  3. Leiserson, Charles E.; Abuhamdeh, Zahi S.; Douglas, David C.; Feynman, Carl R.; Ganmukhi, Mahesh N.; Hill, Jeffrey V.; Hillis, W. Daniel; Kuszmaul, Bradley C.; St. Pierre, Margaret A.; Wells, David S.; Wong, Monica C.; Yang, Shaw-Wen; Zak, Robert (1992). "The Network Architecture of the Connection Machine CM-5". SPAA '92 Proceedings of the fourth annual ACM symposium on Parallel algorithms and architectures. ACM. pp. 272–285. doi:10.1145/140901.141883. ISBN 978-0-89791-483-3. S2CID 6307237. https://dl.acm.org/citation.cfm?doid=140901.141883
  4. Deng, Yuefan (2013). "3.2.1 Hardware systems: Network Interconnections: Topology". Applied Parallel Computing. World Scientific. p. 25. ISBN 978-981-4307-60-4. https://books.google.com/books?id=YS9wvVeWrXgC&pg=PA25
  5. "November 2018 TOP500". TOP500. November 2018. Retrieved 2019-02-11.
  6. "Summit - Oak Ridge National Laboratory's next High Performance Supercomputer". Oak Ridge Leadership Computing Facility. Retrieved 2019-02-11.
  7. Barney, Blaise (2019-01-18). "Using LC's Sierra Systems - Hardware - Mellanox EDR InfiniBand Network - Topology and LC Sierra Configuration". Lawrence Livermore National Laboratory. Retrieved 2019-02-11.
  8. Dongarra, Jack (2013-06-03). "Visit to the National University for Defense Technology Changsha, China". Netlib. Retrieved 2013-06-17.
  9. Al-Fares, Mohammad; Loukissas, Alexander; Vahdat, Amin (2008). "A scalable, commodity data center network architecture". Proceedings of the ACM SIGCOMM 2008 conference on Data communication. ACM. pp. 63–74. doi:10.1145/1402958.1402967. ISBN 978-1-60558-175-0. S2CID 65842. http://www.cs.kent.edu/~javed/class-CXNET09S/papers-CXNET-2009/FaLV08-DataCenter-interconnect-p63-alfares.pdf