Seam carving explained
Seam carving (or liquid rescaling) is an algorithm for content-aware image resizing, developed by Shai Avidan, of Mitsubishi Electric Research Laboratories (MERL), and Ariel Shamir, of the Interdisciplinary Center and MERL. It functions by establishing a number of seams (paths of least importance) in an image and automatically removes seams to reduce image size or inserts seams to extend it. Seam carving also allows manually defining areas in which pixels may not be modified, and features the ability to remove whole objects from photographs.
The purpose of the algorithm is image retargeting, which is the problem of displaying images without distortion on media of various sizes (cell phones, projection screens) using document standards, like HTML, that already support dynamic changes in page layout and text but not images.[1]
Image Retargeting was invented by Vidya Setlur, Saeko Takage, Ramesh Raskar, Michael Gleicher and Bruce Gooch in 2005.[2] The work by Setlur et al. won the 10-year impact award in 2015.
Seams
Seams can be either vertical or horizontal. A vertical seam is a path of pixels connected from top to bottom in an image with one pixel in each row.[1] A horizontal seam is similar with the exception of the connection being from left to right. The importance/energy function values a pixel by measuring its contrast with its neighbor pixels.
Process
The below example describes the process of seam carving:
The seams to remove depends only on the dimension (height or width) one wants to shrink. It is also possible to invert step 4 so the algorithm enlarges in one dimension by copying a low energy seam and averaging its pixels with its neighbors.[1]
Computing seams
Computing a seam consists of finding a path of minimum energy cost from one end of the image to another.This can be done via Dijkstra's algorithm, dynamic programming, greedy algorithm or graph cuts among others.[1]
Dynamic programming
Dynamic programming is a programming method that stores the results of sub-calculations in order to simplify calculating a more complex result. Dynamic programming can be used to compute seams. If attempting to compute a vertical seam (path) of lowest energy, for each pixel in a row we compute the energy of the current pixel plus the energy of one of the three possible pixels above it.
The images below depict a DP process to compute one optimal seam.[1] Each square represents a pixel, with the top-left value in red representing the energy value of that pixel. The value in black represents the cumulative sum of energies leading up to and including that pixel.
The energy calculation is trivially parallelized for simple functions. The calculation of the DP array can also be parallelized with some interprocess communication. However, the problem of making multiple seams at the same time is harder for two reasons: the energy needs to be regenerated for each removal for correctness and simply tracing back multiple seams can form overlaps. Avidan 2007 computes all seams by removing each seam iteratively and storing an "index map" to record all the seams generated. The map holds a "nth seam" number for each pixel on the image, and can be used later for size adjustment.[1]
If one ignores both issues however, a greedy approximation for parallel seam carving is possible. To do so, one starts with the minimum-energy pixel at one end, and keep choosing the minimum energy path to the other end. The used pixels are marked so that they are not picked again.[3] Local seams can also be computed for smaller parts of the image in parallel for a good approximation.[4]
Issues
- The algorithm may need user-provided information to reduce errors. This can consist of painting the regions which are to be preserved. With human faces it is possible to use face detection.
- Sometimes the algorithm, by removing a low energy seam, may end up inadvertently creating a seam of higher energy. The solution to this is to simulate a removal of a seam, and then check the energy delta to see if the energy increases (forward energy). If it does, prefer other seams instead.[5]
Implementations
File:Broadway_tower_seam_carving_interactive.svg|thumb|250px|Interactive SVG demonstrating seam-carving using ImageMagick's liquid-rescale function. In the SVG file, hover over the percentages to compare the original image (top), its width rescaled to the percentage using seam-carving (middle), and rescaled to the same size using interpolation (bottom).default http://upload.wikimedia.org/wikipedia/commons/b/bf/Broadway_tower_seam_carving_interactive.svgFile:Creation_of_Adam_seam_carving_interactive.svg|thumb|250px|Interactive SVG demonstrating seam-carving using ImageMagick's liquid-rescale function. In the SVG file, hover over the percentages as above. Note that the faces are affected less than their surroundings.default http://upload.wikimedia.org/wikipedia/commons/5/53/Creation_of_Adam_seam_carving_interactive.svgAdobe Systems acquired a non-exclusive license to seam carving technology from MERL,[6] and implemented it as a feature in Photoshop CS4, where it is called Content Aware Scaling.[7] As the license is non-exclusive, other popular computer graphics applications (e. g. GIMP, digiKam, and ImageMagick) as well as some stand-alone programs (e. g. iResizer)[8] also have implementations of this technique, some of which are released as free and open source software.[9] [10] [11]
Improvements and extensions
- Better energy function and application to video by introducing 2D (time+1D) seams.[5]
- Faster implementation on GPU.[4]
- Application of this forward energy function to static images.[12]
- Multi-operator: Combine with cropping and scaling.[13]
- Much faster removal of multiple seams[14]
A 2010 review of eight image retargeting methods found that seam carving produced output that was ranked among the worst of the tested algorithms. It was, however, a part of one of the highest-ranking algorithms: the multi-operator extension mentioned above (combined with cropping and scaling).[15]
See also
External links
Notes and References
- Book: Avidan . Shai . Shamir . Ariel . ACM SIGGRAPH 2007 papers . Seam carving for content-aware image resizing . July 2007 . 10 . 10.1145/1275808.1276390 . 978-1-4503-7836-9 . en . free.
- Book: Vidya Setlur . Saeko Takage . Ramesh Raskar . Michael Gleicher . Bruce Gooch . Proceedings of the 4th international conference on Mobile and ubiquitous multimedia - MUM '05 . Automatic image retargeting . December 2005 . 59–68 . 10.1145/1149488.1149499 . 0-473-10658-2 . EN . free.
- Web site: Bist . Palakkode . 2016 . Parallel Seam Carving . www.andrew.cmu.edu.
- Chen-Kuo Chiang . Shu-Fan Wang . Yi-Ling Chen . Shang-Hong Lai . Fast JND-Based Video Carving With GPU Acceleration for Real-Time Video Retargeting . IEEE Transactions on Circuits and Systems for Video Technology . November 2009 . 19 . 11 . 1588–1597 . 10.1109/TCSVT.2009.2031462. 15124131 .
- http://www.faculty.idc.ac.il/arik/site/seam-video.asp Improved Seam Carving for Video Retargeting.
- https://archive.today/20130201110145/http://www.reuters.com/article/pressRelease/idUS175954+16-Dec-2008+BW20081216 Mitsubishi Electric press release
- http://www.photoshopsupport.com/photoshop-cs4/what-is-new-in-photoshop-cs4.html Adobe Photoshop CS4 new feature list
- http://www.iresizer.com iResizer Content aware image resizing software by Teorex
- http://liquidrescale.wikidot.com/en:examples Liquid Rescale
- http://www.digikam.org/node/439 Announcement of inclusion
- http://www.imagemagick.org/Usage/resize/#liquid-rescale Seam carving capability included
- Web site: Improved seam carving with forward energy.
- https://faculty.runi.ac.il/arik/site/multi-op.asp Multi-operator Media Retargeting.
- http://vmcl.xjtu.edu.cn/Real-Time%20Content-Aware%20Image%20Resizing.files/real_time_content_aware_image_resizing.pdf Real-time content-aware image resizing
- A Comparative Study of Image Retargeting. ACM Transactions on Graphics. 29. 5. 2010. Michael . Rubinstein. Diego . Gutierrez. Olga . Sorkine. Ariel . Shamir. 1–10. 10.1145/1882261.1866186. See also the RetargetMe benchmark.