Hybrid Images


rogerngo.com / Notebooks / Hybrid Images

Roger Ngo, rngo2@illinois.edu

Part 1

Favorite Result

My favorite result was a hybrid image between former President Barack Obama and First Lady Michelle Obama.

The low frequency image chosen is President Barack Obama. The high frequency image is Michelle Obama.

Barack Obama (Original)
Michelle Obama (Original)
Barack Obama (Filtered), sigma=15
Michelle Obama (Filtered), sigma=10
"Mirack Obama" Hybrid Result
"Mirack Obama" Hybrid Result
(10x Downsample)
FFT Barack Obama (Pre-Filter)
FFT Michelle Obama (Pre-Filter)
FFT Low + FFT High = FFT Hybrid

The result of this picture turned out quite well. I used a sigma_low value of 15 and sigma_high value of 10 for the low, and high pass frequency cutoffs respectively.

When viewed close up, the hybrid image exhibits more of the high frequency characteristics. In this case, a lot of the image involving Michelle Obama is displayed. As one steps back further away from the screen, they will start to see more of the low frequency characteristics of the image. In this case, more of Barack Obama.

I have included a 10x downsampled hybrid image to simulate the effect of stepping further away from the screen.

There are a few reasons as to why I think this result turned out well.

  • Both images were aligned very well without any modification.
  • Barack Obama chosen as a low frequency image is a good choice in that he is still a recognizable figure when viewed from afar.
  • Michelle Obama is a good high frequency image because of some of the strong details exhibited in the original image such as necklace, jewelry, facial make up and other things that make her "visibly female".

Attribution:

  • Barack Obama: Wikipedia
  • Michelle Obama: Biography.com

Additional Results

Lion and Tiger

Tiger (Original)
Lion (Original)
"Liger" Hybrid Result, sigma_low=10; sigma_high=7
"Liger" Hybrid Result
(10x Downsampled)

I had to tweak the parameters here to have sigma_low = 10 for low frequency cutoff and sigma_high = 7 for the high frequency cutoff.

I had initially assumed that the Tiger would not serve as a good low frequency component for the hybrid image. The main reason was being that the stripes may have provided too much detail that would not display well when downsampled. However, I was wrong here and that the image came out fine. I think it is because tiger stripes are easily recognizable when viewed from afar. The lion was a good choice for high frequency component of the image because of all the hair on the mane.

Attribution:

  • Lion: Scientific American
  • Tiger: National Zoo

Roger and Alexa

Roger (Original)
Alexa (Original)
"Alexer" Hybrid Result, sigma_low=20; sigma_high=10
"Alexer" Hybrid Result
(10x Downsampled)

Here are pictures of myself and my wife. It was taken at the beach a few years ago. Using myself as the low frequency image was easy as in comparison, I do not have make up on my face. Since I have less detail and am still recognizable as an adult male.

Again, the frequency cutoffs had to be adjusted. sigma_low = 20 for the low frequency image, and sigma_high = 10 for the high frequency image.

Attribution:

  • Roger: (self)
  • Alexa: (wife)

Part 2

Contrast Enhancement

Original Image
Contrast Enhanced Image

I had used one of my wedding photos, and cropped it for this picture.

The method I had used to perform contrast enahncement is to first convert the image into the HSV color space.

I used the cv2.equalizeHist function to perform the equalization.

Then, histogram equalization is applied on the luminance (V) channel of HSV as that corresponds to brightness.

Once contrast has been adjusted within the luminance (V) channel, I followed it up by saturating the colors more through modifying the S (saturation) channel. Finally, I apply a simple blur filter to reduce the amount of noise.

Color Enhancement

Original Image
Color Enhanced Image

Here is a picture of my daughter while she was still an infant.

The image was processed within the RGB color space. The method chosen to color enhance was using the gray world technique.

The method to perform the color enhancement is as follows:

  • Take the average of each value in every color channel, obtaining r_avg, g_avg and b_avg.
  • Obtain the total sum of all the values in each channel
  • Obtain the average pixel value by dividing the sum from all channels and the total number of pixels from all channels.
  • Using the average, we divide this by the average values found in each channel to obtain r_alpha, g_alpha and b_alpha. We can think of these as weights
  • Then for each pixel in each channel, multiply the value by the corresponding alpha value to obtain the new value.

Color Shift

Original Image
More Red
Less Yellow

We can manipulate the colors in an image to give a specific color temperature. For example, by working with the image in the Lab color space, I can make the image more red, or less yellow by adjusting the a, or b channels.

To make the image more red, I can adjust the a channel. I pick a value in this channel that is some threshold value. In this case, pixel values with 120. This would be the threshold. For all values in the a channel, if the pixel value is greater than 120, then it will be adjusted by some factor to make it more red.

The process is similar with making the image less yellow. Instead of addition, it will be subtraction in the b channel.

Bells and Whistles

Color Hybrid Images

Color Hybrid with both images in color
Color Hybrid with both images in color
(10x downsampled)
Color Hybrid with low frequency in color, high frequency in grayscale
Color Hybrid with low frequency in color, high frequency in grayscale
(10x downsampled)
Color Hybrid with low frequency in grayscale, and high frequency in color
Color Hybrid with low frequency in grayscale, and high frequency in color
(10x downsampled)

For the color hybrid image, I had to tweak the cut off frequencies in order for the colors to be merged properly in the final hybrid image. Otherwise, there will be a lot of artifacts appearing.

It seems to be the case that the low frequency component will always dominate the picture. So, either having the low frequency only as color, or both as color will yield a color result. Having the low frequency as grayscale and high frequency as color will still produce a seemingly grayscale hybrid.

Then, it may make sense to use a grayscale image for the high frequency component of the hybrid image as only edges are retained when a high pass filter is applied on it.

Gaussian and Laplacian Pyramids

Original Size
2x downsampled
4x downsampled
8x downsampled
Original Size
2x downsampled
4x downsampled
8x downsampled

For the Laplacian generated images, it is important to remember that pixel values may fall into the negative range. In order to accommodate for this, and ensure that edges and background are visible, I shifted each pixel by 0.5 before outputting. This makes it easier to see the edges in the pictures, in general.


Last modified: January 2021, Roger Ngo, rngo2@illinois.edu

To Top