<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">
<html>
<head>
<title>Psych 221 Final Project: Effects of reflectances, spatial transformations and noise on linear color balancing</title>
<style type="text/css">
  body 
  {
    padding-left: 100px;
    padding-right: 100px;
    color: black;
    background-color: #ffffcc; 
  }
  h4
  {
    font-size: 1.5em;
    background-color: #ccc;
    margin: 1em;
    padding: 3em;
  }
</style>
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">
</head>

<body>
<h2 style="text-align: center;"><em>Psych 221: Applied Vision and Image Systems</em></h2>
<h2 style="text-align: center;"><em>Effects of reflectances, spatial transformations and noise on color balancing</em></h2>
<h3 style="text-align: center;"><em>Bragi Sveinsson, Spring 2009-2010</em></h3> <br><br>

<h2>Introduction</h2>
<p>
In image processing, the appearance of a particular scene can be described as the combination of three factors: the reflectances in the scene, the scene 
illuminant (or lighting) and the device used for capturing the scene.</p>

<p> A scene pictured under different illuminants can show considerable color differences. The human visual system
is able to correct for this to a large extent, so that an object viewed under daylight looks similar when it 
is viewed under a light bulb, for example. However, when a camera is used to take pictures of the object under
the two illuminants, the differences can become very apparent.
</p>

<p> The art of color balancing deals with correcting for these differences. Many different types of color balancing
algorithms have been developed and they vary greatly in their complexity. An example of a simple algorithm is the
gray world algorithm<sup>1</sup>, where the RGB components of an image are scaled to have the same mean value. Examples of more
sophisticated algorithms are neural networks<sup>2</sup> and Retinex<sup>3</sup> methods.
</p>
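<p>As a rough illustration of the gray world idea (this is not part of the project's code), the following Python sketch scales each channel of a small list of RGB pixels so that all three channel means become equal to the overall mean. The function name <code>gray_world</code> and the pixel values are made up for the example, and the sketch assumes that no channel has a mean of zero.</p>
<pre>
```python
def gray_world(pixels):
    """Scale R, G and B so that every channel has the same mean value."""
    n = len(pixels)
    # Mean of each channel over all pixels.
    means = [sum(p[c] for p in pixels) / n for c in range(3)]
    # Target value: the average of the three channel means.
    target = sum(means) / 3.0
    gains = [target / m for m in means]
    return [[p[c] * gains[c] for c in range(3)] for p in pixels]


# Hypothetical pixels with a reddish cast.
pixels = [[0.8, 0.4, 0.2], [0.6, 0.3, 0.1], [0.4, 0.2, 0.3]]
balanced = gray_world(pixels)
```
</pre>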

<p> In this project, color balancing was done on a scene consisting of a variable number of reflectances illuminated
by tungsten light. A matrix was found which, when multiplied with the XYZ values of each reflectance under tungsten light, gave the best average fit to
the values the reflectances would have under D65 light. After the matrix had been determined from this set of "training"
reflectances, it was tested on a separate "testing" reflectance. The effect of altering the size of the
training set was examined, as well as the effects of adding noise to the training set and of performing the matrix fitting
in XYZ or Lab space.
</p>

<hr>

<h2> Methods </h2>

<p> A Matlab application was designed to aid with the experiment, as shown below.
</p>

<p><img src="application.jpg" width="900" height="500"></p>

<p>The application works as follows: The user chooses the actual scene lighting in the "light" drop-down
menu and an ideal light from the drop-down menu below. Typically these are tungsten and D65 lighting, respectively.
Then a set of training reflectances is chosen, either by specifying a number of reflectances and then having the application
randomly choose which ones to use, or by manually choosing each reflectance. A test reflectance is also chosen in the
drop-down box on the right.</p>

<p> The application also offers the options of doing the matrix estimation in Lab or XYZ space and of adding noise to the reflectance set. Both
of these options are explained in more detail later. To begin with, we assume the simplest case: no noise and
matrix estimation in XYZ space. The parameters for the matrix estimation are then as follows:
</p>

<ul>
<li><b>R<sub>1</sub></b>, <b>R<sub>2</sub></b>, ..., <b>R<sub>m</sub></b>: Column vectors of dimension nx1, each vector <b>R<sub>i</sub></b> representing the i'th training reflectance. 
The values give the reflectance at linearly spaced wavelengths from 380nm to 780nm with a spacing
of 4nm (we thus have 101 wavelength values, i.e. n=101).
<li><b>L<sub>actual</sub></b>, <b>L<sub>ideal</sub></b>: Diagonal matrices of dimension nxn, with the illuminant values at the n wavelengths on the diagonal, representing the actual light which is being corrected for and
the ideal light, respectively.
<li><b>XYZ<sub>conv</sub></b>: A matrix of dimension nx3, whose columns hold the values of the <font style="text-decoration: overline;">x</font>, 
<font style="text-decoration: overline;">y</font> and <font style="text-decoration: overline;">z</font> color matching functions
at the reflectance wavelengths, as defined by the CIE.
<li><b>sRGB</b>: A matrix of dimension 3x3, the standard matrix for converting XYZ values to RGB values under D65 lighting
for standard displays. The sRGB standard was designed by HP and Microsoft.
</ul>

<p>With each reflectance <b>R<sub>i</sub></b> written as an nx1 column vector, its XYZ values under each lighting are the 3x1 vectors given by the formulas 
<b>XYZ<sub>i-actual</sub></b> = <b>XYZ<sub>conv</sub></b><sup>T</sup>*<b>L<sub>actual</sub></b>*<b>R<sub>i</sub></b> and 
<b>XYZ<sub>i-ideal</sub></b> = <b>XYZ<sub>conv</sub></b><sup>T</sup>*<b>L<sub>ideal</sub></b>*<b>R<sub>i</sub></b>.
</p>
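<p>The product of reflectance, illuminant and color matching functions described above can be sketched in a few lines of Python. This is only an illustration, not the project's Matlab code: it assumes that the illuminant matrix is diagonal (so only its diagonal is stored), and the three-sample spectra are made-up numbers rather than real CIE or tungsten data.</p>
<pre>
```python
def xyz_from_reflectance(reflectance, illuminant, cmf):
    """Return [X, Y, Z] for one reflectance under one illuminant.

    reflectance: n reflectance samples (the vector R_i)
    illuminant:  n illuminant samples (the diagonal of L)
    cmf:         n rows of [xbar, ybar, zbar] (the matrix XYZconv)
    """
    xyz = [0.0, 0.0, 0.0]
    for r, light, row in zip(reflectance, illuminant, cmf):
        radiance = r * light  # light reflected at this wavelength
        for k in range(3):
            xyz[k] += radiance * row[k]
    return xyz


# Toy data with n = 3 wavelength samples (made-up numbers, not CIE values).
cmf = [[0.3, 0.1, 1.5], [0.4, 1.0, 0.1], [0.9, 0.4, 0.0]]
tungsten_like = [0.4, 1.0, 1.6]
reflectance = [0.2, 0.5, 0.8]
xyz_actual = xyz_from_reflectance(reflectance, tungsten_like, cmf)
```
</pre>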

<p> To display the resulting color, the XYZ vector is subsequently converted to RGB by multiplying it with the <b>sRGB</b> matrix. The
matrix <b>XYZ<sub>conv</sub></b>*<b>sRGB</b><sup>T</sup> can be thought of as a device matrix, converting to the RGB values of a standard display.
</p>
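<p>As a small sketch of this display step, the Python snippet below applies the commonly published XYZ-to-linear-sRGB matrix to an XYZ vector. Gamma encoding and clipping to the displayable range are left out, since the text above does not say how the application handles them. The function name is hypothetical.</p>
<pre>
```python
# Commonly published XYZ-to-linear-sRGB matrix (D65 white point).
SRGB = [
    [3.2406, -1.5372, -0.4986],
    [-0.9689, 1.8758, 0.0415],
    [0.0557, -0.2040, 1.0570],
]


def xyz_to_linear_rgb(xyz):
    """Multiply a 3-element XYZ vector by the sRGB matrix."""
    return [sum(SRGB[i][k] * xyz[k] for k in range(3)) for i in range(3)]


# The D65 white point (Y normalized to 1) should map close to RGB = (1, 1, 1).
rgb_white = xyz_to_linear_rgb([0.9505, 1.0, 1.0890])
```
</pre>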

<p> We now want to find a 3x3 color balancing matrix <b>CBM</b> such that <b>XYZ<sub>i-balanced</sub></b> = <b>CBM</b>*<b>XYZ<sub>i-actual</sub></b> is, on average, 
a good approximation of <b>XYZ<sub>i-ideal</sub></b> over all the training reflectances. To do this, we collect the 3x1 XYZ vectors as the columns of the 3xm matrices
<b>XYZ<sub>all-actual</sub></b> = [ <b>XYZ<sub>1-actual</sub></b> <b>XYZ<sub>2-actual</sub></b> ... <b>XYZ<sub>m-actual</sub></b> ] and
<b>XYZ<sub>all-ideal</sub></b> = [ <b>XYZ<sub>1-ideal</sub></b> <b>XYZ<sub>2-ideal</sub></b> ... <b>XYZ<sub>m-ideal</sub></b> ] and define the color balancing matrix by
<b>CBM</b> = <b>XYZ<sub>all-ideal</sub></b>*(<b>XYZ<sub>all-actual</sub></b><sup>T</sup>*(<b>XYZ<sub>all-actual</sub></b>*<b>XYZ<sub>all-actual</sub></b><sup>T</sup>)<sup>-1</sup>). This is the least-squares solution, i.e. it minimizes the sum of squared
errors between <b>CBM</b>*<b>XYZ<sub>all-actual</sub></b> and <b>XYZ<sub>all-ideal</sub></b>. The inverse exists when the training set contains at least three linearly independent XYZ vectors. The resulting matrix is then used to color balance the test reflectance in the same way:
<b>XYZ<sub>test-balanced</sub></b> = <b>CBM</b>*<b>XYZ<sub>test-actual</sub></b>. The color balancing matrix is shown at the bottom of the Matlab application.</p>
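<p>The least-squares formula for <b>CBM</b> can be sketched in plain Python as follows. This is an illustration under stated assumptions, not the project's Matlab code: matrices are stored as lists of rows, the helper names are hypothetical, and the training data are made up so that a known matrix can be recovered exactly.</p>
<pre>
```python
def matmul(a, b):
    """Product of two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def transpose(a):
    """Transpose of a matrix stored as a list of rows."""
    return [list(col) for col in zip(*a)]


def inverse3(m):
    """Inverse of a 3x3 matrix via the adjugate (cofactor) formula."""
    (a, b, c), (d, e, f), (g, h, i) = m
    det = a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)
    adj = [
        [e * i - f * h, c * h - b * i, b * f - c * e],
        [f * g - d * i, a * i - c * g, c * d - a * f],
        [d * h - e * g, b * g - a * h, a * e - b * d],
    ]
    return [[v / det for v in row] for row in adj]


def fit_cbm(xyz_actual, xyz_ideal):
    """CBM = I * A^T * (A * A^T)^-1 for 3 x m matrices A (actual), I (ideal)."""
    a_t = transpose(xyz_actual)
    gram = matmul(xyz_actual, a_t)  # 3 x 3 matrix A * A^T
    return matmul(matmul(xyz_ideal, a_t), inverse3(gram))


# Made-up example: four training colors (columns) and a known 3x3 matrix.
true_m = [[0.9, 0.1, 0.0], [0.05, 1.0, -0.05], [0.0, 0.2, 1.4]]
xyz_actual = [[1.0, 0.0, 0.0, 1.0], [0.0, 1.0, 0.0, 1.0], [0.0, 0.0, 1.0, 1.0]]
xyz_ideal = matmul(true_m, xyz_actual)
cbm = fit_cbm(xyz_actual, xyz_ideal)  # recovers true_m up to rounding
```
</pre>
<p>With real reflectances no single matrix maps every training color exactly, so the fitted matrix only minimizes the squared error instead of reproducing a known matrix.</p>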

<p> To estimate how closely the result of our color balancing resembles the color under the ideal light, we convert both <b>XYZ<sub>i-balanced</sub></b>
and <b>XYZ<sub>i-ideal</sub></b> to Lab space (the reference white point is the white patch of the Macbeth color checker under ideal light)
and find the Euclidean distance between the two vectors, giving a ΔE value. The resulting ΔE values for the training
set and the test reflectance are shown directly below the corresponding color images in the Matlab application.
</p>
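<p>A minimal Python sketch of this ΔE computation is given below, using the standard CIE 1976 Lab formulas. The project itself relied on ISET's Matlab conversion routines, so the function names here are only illustrative, and the D65 white point stands in for the Macbeth white patch used as the reference in the project.</p>
<pre>
```python
def xyz_to_lab(xyz, white):
    """CIE 1976 L*a*b* values from XYZ, relative to the given white point."""
    delta = 6.0 / 29.0

    def f(t):
        # Cube root above the threshold, linear segment near zero.
        if t > delta ** 3:
            return t ** (1.0 / 3.0)
        return t / (3.0 * delta ** 2) + 4.0 / 29.0

    fx, fy, fz = (f(v / w) for v, w in zip(xyz, white))
    return [116.0 * fy - 16.0, 500.0 * (fx - fy), 200.0 * (fy - fz)]


def delta_e(xyz_a, xyz_b, white):
    """Euclidean distance between two colors in Lab space."""
    lab_a = xyz_to_lab(xyz_a, white)
    lab_b = xyz_to_lab(xyz_b, white)
    return sum((p - q) ** 2 for p, q in zip(lab_a, lab_b)) ** 0.5


# Stand-in white point (D65, Y normalized to 1) and an 18% gray.
white = [0.9505, 1.0, 1.0890]
gray = [0.18 * w for w in white]
de = delta_e(white, gray, white)
```
</pre>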

<p> An interesting question that comes to mind is whether better results (i.e. lower ΔE values) would be obtained by
creating our conversion matrix in Lab space. In that case we convert each <b>XYZ<sub>i-actual</sub></b> and <b>XYZ<sub>i-ideal</sub></b> to the corresponding
Lab values <b>Lab<sub>i-actual</sub></b> and <b>Lab<sub>i-ideal</sub></b> and find the matrix that gives the least-squares fit of the former to the latter, just as described
above for the XYZ values. This gives us a color balanced vector <b>Lab<sub>i-balanced</sub></b> which can then be converted back to XYZ space
and subsequently displayed. The figure below illustrates the two methods:
</p>
<p><img src="Transform.jpg" width="800" height="440"></p>
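<p>The Lab-space method needs the inverse conversion to bring <b>Lab<sub>i-balanced</sub></b> back to XYZ space for display. The project used ISET's <em>lab2xyz.m</em> for this step. The Python function below is only an illustrative stand-in based on the standard CIE 1976 formulas, with a D65 white point assumed for the example.</p>
<pre>
```python
def lab_to_xyz(lab, white):
    """Invert the CIE 1976 L*a*b* transform for a given white point."""
    delta = 6.0 / 29.0
    fy = (lab[0] + 16.0) / 116.0
    fx = fy + lab[1] / 500.0
    fz = fy - lab[2] / 200.0

    def f_inv(t):
        # Cube above the threshold, linear segment near zero.
        if t > delta:
            return t ** 3
        return 3.0 * delta ** 2 * (t - 4.0 / 29.0)

    return [w * f_inv(t) for w, t in zip(white, (fx, fy, fz))]


white = [0.9505, 1.0, 1.0890]
# L* = 100 with a* = b* = 0 is the white point itself.
xyz_white = lab_to_xyz([100.0, 0.0, 0.0], white)
# L* = 50 with a* = b* = 0 is a gray with the chromaticity of the white point.
xyz_gray = lab_to_xyz([50.0, 0.0, 0.0], white)
```
</pre>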

<p> Another point of interest is what effect, if any, increasing the number of reflectances in our
 training set (i.e. increasing m) has. On one hand, one might argue that by using more reflectances to create our
 color balancing matrix, we are "spanning" a larger fraction of all possible colors, thus doing better on average
 for a random test reflectance. On the other hand, when a single linear transformation has to fit more reflectance 
 vectors, the fit to each individual training reflectance will be worse.
</p>

<p> Finally, it is interesting to examine the effects of adding noise to the reflectance vectors before the color
balancing is done. This could be viewed as the effect of using a set of known reference colors in an image (such as
a Macbeth color checker) to determine the appropriate color balancing method, but unbeknownst to the user, the
reference colors have become smudged and tainted over time and thus would not resemble their ideal counterparts
even under the appropriate light, such as D65.
</p>

<hr>

<h2> Results </h2>

<h3> Effect of size of training set </h3>

<p> We begin with examining the effect of increasing the number of reflectances in our training set, while
doing our linear transformation in XYZ space with no noise. The color balancing matrix was generated with a
training set size of 5, 6, 7, 8, 9, 10, 12, 14, 16 and 20 reflectances and the resulting ΔE value for the training set and for
the test reflectance recorded. This was done 50 times for each set size and the average values plotted.
</p>
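<p>The bookkeeping for this experiment (random training sets of each size, 50 trials per size, averaged error) might look as follows in Python. The <code>evaluate</code> callback is hypothetical and stands for "fit the matrix and return a ΔE value". Here it is replaced by <code>len</code>, and the pool by a list of integers, purely to show the loop structure. Whether the test reflectance was excluded from the random draws is not stated above, so the sketch does not model that.</p>
<pre>
```python
import random


def average_over_trials(pool, sizes, trials, evaluate, rng):
    """Average evaluate(training_set) over random training sets.

    pool:     list of all available reflectances
    sizes:    training set sizes to try
    trials:   number of random draws per size
    evaluate: hypothetical callback returning an error value (e.g. Delta E)
    rng:      random.Random instance used for the draws
    """
    averages = {}
    for m in sizes:
        total = 0.0
        for _ in range(trials):
            training_set = rng.sample(pool, m)  # m distinct reflectances
            total += evaluate(training_set)
        averages[m] = total / trials
    return averages


# Stand-in pool and error function, only to show the bookkeeping.
pool = list(range(100))
sizes = [5, 6, 7, 8, 9, 10, 12, 14, 16, 20]
result = average_over_trials(pool, sizes, 50, len, random.Random(1))
```
</pre>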
<p><img src="TrXNN.jpg" width="600" height="330"></p>
<p><img src="TeXNN.jpg" width="600" height="330"></p>

<p>The upper plot shows the average ΔE value for the training set. The plot seems to show that as the 
number of training reflectances increases, the average ΔE value for
the training set increases, indicating that as we try to fit the matrix transformation to more reflectances, the
fit for each reflectance becomes worse on average.</p>
<p> The lower plot shows that as we use a larger range of reflectances to create the color balancing matrix, it
does on average a better job of color balancing a new test reflectance, which makes intuitive sense.
</p>


<h3> Matrix estimation in Lab or XYZ space </h3>

<p>
As described above, here we are interested in whether we achieve lower values of ΔE by doing a linear transformation
in XYZ space and comparing the Lab value of the result to the Lab value of the ideal reference, or if it is better
to convert the reflectance values first to Lab space and do the best linear transformation there. As before, 50 tests
were performed for training sets with 5, 6, 7, 8, 9, 10, 12, 14, 16 and 20 reflectances, but this time the matrix estimation was done
in Lab space. The results for the training set (upper plot) and the test reflectance (lower plot) are shown below, along with the
XYZ values from before for comparison.
</p>
<p><img src="TrLXNN.jpg" width="600" height="330"></p>
<p><img src="TeLXNN.jpg" width="600" height="330"></p>

<p>The plots indicate that constructing the color balancing matrix in XYZ space gives much better results
than constructing it in Lab space, both for the training set and for the test reflectance. This seems
counterintuitive at first, since the matrix estimated in Lab space is by construction the linear map that gives the
smallest squared ΔE between the transformed <b>Lab<sub>all-actual</sub></b> and <b>Lab<sub>all-ideal</sub></b>. However, one must be careful when comparing the two approaches.
The conversion between XYZ space and Lab space is nonlinear, so constructing the color
balancing matrix in XYZ space and then converting to Lab space is a nonlinear transform when seen from Lab space, and it is not restricted to the linear maps considered by the Lab fit. Another
way to look at this is that if we create the matrix in Lab space, we have already "locked in" a certain amount of
error between the measured vectors and the target vectors in XYZ space, and therefore end up with a larger error. Furthermore,
the error for the test reflectance grows as we train with more reflectances in Lab space, which is
also unexpected. Overall, these results indicate that creating the color balancing matrix in Lab space does not in general
produce good results.</p>


<h3> Effect of noise </h3>

<p> Another point of interest is to see what happens when we add noise to the reflectances. As described above, this
can be viewed as using something akin to a Macbeth color checker in an image to get a reference for color balancing, but
with the color checker being dirty, worn out or otherwise damaged. The images below show the results of the same
color balancing methods as before, but this time with added Poisson noise. The top image shows results for the training
set with no noise, medium strength noise and strong noise. The bottom image shows the same for the test reflectance.
The definition of noise strength was somewhat arbitrary: medium noise was defined as having standard deviation equal to 5% of the 
average value of the average reflectance and strong noise as having variance 10% of that same average value. These values are
not significant in themselves, since we are only interested in the trends in color balancing effectiveness as the
noise gets stronger.</p>
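<p>To make the idea of perturbing a reflectance concrete, here is a small Python sketch. It is a simplification and not the project's noise model: it uses zero-mean Gaussian noise from the standard library instead of Poisson noise, it scales the standard deviation by the mean of the single reflectance passed in, and the function name and sample values are made up.</p>
<pre>
```python
import random


def add_noise(reflectance, relative_sigma, rng):
    """Add zero-mean Gaussian noise to a sampled reflectance.

    The standard deviation is relative_sigma times the mean of the
    reflectance. Results are clipped at 0 so they stay non-negative.
    """
    sigma = relative_sigma * sum(reflectance) / len(reflectance)
    return [max(0.0, r + rng.gauss(0.0, sigma)) for r in reflectance]


rng = random.Random(0)  # fixed seed so the example is repeatable
clean = [0.2, 0.3, 0.5, 0.6, 0.4]
noisy = add_noise(clean, 0.05, rng)
```
</pre>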

<p><img src="TrLXVN.jpg" width="600" height="330"></p>
<p><img src="TeLXVN.jpg" width="600" height="330"></p>

<p> The plots show that the results of the color balancing get worse as the noise gets stronger, and that
the matrix calculation in XYZ space is very sensitive to noise. Strangely, the Lab method seems to give slightly
better results with stronger noise, although they are still unacceptable. In fact, with strong
noise, the Lab calculation becomes preferable to the XYZ calculation when balancing the test reflectance, since
the XYZ method starts yielding error values so high that it is almost useless.</p>


<h3> Mapping of white point </h3>

<p>One way of getting a sense of the variability of the color balancing matrix is to see where the white
point is mapped by each estimate. This is done in the plots below, where each point represents the x, y chromaticity values
of the Macbeth white point, obtained by converting from tungsten to D65 with the matrix calculated in XYZ space for
all of the training set sizes. This was done for no noise, medium noise and strong noise,
represented in the top, middle and bottom images, respectively.</p>
<p><img src="WXNN.jpg" width="600" height="330"></p>
<p><img src="WX5N.jpg" width="600" height="330"></p>
<p><img src="WX10N.jpg" width="600" height="330"></p>

<p>The plots seem to show two trends. First, as noise gets stronger, the points get more spread out. Second,
this behavior seems to be more extreme for small training sets.</p>
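<p>The x, y values plotted above are chromaticity coordinates, obtained by normalizing an XYZ vector by the sum of its components. A minimal Python sketch, with a hypothetical function name and the D65 white point as example input:</p>
<pre>
```python
def chromaticity(xyz):
    """Return the CIE (x, y) chromaticity coordinates of an XYZ triple."""
    total = sum(xyz)
    return xyz[0] / total, xyz[1] / total


# D65 white point with Y normalized to 1.
x, y = chromaticity([0.9505, 1.0, 1.0890])
```
</pre>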

<hr>

<h2> Conclusions </h2>

<p>In this project, we have examined a relatively simple color balancing algorithm: finding the linear transformation
that gives the least squares fit of a set of training reflectances to their values under ideal conditions. We
have examined the effect of the training set size, the effect of noise and whether it is better to perform the least
squares fit in XYZ space or in Lab space. The conclusions are as follows:
</p>

<ul>
<li> A larger training set gives better results when the matrix fitting is done in XYZ space under no noise.
<li> Performing the matrix fit in XYZ space yields better results than in Lab space, except under very strong noise.
<li> Noise added to the reflectance values is very detrimental to this method when the matrix calculation is done in XYZ space.
</ul>

<p> Furthermore, a Matlab application was developed to help with testing this method under various parameter settings.</p>

<hr>

<h2> Acknowledgements </h2>
Many thanks to Dr. Brian Wandell, Dr. Joyce Farrell, Dr. Steve Lansel and Reno Bowen for helping me understand the
material and learn to use the ISET software. I also learned a great deal about color balancing from the previous year's
projects on color balancing by Daniel Chang<sup>4</sup> and Srinivasa Rangan Sridharan<sup>5</sup>. All reflectance and illuminant values were
obtained from the project of Daniel Chang, who in turn obtained them from Dr. Kobus Barnard<sup>6</sup>.

<h2> References</h2>
[1]: Kobus Barnard, Lindsay Martin, Adam Coath, and Brian Funt: A Comparison of Computational Color
Constancy Algorithms; Part Two: Experiments with Image Data. IEEE Transactions on Image Processing, 2002.
<br>
[2]: Brian Funt, Vlad Cardei and Kobus Barnard: Learning Color Constancy. Proceedings of the Fourth IS&amp;T/SID Color Imaging Conference, pp. 58-60, 1996.
<br>
[3]: Edwin H. Land and John J. McCann: Lightness and Retinex Theory. Journal of the Optical Society of America, 1971.
<br>
[4]: Webpage by Daniel Chang: <a href = "http://scien.stanford.edu/pages/labsite/2009/psych221/projects/09/chang/index.html" >Color Loader</a>
<br>
[5]: Webpage by Srinivasa Rangan Sridharan: <a href = "http://scien.stanford.edu/pages/labsite/2009/psych221/projects/09/rangan/index.html" >Color Balancing Using Color by Correlation</a>
<br>
[6]: Webpage by Dr. Kobus Barnard: <a href = "http://kobus.ca/research/data/colour_constancy_synthetic_test_data/index.html" >Synthetic Data for Computational Colour Constancy Experiments</a>

<h2> Appendix </h2>

<p> <a href = "Presentation.odp"> Project presentation </a> (this presentation contained a couple of errors in ΔE calculations. Furthermore, the scope of the
testing was later increased from 5 training set sizes to 10 and from 15 trials to 50, so the presentation is somewhat outdated and the information on this 
page should be preferred). </p>

<p> <a href = "project_code.zip" > Source code </a> (requires the ISET functions <em>ieNotDefined.m,
lab2xyz.m</em> and <em>vcXYZ2lab.m</em>). </p> 
                                       

</body>
</html>